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ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 114 0 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 120 0 

ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 12 6 0 

TAAAAAATAC TATCTGACAT TTAArGn CCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 



10 

(2) INFORMATION FOR SEQ ID NO: 24 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1489 base pairs 

(B) TYPE: nucleic acid 

'5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 240; 

TCAATATGTA AATCGGCGAT TGTCCCTATC tTCATACCCT TACCTTCTTT ACTCAAATTA 6 0 

AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 12 0 

25 GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 18 0 

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 24 0 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 3 00 

30 TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 36 0 

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 42 0 

ACAATAT CTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 480 

35 TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 54 0 

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600 

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 66 0 

40 

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 72 0 

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780 

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 84 0 

45 

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 9 00 

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 960 

5Q GAACTTATAG GTGTAaCTGC TGGTGAAa CG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 102 0 

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCG GTGGTCTATT AGTAATTATG 10 80 
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ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtXlTCGTTT 34 80 

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 3 540 

TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 360 0 

TAAACAATCG GATTTAGTAC AG CAT AT AAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 372 0 

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3 775 



(2) INFORMATION FOR SEQ ID NO: 23 9: 

(i) SEQUENCE CHARACTERISTICS: 
is (A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9: 
AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 60 

25 CTTTTATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG TGACGTATTT 120 

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC GCTTTCTATT 180 

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT TTTATACTTT 240 

30 GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 3 00 

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 3 60 

GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT TGTACATATT 4 20 

35 ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 4 80 

GCAGTATTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 540 

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600 

40 

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 66 0 

TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 720 

ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 780 

45 

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT TCCTGTAATG 84 0 

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 90 0 

50 GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960 

GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT GTTAACATTA 1020 
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15 
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AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT 16 80 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 174 0 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT 18 00 

TCGGTAGC CG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 186 0 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 192 0 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 19 8 0 

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 204 0 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT AT AC CATTAC 2160 

AACGCGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 22 20 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 22 80 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 23 4 0 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 24 0 0 

25 ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 24 6 0 

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 2520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2 58 0 

30 AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 2 64 0 

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2 7 00 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 27 60 

TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2 820 

TTGGGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2 8 80 

CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 2 54 0 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3 000 

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3 06 0 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 312 0 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 3180 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 32 4 0 

TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3 3 00 

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3 36 0 
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CO STRANDEDNESS : double 
(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60 

10 ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120 

TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA ACATGCAATG 180 

GG tACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA AGGTGCCACT 24 0 

15 GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATG GACACAAAAA 3 00 

CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG TCAAGAACCT 360 

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC GTCAAAAGTT 420 

20 AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC TACTGATACA 4 80 

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 540 

GCTGAAGATT TACTTAAAAA GTACGATGAT AAAGTAGCTG CATTCCAAAA AGATGCAAAA 600 

25 

GCAAAGTATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 660 

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATTCAAACGT 720 

AATAAAGACT TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT T ACAT CT AAA 780 

30 

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGCG 840 

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 900 

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960 

35 

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATTCAAAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080 

40 ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 1140 

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 1200 

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 1260 

45 GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 1320 

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 13 80 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 144 0 

50 

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 15 00 

ATTTTGG CAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560 
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AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA 4 8 00 

CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 4 860 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA 4 92 0 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 4 9 80 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 504 0 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 5100 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAACGACA AAATATCGAA 5160 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 5220 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTTAACT 52 8 0 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 534 0 

AATACGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 54 00 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 5460 

GAAAAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 552 0 

25 TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 558 0 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GAT ATG CATC CCGAAAGTTA GACTAAAAAA 564 0 

CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 5700 

AGTT CAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 576 0 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 5880 

AG CT CAT AAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 594 0 

CCAAQCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 606 0 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 6120 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 6180 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 624 0 

45 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 63 0 0 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 634 6 
50 (2) INFORMATION FOR SEQ ID NO: 23 8: 
(i) SEQUENCE CHARACTERISTICS ■ 
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AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3 00 0 

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTCAGGC 3 060 

5 AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 3240 

10 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3 3 00 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3 360 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

15 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 34 8 0 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 3 54 0 

20 TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3 600 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 36 60 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 372 0 

25 ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3 780 

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3 84 0 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

30 TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3 96 0 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4 02 0 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4 080 

35 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 414 0 

CGGTAGACAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 42 6 0 

40 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 432 0 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4380 

45 CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 444 0 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4560 

50 AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4620 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 46 80 
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GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 12 00 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 12 60 

ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG TACAACCATC 13 20 

ATTTTTTATT ATGAGXAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGG CATTTTT 13 80 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 14 4 0 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 1500 

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 156 0 

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 1620 

TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGATGCAACG 16 8 0 

ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 174 0 

GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AATTAGGACA 18 00 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA 186 0 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 192 0 

25 ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 198 0 

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 204 0 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 210 0 

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 216 0 

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 222 0 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 22 8 0 

ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 234 0 

TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2400 

ATACACACCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 24 6 0 

TAACTGCAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 2 52 0 

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2 58 0 

45 TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 2 64 0 

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2 70 0 

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 276 0 

50 TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2 820 

TATCCAAACA GAATACCCAG TGCCACACTT AOH^TGGAAT AATTTATT^A nT^.r:^^- - 
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TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 72 60 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 73 2 0 

ACATTACGCG CCGTATGAAT TTCATCATTA AaTACAACCA TCACGCCTTT ATGACGGGCC 73 8 0 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 744 0 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 74 81 



10 

(2) INFORMATION FOR SEQ ID NO: 23 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

2Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 60 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120 

25 CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATGCTATAT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 24 0 

TGGCACAGAA ACGATACCTA AAGTTGATAA GATTGTAGGT CCAGGCAATC AATTTGTTGC 300 

30 ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG GAC CAACAG A 36 0 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 42 0 

ACAAGCAGAA CATGATGAAT TAGCACGTAC ATATGCCATT AGTGAAGATG CGCAAGTCCT 480 

35 

TAAAGATTTA GAATCACGTA TTGCTAAAGC ATTGCCTAAT GTGGACAGAT ACGACATTGT 54 0 

TTCTAAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 6 60 

40 

ATATATTGAA AAAGTGAAAT ATGTGGGTGC ATTGTTTATT GGACATTATT CGCCAGAGGT 720 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 7 80 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 84 0 

45 

AGATACGTTT GAACAAATTG CTGATTCAGC ACAACATATT GCTCATGTTG AAG CATT AT A 90 0 

CAATCACCAG CAGTCTATTT TAATACGTCA GTCTTAGGGG AGTGTAATTG AAATGATTTA 96 0 

50 TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAACAATGA CGTCTATTAT 102 0 

TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAAGCTTA 1080 
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TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 54 60 

GGTCTTGTCT CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC AAAAAGATAA 552 0 

AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 558 0 

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 564 0 

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 570 0 

TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 5 76 0 

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 5 82 0 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 58 80 

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 594 0 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 

CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6 0 60 

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120 

CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180 

25 CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 624 0 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 63 00 

CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 63 60 

ACATGGAATT TAATG CTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 64 20 

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 64 80 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 6 54 0 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 66 00 

TTGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 

TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 

CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6 7 80 

GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6 84 0 

45 GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6 9 00 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6 96 0 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 702 0 

50 GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 70 8 0 

T^Tp^aarr: r "VT r 'AATJKA r '".RTA^^A^^ A^s T'"t , <" rn^^snr.A pi s - r - — -* 
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ATGTTGTGAC AATGATTCTA ATATAACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 3660 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC 372 0 

5 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3 780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3 84 0 

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3 900 

10 

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3 96 0 

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4 020 

15 CAAAGAAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4 080 

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 414 0 

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4200 

20 GAAGAGGAAT ATTATAATAA GAATGCTTTT GCAATGGATA AATCACATCC AGAACCAATC 4260 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 432 0 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4380 

25 GCTAACCAAT CAGAAACATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 444 0 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4 50 0 

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4 560 

30 

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4 620 

GATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4 6 80 

35 GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4 740 

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4 800 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4 860 

40 TT AG CG AT AT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4 920 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4 98 0 

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 504 0 

45 GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 5100 

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 516 0 

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220 

50 

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 5280 

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 534 0 
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AATGAACTAC ATAACTTGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT i86 0 

TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 192 0 

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA 198 0 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 2 04 0 

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAG CGGCT 210 0 

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 216 0 

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 222 0 

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 22 8 0 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTG CATGA 234 0 

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 24 00 

20 AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTfTA ACTGGAAGTG GAAAGAGTTT 24 60 

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATT AT CT CACCGTTAAT 252 0 

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 25 8 0 

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 2 64 0 

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 270 0 

TATGATAGAC TTTGG CATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 276 0 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAG CATTTT A AAGAAG CGG T 2 82 0 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 2880 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 2 94 0 

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3 00 0 

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 306 0 

TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 312 0 

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 318 0 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 3 24 0 

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 33 GO 

AATTAGTTTA -pTCCAACCGG ACGATAAATA TATTTTAGAA ACGTTATTAT TTGCAGATAT 3 3 60 

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 34 2 0 

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 34 8 0 
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TGAGTGATAG AATCAAAAAA GC CAT CTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 6 0 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 12 0 

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 180 

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 24 0 

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 300 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 3 60 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 420 

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 4 80 

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 54 0 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 

20 AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 660 

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720 

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 7 80 

25 ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 84 0 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900 

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 1020 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 114 0 

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200 

CATATATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 12 60 

CGATTGTATA TTTTGCCAAT GTCTTCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320 

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AGAGTGAGGG 13 BO 

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 144 0 

45 AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500 

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560 

CTATTTtTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620 

50 

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680 

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 174 0 
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AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA AAACTTTGTA AGTAAAGCAC 2160 

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 2220 

5 

GTGTGAAGGC AAGAATTGAA CAATTAAAAG C AT AGG AG TT AGTAACAATG AATTACCTAG 22 80 

AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 2 34 0 

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATTCATGTAG 2400 

w 

GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 24 60 

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC ATTTAATGAA CG AATTAGTC 2 520 

J5 TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT ATTAAACCAG 2 58 0 

TAAGTGAAAT GATGGAACGT GAAACAGATT TAGGTGTTGC AACTGAATTC GAAATAATCA 264 0 

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 270 0 

pn CTGGATTGGG TA.TAAAGAAC GATTCGACAA ATGTCTTTAC AC CGGTTTTA TCAAT CTTAA 2 76 0 

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 2 820 

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 2 88 0 

25 CATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACATGC AAAGCCAATT GAATTAGATA 2 94 0 

GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA CCGTTATAAA GATTATGAAT 300 0 

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 3 060 

30 

TAACAG CTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AATAAGATGA 312 0 

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180 

TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAATT GATACAATTA 324 0 

35 

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3 3 00 

TTAACGAGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3 3 60 

40 ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 34 2 0 

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGTGATACA TTAGTAATTA 34 8 0 

CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 3 516 
J5 (2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7481 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT 3 60 

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420 

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATGAC CCTAATGATT TTGAAATTGG 4 80 

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540 

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGCTAG TTAAAGATTT 600 

AAAAGAACAA GATTTAGTTA TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA 660 

ACGATCTGGC GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720 

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA 780 

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 84 0 

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900 

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960 

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020 

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080 

TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 114 0 

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 12 00 

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 126 0 

30 CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 1320 

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 13 80 

CGGTGAAGAC TTTAAAGTTG AAGATATCGA TTTAAGTGGT AACTTATCAT TAGCAGATAA 144 0 

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500 

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560 

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620 

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680 

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 174 0 

TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800 

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCCATTGTC 1860 

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920 

50 AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 1980 

AAAwT CCTGA GAAAGCAATG ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTACCATTAG 204 0 
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TTCATGGGCC ATTAAATGCT CTTATTAA7T ATGATTATGT ACATACTATG CAACAGGCCA 3 66 0 

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3 72 0 

TAGGTTCTTC AT CATACG AT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 3 78 0 

AGGAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 384 0 

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900 

ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3 96 0 

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4 02 0 

GATTAAACAA ATTAGAGAGG G AATTTG AG C CGGCAAGCTC AGGATATGTG ATGCATTTAG 4 08 0 

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 414 0 

ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 42 0 0 

TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 4 26 0 

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4 32 0 

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 43 80 

25 TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4 44 0 

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4 50 0 

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4 56 0 

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4 62 0 

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4 68 0 

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4 721 
(2) INFORMATION FOR SEQ ID NO: 235: 

f~ Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 
AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180 
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ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CGTATACAAT 1860 

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 192 0 

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 198 0 

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 204 0 

AACAAATAAC TATTATACTG GTGGTT CAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100 

TCATGTGACT ACAACTGCAG CGCCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 216 0 

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTG ATCGTGTTGG 2 22 0 

TGGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 22 80 

TGGCTATACA GTGAACAATA CACCAAAAGT TGGTGCTATC ATGCAAACAA CACAAGGCTA 234 0 

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 24 00 

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 24 6 0 

AGGTTCATAT AATTTCATTC ATTAATCAAA TGTAAATCAA ATGACGTCAA TATTCTCTAA 2520 

CATGAGAGTA TTGGCGTTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580 

TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 264 0 

TTCAATTTAG ATGAAAACCA TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2700 

TTGGTTAATT GTTCAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2760 

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 2820 

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 2 88 0 

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 2 94 0 

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCG ATACTTCCAG TCATCATATT 3000 

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATGA TAGCTAATGT 3 060 

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120 

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG 3180 

GGATTTCTGC AG CAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 324 0 

ATAATCATAT AGGAGGCAAA GTGAATCGTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 33 00 

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360 

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 3420 

ATGGAACGAC TATCGATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCGATAT 34 80 

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATACAAGA CGAATCGATC 3 54 0 
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GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 60 

TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 120 

GAT AG TG ATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 180 

ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 24 0 

AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 3 00 

GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 3 60 

ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 4 20 

TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 4 80 

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 54 0 

AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTTAATGG CTGAAATTTC 6 00 

GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660 

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720 

CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 780 

25 GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 84 0 

TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900 

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATAC CAAATG CACAAATTGA 96 0 

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 1020 

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 108 0 

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 114 0 

TAGAGTACCT AGAGATTATG AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 120 0 

GCGCTcTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 12 6 0 

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTG ATTTGATTGT 13 2 0 

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 1380 

TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 14 4 0 

45 TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 150 0 

GTATATTACA TTCAATGAAG CTTTATTAGG AACAGATTAC ATTATGATAA CAAAGCCCGC 156 0 

AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 16 2 0 

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 16 80 
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TAAGTATAAT GAATAATATT AGAATTCATG CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100 

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160 

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220 

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280 

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 534 0 

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 5400 

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CAT CATC CAT 5460 

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 552 0 

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580 

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 564 0 

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 570 0 

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760 

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820 

25 ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880 

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 594 0 

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000 

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6 060 

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120 

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 618 0 

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 624 0 

GAAATTCTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCTAATTACT 63 00 

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 63 60 

TTTTTTATAA C CAT GG AT AT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 6420 

TTGTACTTnT TCCGCTTCTG GATT 6444 
45 (2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

50 (D) TOPOLOGY: linear 
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GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3 3 00 

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAG CTTCT 33 60 

5 

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 342 0 

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 34 8 0 

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3 54 0 

10 

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3 6 00 

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 36 6 0 

15 ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720 

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATTTT TATCTTCTAA AGTACCATTT 3780 

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 384 0 

20 ATAACGATTT CACfJATCTTP GTATCTCAAT GTACCCACAG GTGAAATAAC TCCTCCAGTA 2000 

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3 96 0 

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAGTTTTTAG CTAATTCGAT AATAGATTTA 4 02 0 

25 

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4 0 80 

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 414 0 

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4 2 00 

30 

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC AT ATT CAT CT 42 6 0 

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 43 2 0 

35 AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 43 8 0 

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 444 0 

ATATCAACTA ATTGTTTTAG CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4 500 

40 AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACGGAA AAGTGCAACT 4 56 0 

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4 62 0 

TGTGCAGCAG GTGAAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4 6 80 

45 

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 4 74 0 

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4 8 00 

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4 860 

50 

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4 92 0 



EP0 786 519 A2 



10 



15 



25 



CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 150 0 

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 1560 

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620 

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680 

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 174 0 

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800 

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860 

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920 

TTAATCGATA CnACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACGA AGATATATTA 1980 

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040 

20 TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA sGCAAGCACT 2100 

TG CAG AT AAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160 

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220 

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280 

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340 

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 24 00 

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 2460 

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520 

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2580 

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 264 0 

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700 

40 GTTTATATTC AATGTATATT CCATAGATAT TATCGATTAT TTTATCAATT CTATCGAATA 27 6 0 

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2 820 

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 288 0 

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 2 94 0 

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3 000 

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3060 

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120 

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACA CC TTCTACCCAA 3180 
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(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 6444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 6 0 

75 TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATATT 120 

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 18 0 

GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 24 0 

20 

GTTTTCAATA GCATCATAAT TATCAAuAA^ l^luAAuUTA CCATTATCTA TGCGTCTTTG 3 00 

AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 3 60 

GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 42 0 

25 

ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 4 80 

GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 54 0 

3q ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 600 

TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 66 0 

TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 72 0 

35 TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 78 0 

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 84 0 

TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 9 00 

A{) 

AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 960 

ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA 102C 

CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 10 8 0 

45 

CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGC CAA 114 0 

CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 12 00 

so TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 12 6 0 

GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 13 20 
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ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3 060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

5 CACACTATAG ATTtCTGATG ATAACGTTTC CCTGACTGT 3159 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
io (A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 6 0 

20 GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120 

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyAGAA ATTAACAACA AAATCAACAA 180 
TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 24 0 

25 CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA 300 

TCAAAAGTAT TGCAGCATTG ACGATTTTAA CATCAGTGAC TGGCGTCGGC ACAACAGTGG 36 0 

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420 

30 

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 4 80 

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 54 0 

TAGCGCATCC CAATGGTGAA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC 6 00 

35 

GTTATT CAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660 
AAAAtAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720 

40 ATGAACG CAT TTCAATTGTT GGCTATCCAG AACCATATAT AAATAAATTT CAAATGTATG 780 
AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 84 0 

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900 

45 TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 96 0 

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020 
TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 1080 

50 

TGATTAAAAA AAATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 114 0 

GCAGAAAAAT GaATAAAAAT ATAATCATCA AAAGTATTGC AGCATTGGAC GATTTTTAAC 1200 
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TTGTAGTTCA ATCTCGCTTT TTTGATCATT TTCAAACAAA TcAAATGATG CyTGTTCAAA 126 0 

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACTT aAGTTTAAAT TTTCTTGATT 13 20 

AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT TCACCAGCAG 13 80 

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA GGTAAATCAG 14 4 0 

CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500 

GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA GACTTGGTAA 156 0 

TGCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT TAGCATGTGA 162 0 

TGTTTCAGCT ACATACTCTA TCATTGCCTG CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 16 80 

ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGCATAAG TTAATG CCTT 174 0 

TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT CATCTGCCGC 18 00 

20 ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC AAGGGACATA 1SS0 

AGCTCCCATT TGGGCCATTA TACTAATTAT GGCAACTTGT CTCATATATG TCGATTTACC 1920 

AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC GACAATTATT 198 0 

AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG ATT C CACTAA 2 04 0 

TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100 

TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT GTAAACGTTC 216 0 

AGTATATTTT TTAACTT CTT CACGTAGCTG AACAAATAAT TGATATTCTA ATTCGATGGC 222 0 

TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG TTATAAAACG 22 80 

TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTGGTT CAAAATTTTG 234 0 

CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TT ATT AAA GC TTATTTTCAA 24 00 

TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 24 6 0 

GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCGTATTA AAACCAACTT TGAATAGTCC 252 0 

GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA TATCAAGTAA 2 58 0 

ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA TAGAATTTAG 2 64 0 

TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA AATCTCTCGC 2 70 0 

ATTAACATTT CCGTAACTAA CACGCCCAAC AAGACGTTCA ATATCATACA CTTGATTAAG 2 76 0 

ATATGTTCTT AAGGTGTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA CGATATCTAA 28 2 0 

TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2 88 0 
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CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 192 0 

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980 
5 TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 2 04 0 

TTAttATTGA AATCgATACT AATAtCTccC TG 2072 
(2) INFORMATION FOR SEQ ID NO: 231: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

20 CGTCTTCTCT TGGTTATTCT CTGTGTTTTG TCTTTGTTCA AATTCGATTT TTTGTTGTTC 60 

GAATGAATGT AATACTTTAT TTTTTTTTGG CACATAATCC AAGTnATTTT TAGGAATTAA 120 

TATACGGTCT TTAAATGCTT CTTGTATTTT GCTCACAATC AATTGGTATA GTTGCTCTTC 180 

TTTTGATAAA CGCACTTCTA GTTTTGTTGG ATGAACATTT ACGTCTACTA AGATTGGATC 24 0 

CATTTCAATA TTAATATAAC AAATCGGGAA CCTACCTATT GTTAAGAGTG TATGATAGCC 3 00 

TTCTAAAATC GCTTTATTTA GCATAAAGTT TTTAATGTAT CGTCCATTAA TAAAAATAGA 3 60 

AATATAGTGC TTATTACTTC TAGAATGTTC AGGCTTTGCA ACAAAACCTT CAATGTGATA 420 

ATCACTTGTA TCTCCAGATA TATGTACTAA ATCTCGTGCA ACTTTCATCC CATAAATCTC 4 80 

TGCCATCACT TCATTAGTTC GTCCTGAACC ATTTGTACTT AACATTGTTT TGCCATCTGA 54 0 

AATGAGTGCT ATTCGAATGT CCGGATGGCT CATTGCCATT CTGTTGACAA TATCTGTTAT 600 

TTTACCTAGT TCAGTGTATA AACTTTTAAT ATATTTTAAA CGTGCTGGTG TATTATAAAA 660 

40 TAATGATTCT ACAAGTATAT CTGTTCCTTT TTTCGCTTTT GCAGGCTTAT GATTTAATAT 72 0 

TTCACCATTT TCTACATATA TTTCATTTCC ATTAGCATTA TCCGTGCAAG TCTTCAATGT 780 

TACTTTAGCA ACTGATGAAA TACTGGCTAG TGCTTCACCA CGGAATCCTA ATGTCCTAAT 84 0 

ATGAAATAAA TCTTCATCTT GAT CTAATTT ACTAGTCGCA TGTCTATGAA ATACTAATCC 900 

TAAGTCTTCC GCTTCAATTC CGCTTCCATT ATCGACTACG CGAATAGATT GGACGCCAGA 960 

tTCCTCTACT TCAATGCTTA TTTCTGTAGC GCCTGCATCT ATAGCATTTT CCAACAGTTC 102 0 

TTTCACAACA GAACTCGGTC TTT CAACT AC TTCACCTGCT GCGATTTTAT TTGCTAATGA 1080 

GGTTTGGAGT TCTTTAATTT TCCCCATTTT GCAACACCTC TATTTTAATT GATTTTGTAA 114 0 
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TCTTTTTAAA AGGTACTAAT ATTTCTTTAG TGAAAATTGA ATCACGGTCG TTTATTGGTG 120 
CCTTGAGTAT ATTATTATAG ACGGAATCTG ATCTAATAAT ATTGATTTTA TACATGATAA 180 

5 

ACCTCCTTAT GTTGTCAGCA TAAAGGATAA CGTAACGTGA TTTTCAAGCA GTAATTGTAA 24 0 

CTAATTGAmA AAAATTAAGA AAAGTATGTG AGTGTTCCTA AwTAATATGa TTAAAATGAT 3 00 

GGCGAATAAG TGTCTaAAAG CATCTTAAAG GGACATTGTA TAGGGTAAAT CACTTCATAA 3 60 

W 

ATAAGGGaAA ATCCTTATGT TCACTTTTTC ACAATCATnA TAAAATATAT ATGTAGTCAA 420 

TACTTTGTCT ATATTGAATG TTTTCATATA AATGAAAGCA TTTTTAAATA ACATTGACCT 4 80 

15 CTAATATATA GGCAGAGTAT TGATATCTAT TAAAAAATAA ATGATTTTGA TGAAGGTGAA 54 0 

ACGTATGTAC AAAACAAAAG GTGGCTTTCA ACTTACATTA CAAACATTAA GTTTAGTGGT 6 00 

TGGGTTTATG GCTTGGAGTA TAATTGCGCC ATTAATGCCC TTTATTAAAC AAGATGTCAA 6 60 

20 TGTTACTGAA GGTCAAATAT C A ATC ATTTT AG CGA.TAC CA GTTATTTTGG GATCCGTC CT 720 

CCGTGTGCCA TTTGGTTATT TAACAAACAT TGTTGGCGCT AAATGGGTAT TCTTTACTAG 78 0 

TTTTATCGTA TTGTTATTCC CGATATTTTT CTTAAGCCAA GCACAAACAC CGGGTATGTT 84 0 

25 

AATGGCTTCA GGATTTTTCC TTGGTGTAGG TGGTGCAATT TTCTCAGTTG GTGTTACATC 9 00 

AGTTCCTAAA TATTTCCCTA AAGAAAAAGT AGGTCTAGCA AATGGTATTT ATGGTATGGG 96 0 

AAATATCGGT ACAGCAGTTT CTTCATTTTT AGCACCACCG ATAG CGGGTA TTATTGGTTG 102 0 

30 

G CAAACAAC A GTTAGAAGTT AC7TAATTAT TATCGCTTTA TTTGCATTAA TTATGTTCAT 1U80 

TTTTGGTGAC ACACAAGAAC GTAAAATTAA AGTACCATTA ATGGCtCAAA TGAAAmCATT 1140 

35 ATCTAAAAAC TACAAATTAT ATTACTTAAG TTATTGGTAT TTTATTACTT TTGGTGCTTT 1200 

TGTAGCATTT GGTATTTTCT TACCTAACTA CTTAGTAAAT CATTTTGGAA TTGACAAAGT 1260 

AGAT6CTGGT ATTCGATCAG GTGTATTCAT TGCGCTGGCA ACATTCTTAA GACCAATAGG 13 20 

to TGGCATTTTA GGTGATAAAT TTAATGCAGT TAAAGTATTG ATGATTGATT TTGTTGTTAT 13 80 

GATTATCGGT GCCATTATTT TAGGTATTTC AGACCATATC GCATTATTCA CTGTAGGCTG 144 0 

TTTAACAATA AGTATTTGTG CAGGTATTGG TAACGGCTTA ATCTTCAAAT TAGTACCATC 1500 

45 

ATACTTCTTA AATGAAGCGG GATCCGCAAA TGGTATCGTA TCAATGATGG GTGGTTTAGG 156 0 

AGGATTCTTC CCACCACTAG TAATCACGTA CGTAGCTAAT TTAACAGGAT CAAGTCATTT 162 0 

AGCATTTATT TTCTTAGCGG TATTnGGAnG TATTGCATTA TTTACCATGC GTCATTTATA 16 8 0 

50 

CCAAAAAGAA TATGGCTCAT TGAAaAACGG TTGATATGTA ATACATGCCA TTcATTTAGT 174 0 
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AACATTATTA AAATAAAACC CCTCTACTAC TATATGTAAC GAAGGGACAT GATTTCAAAA 4 44 0 

TAAAAT AC CT TTTTTATAAA TnTATTATAA TATCCCCCAC TATACnAC 44 88 

5 (2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

w (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

TATGGCGCCA TATTAGTTGT AACTGGTTTA AGAGGTCcAa GAAaTATCaA ATAAAGTTGT 60 

tCCTGGGCTT GGTACTGTTA TCTCaATATT GmwTGCaTTT GGTGGTCTAG CTTTTAATAT 120 

20 TGGTAATATT GCTGGTGCCG GTTTAGGTTT AAATGCAATT TTTGGATTAG ATGTAAAATG 130 

GGGCGCAGCT ATTACTGCAA TCTTTGCAAT ATTAATCTTT GTAAGTAAAA GTGGCCAAAA 24 0 

AATTATGGAC GTTGTTTCAA TGATTCTTGG TATTGTGATG ATTTTAGTTG TGGCATATGT 3 00 

GATGTTTGTT TCTAATCCAC CTTATGGTGA TGCTTTTGTG CATACATTTG CGCCAGAACA 3 60 

TCCAATGAAA TTAGTCTTGC CCATCATTAC GTTAGTTGGT GGAAcTGTar GTGGTTATAT 420 

TACCTTTGCA GGTGCACATC GTATATTAGA CTCTGGCATT AAAGGTAAGC AATATTTACC 4 80 

ATTTGTAAAT CAATCAGCAA TTGCTGGTAT TTTAACTACA GGTATTATGA GAACGTTACT 54 0 

ATTCCTAGCG GTATTAGGAG TTGTTGTAAC AGGTGTGACA CTAAGTTCTG AAAATCCACC 6 00 

AGCGTCAGTT TTTGAACACG CAATTGGACC AATTGGAAAG AATATTTTTG GTATTGTGTT 66 0 

ATTTGCTGCA GCTATGTCAT CAGTAATTGG CTCAGCATAC ACAAGCGCAA CATTTTTAAA 72 0 

AACACTTCAT AAATCACTTA ACGAAAGAAG TAATTTAATT GTGATTGTGT TTATCGTTAT 78 0 

40 TTCAACAATG ATTTTCTTAT TTATTGGAAA ACCAATCAGC CTTTTAATTA TAGCAGGCGC 84 0 

GATAAA 84 6 
(2) INFORMATION FOR SEQ ID NO: 23 0: 



30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



55 
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ACTATGCATG GTCTTTTTAA TCAACTTAAA CTCGGCATTA TTTCAATCGA AAACGCAGAG 264 0 

CATACGCTTT TTACACCTTA TATGTTGGAA ACGCTCTCTT CCCTAGGCGT GAAAGACAGC 270 0 

ATTGTCGATT TAATTCATAA AGGGACTGAA TTAGAAGACT TTGCGGCATT TAATTTATCA 276 0 

ATTGAAGACA CAGTTACAGT CTGTTTACAA AGAACTGAAG AACTATTAAA ACAATACAAA 2 820 

AATGTGGAAT TCAATGACAA AATATTAATC AATTGGCGTA TTATACAAGA GAAATAGACA 2 8 80 

TATAAAAGTC GAATGTAACt ACGTGAGTAT TGATTTTATT CTTTGTAAAT TACAAGCATT 2 94 0 

TCATATTATA AAGTTTGAAA AGAGGTATAT TGAAATGGAG AAAAATGAAT AT AT AG CT AA 3 000 

ATATAATGAA TATAGTCAAT TATTAGACGC TACATACTCG CAAGCTGTAG CATmCCTTTT 3 06 0 

AAGtAAATaT GGCGCTGTAA CCGATGATTA TTATAAAGaA AAATCATACA CGCGATTTTT 3120 

AAAtGGAGnA ATCAAAAGTA TTTCAAAAGG AAAATACACT AGAGCTAGCG AaGGATTATA 3180 

TTG C CAT CAT ATAAGCGAGG ACAAATTCCA AAATCTATCT GATCTAAGAT TCATl.TCCAA 3 24 0 



ATTTAAGTAC TCATACGACG TTCAAAAGAA AGAAAACTTA GTGTACTGTG ATCTAATCGA 3 300 

GCATTTAATT TTACATGCAA TTATTACAAA AGAATCCCAT GGCCAATTTG GTGTAGCTGG 3360 

ATTATGT CAA ATGATCAAAC CAACAGTCAT TGATTGGTAC ATTGGCGAAT ATAATC CAAA 34 20 

ACCAGCATGG ATGCAAGCCA CCAAAGCACG TGCCTATTTG CCTGGAATAT TAGTAGAGAA 34 80 

ATTACTCATT AAAATTGACG ATATGTTAAA AGGAATAGAA ATATAAGATT TCCTTGAGTC 3 54 0 

TAGATAAATG ATTAATGTAG ATTTATTTTT TGCTGTTGAG ATTTTGTTAT AGATGTTTAA 3 600 

ACCTGTAATT AAATATATTT TATAAAATAG ACCACGCATA CCTATCTATA AACGGrCAAT 3 6 60 

GTTTATAAAT GAGTTTGCAT GGtCTTGAAT TGTATTAAAT TTCTTTTGGT TTTAATAAAT 3 72 0 

CGACTAGATT TTCACAATAT TTATCAAATA TGTATTCCTA AATTATACAG CCTTAATCCA 3 780 

GCACjCTACTT TCGAAACTTC CAACTTAGTT GATATAAGGT TCAATAGTTT GTTTCGTTCT 3 84 0 

40 TTTTCAGATA AACCAGAACT TAAATTGATA TTATTGACTT CATAAAAATT ATAGACTAAT 3 900 

GCCTCTATTT GCTTTTTAGG CATAAGTAAG TCGACTGAAA ACTGATTTAC GTCGCTTTCA 3 96 0 

TAAATCATTT CATGTAAATT CTTTAGACTA TTATCGTTAC TATCTCTCAT TAAGTCTGTA 4 0 20 

TTTTTAAATA AATAACGGCC CAATTCACGA GCTATTGAAA ATCTTGTATT ATTAATCGAG 4 080 

TGATTATTAT TAATATAGAT TGTTCTTCCA CTTAAATAAC CCGAAGTATT ACCCTCCATT 414 0 

TTAATATATC TAACATTTAA ATTAAGTTGA AATAATAGCT TGTCTATGTC AAT AG CAAAG 4 200 

TGTTCAGAAG TAATAAAAAG TTGATCCATT TTGTCCTTTA TAAATGCCTG AAATAATCGA 4 26 0 
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GGGAGATTTT TTAGGCATGA GCAATCAATT CAAAAGCGAA GAAGAGCGAA GACAATGGGA 84 0 

ACAATTCCAA GCTTTCCAAA ATCAACAAAA CCAACAGAAC CAGCAATACG GACAAAAGAA 900 

5 ATCTAAAAAA GGATGGTTCT GGGGCTGTGG TGGTTGTCTA GTATTATTTA TTTTAATTAT 96 0 

CATCGGTATT TCAGCTTGTA CAGCTGGTAT TACAGGTAAC CTTGGCGGAA ATAGTTCTAA 102 0 

AGAAACGAAC AAAACCCATA AAATCGGTGa AACTGTTAAA AATGGCGACC TTGAAGTCAC 108 0 

TGTAAATTCA GTGGAAACTA TGAAATCTGT AGGACCATCT CTTGCACCAA CAAACGCTAA 114 0 

AGGTATATTT GTCGTTGCTG ATGTGACGAT TAAAAACAAA GGTAAAGAAG CGTTAACAAT 1200 

TGATAGTTCA ATGTTTAAGC TAAAATCCGG TGATAAAACA TTTGAAGCAG ATAATACAGG 1260 

15 

TTCAATGTCT GCTAATCAAA GTGACAATGG TAGTATAGAA AATTCATTTT TCTTACAGCG 1320 

TATAAATCCA GATAGCACTG CTCAAGGTAA AATTGTTTcG ATGTGTCAGA AAACATAGCC 13 80 

20 AACGCAAAaG ATAAAAAATT AGAAGTTATT TCTAGTTTAT TTAGCGTCAA GAAGATTACA 144 0 

TTTGATTTAT CCGATGCTAA AAAAACATCA AAAGCTAAAA AAGACAAGCA AGATACAGAA 1500 

GTAGCTGTTG CGAGTTCAAA TAGCGATAAT GTAAGTTATG AAGCTTCGGC TACTACACCT 1560 

25 GcTACAACTT CTAGTGCGGA TACTGATTCT GAAGATAGCG AAAAGTCTAG TAAAGATGAG 1620 

GATAAGCAGA ATGCGTCTAA AAgTGATAAA TCTAGTGTAG AAAAAAGTGA ATCTAATGAG 16 8 0 

GAAACTGCTC CTGTAGAGCC CATGCCCCAT AGCAAACCTA CCACTAGTGA aGCACCACCT 174 0 

30 

AGCCAAAATA TTCACAaTGa AGATAGCmTG TACGACGCTT CAACAGAATA AAATtnyCAG 18 0 0 

tAGCTCGGCT ACCCTTCTTT TACGGAAAAA TTAATTATAC ATAATCaAAT CaAGGAGATA 18 6 0 

AAAAAATGAA ATTCAAAGCT ATCGTTGCAA TCACATTATC aTTGTCACTA TTAACTGCCT 192 0 

35 

GTGGTGCTAA TCAACATAAA GAAAATAGTA GTAAATCAAA TGACACTAAT AAAAAGACGC 1980 

AACAAACTGA CAACACTACA CAGTCAAATA CAGAAAAGCA AATGACACCA CAAGAAGCCG 204 0 

40 AAGATATAGT TCGAAACGAT TACAAAGCAA GAGGCGTTAA TGAATATCAA ACATTAAATT 2100 

ATAAAACAAA TCTTGAACGA AGCAATGAAC ATGAATATTA TGTTGAACAT CTAGTCCGCG 2160 

ATGCAGTTGG CACACCATTA AAACGTTGTG CTATTGTTAA TCGACACAAT GGCACAATTA 2220 

45 TTAATATTTT TGATGATATG TCAGAAAAAG ACAAAGAAGA ATTTGAAGCA TTTAAAAAGA 2280 

GAAGCCCTAA ATACAATCCA GGTATGAACA ATCATGATGA AACAGATGGT GAGTCAGAAG 2340 

ACATTCAACA TCATGACATT GATAATAACA AAGCAATTCA AAATGACATA CCAGATCAAA 24 0 0 

50 

AAGTCGACGA TAAAAATGAT AAAAATGCTG TTAATAAAGA AGAAAAACAT GATAATGGGG 2460 

CAAATAATTC TGAAGAAACT AAAGTTAAAT AATGGCATAC TTTGATTAAT CGTAATTTTT 2 52 0 

55 
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CTTTTTACTC GTTCTATTAT TGGGATGTGT ATTAGTTTAT GTAGGATATC TTTATTTTCA 5 820 

TAAAATACGT GGCCTTTTGG CGTTTTGGAT AGGCGCGCTA TTAATTGCAT TCACATTATT 5 88 0 

GTCTAATAAG TATACAATCA TCATCTTGTT CGTCTTTTTA TTATTACTTA TTGTGCGTTA 5 94 0 

TTTAATACAC AAGTTTAAAC CAAAAAAAGT AGTTGCGACG GATGAGGTTA TGACTTCACC 60 00 

ATCTTTTATT AAACAAAAGT GGTTTGGTGA GCAACGTACA CCAGTTTATG TATATAAGTG 6 06 0 

GGAAGATGTA CAAATTCAAC ATGGAATTGG CGACCTACAT ATTGACTTAA CAAAAGCTGC 612 0 

AAATATTAAG GAAAATAATA CCATTGTTGT TAGACACATT TTAGGTAAAG TGCAGGTTAT 6180 

ATTGCCGGTT AATTACAATA TTAATTTACA TGTAGCTGCT TTTTATGGAA GTACTTACGT 624 0 

GAATGAAAAA TCATATAAAG TTGAAAATAA CAATATTCAT ATTGAAGAAA TGATGAAACC 63 00 

GGATAACTAT ACAGTTAATA TCTACGTATC AACGTTTATC GGAGACGTAG AGGTGATTyA 63 60 

2 0 TCGATGAAyC ACT 63 73 
(2) INFORMATION FOR SEQ ID NO: 228: 



15 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

ATAGnGAAAG CGTTTTACAC TTAATAACTC CCTCTTAAAT GCATCCAGGT TCTATGTAGT 6 0 

35 AAATCATGAA nATAACATAT AAATnTAGAG GAGATTTACC TTTGAATACA GAGAACAACA 12 0 

AGAATCAAAA CCAATCTGTT AAAAATTCTG AAAGaCGCGG CATGTTAAAA GGATGCGGCG 180 

GTTGCCTTAT TTCTTTTATT TTATTAATAA TCTTATTATC AGCCTGTTCA ATGATGTTTA 24 0 

40 GTAATAATGA CAATTCCACT AATAATCAAT CATCAAAAAC GCAATTAACT CAAAAAGATG 3 00 

AAAATAAAAA TGAAGATAAG CCTGAGGAAA AATCAGAAAC AGCAACAGAT GAGGATTTAC 3 60 

AATCAACCGA AGAAGTACCT GCAAATGAAA ATACTGAAAA TAATCAACAT GAAATTGATG 42 0 

45 

AAATAACAAC AAAAGATCAA TCAGACGATG ATATTAACAC ACCAAACGTT GCAGAAGATA 4 30 

AATCACAAGA CGACTTGAAA GATGATTTAA AAGAAAAGCA ACAATCAAGT AAC CAT CATC 54 0 

AATCCACGCA ACCTAAGACC TCACCATCAA CTGAAACAAA CACGCAACAA TCATTTGCTA 6 00 

ATTGTAAGCA ACTTAGACAA GTATATCCGA ATGGTGTCAC TGCCGATCAT CCAGCATATC 66 0 
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CAAG CATTTT TCAATTATAG TCCGGGGCCC CAACATAGAG AATTTCAAAA AAGAAATTCT 4 02 0 

ACAGGCAATG CAGGTTGGCG GGGCCCCAAC ACAGAAGCTG ACGAAAAGTC AGCTTACgAT 4 080 

AATGTGCAGG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCTACAATTT CTACAGGCAA 414 0 

TGCAAGTTGG GGTACAACGA TAAAGAAATA TTTTTTCTTT ATCACACTAT GTCTCACTCA 4200 

CTTTCCAAAA TACTAAAGTA ACATCTTTAG TATATCAAAG AATTTTTGCT ATAATAAGTT 426 0 

ATAATTATAT AAAAAAGGAA CGGGATAAAA TGATTGTAAA AACAGAAGAA GAATTACAAG 4 320 

CGTTAAAAGA AATTGGATAC ATATGCGCTA AAGTGCGCAA TACAATGCAA GCTGCAACCA 43 80 

AACCAGGTAT CACTACGAAA GAGCTTGATA ATATTGCGAA AGAGTTATTT GAAGAATACG 444 0 

GTGCTATTTC TGCGCCAATT CATGATGAAA ATTTTCCTGG TCAAACGTGT ATTAGTGTCA 4 500 

ATGAAGAGGT GGCACATGGG ATTCCAAGTA AGCGTGTCAT TCGTGAAGGA GATTTAGTAA 4 560 

ATATTGATGT ATCGGCTTTG AAGAATGGCT ATTATGCAGA TACAGGCATT TCATTTGTCG 4 62 0 

TTGGAGAATC AGATGATCCA ATGAAACAAA AAGTATGTGA CGTAGCAACG ATGGCATTTG 4680 

AGAATG CAAT TGCAAAAGTA AAACCGGGTA CTAAGTTAAG TAACATTGGT AAAGCGGTGC 474 0 

ATAATACAGC TAGACAAAAT GATTTGAAAG TCATTAAAAA CTTAACAGGT CATGGTGTTG 4 800 

GTTTATCATT ACATGAAGCA CCAGCACATG TACTTAATTA CTTTGATCCA AAAGACAAAA 4 860 

CATTATTAAC TGAAGGTATG GTATTAGCTA TTGAACCGTT TATCTCATCA AATGCATCAT 4 92 0 

TTGTTACAGA AGGTAAAAAT GAATGGGCTT TTGAAACGAG CGATAAAAGT TTTGTTGCTC 4 980 

AAATTGAGCA TACGGTTATC GTGACTAAGG ATGGTCCGAT TTTAACGACA AAGATTGAAG 504 0 

AAGAATAGTT CAACATATAC TAAGACTAAA GTATGAACAT CATTTAGTTC CGGAGCCTAT 5100 

TCATATTGGT TTCGGAACTG TTTTATAATA ATTAAGAACA CAATCAATGC GTCATTTCAA 516 0 

AAATATGTTG TAACAAAGTA GTTTTTAAGC AAACATATCA TCGACATCAA CGAAGATACA 5220 

40 TAGCGCATTT GGTATTTTAA AACTTATTAT AAAAGGTGAT AGTTATGAAC TATGTTGAAC 52 80 

GTTATATTGA ACAGTTTTTG AGAGCAACAG TAAGAAATAA TATCAAGCAC TACCTTTTAA 534 0 

TGCTAGATGA AAAAATGAAA AATTTAGATG ATTATATGCG TTATTTAATT ACTAAAAAAG 5400 

AACAACTTAG CAAGTTAATT GACAGTCTAA TGCTAACATT AGAAAATAAA TATATTGATA 54 6 0 

TTGCTGAAGC ATTTCAAATT CAATGTGCAA GAGAAATCAA TAATCAAGAA ATTGAAAATA 552 0 

TTAAATCAGA GTTGAATAAA GTTGAAGCAT ATTATGCACA AATTGAAACT CAAATTCAAC 558 0 

AAACTTCAAC TGAAAAAATA GCAACAGAAA AAACATCGTA TCTAATAAAT TATATGAACG 564 0 

CTGTGGCATA GAAAGGCGGC GAAACATGAC ACACAAATAT ATATCAACGC AAATGTTGAT 5700 
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TTATAGAACC TGATTTACAT TTTACAATTA TTGATTTTAA TCAAGAACTG CTTTGTATTT 222 0 

ATATTGATTT TGATTCTGGT TTAAGGCATT CAAACATGGC AACAGAATCT GGTATTTCAT 2280 

TAAGGATAAA TGTTGCTAAA TCAGATTTTA CTAAATTTAT TAATGAATTA GCCTCTTTAC 234 0 

ATTAATGATT TAAATCTGAT ATGTAATTAC AATCAAAAAA GACAGCCACA TCCCTCCGTA 24 00 

GTTTAGGCGT GTGGCTATAT TTGAGTCTGA ATATTTATGC TTGTAATTTT AAAAAGGGAC 24 60 

ATGCTATATA CGATAAAAAG AGGCGGGGAC ATAAATCAAT GTTCTATGCT CTACGAAGTT 2 520 

ATATTGGCAG TAGTTGACTG AACGAAAATG CGCTTGTAAC AAGCTTTTTT CAATTCTAGT 2 580 

CAGGGGCCCC AACAAAGAGA AATTGGATTC CCAATTTCTA CAGACAATGC AAGTTGGGGT 264 0 

GTGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA GGCAAAGCGA GTTGGGGTGG 27 00 

GACGACGAAA TAAATTTTAT GAAAATATCA TTTCTGTCCC ACTCCCATGG TGCCAATTAG 2760 

20 CATAAGGTAC TTAAATTAAG CATATCTGCT GTCTAGCAGT CGATAAATCA TTAGAACTTC 2 820 

GTATAGTATA TGACTTTTAA TTTGATTTTC ACCACTAATT TCAAGTGCTT TTATAGTCGA 2 8 80 

ACGTAAAGTT TCTACAGAAT CATCTTCTCT CTTAAAAGAA C CAT CAT AAA ATATATCTTT 2 94 0 

GATGCTACTA CTAATTTTTA GCAATGCCAT TTTTTCGTCA CCTGAAAAGT TAACACGAGT 3 000 

ATTTTTAGGC AAGTAAATGA TATTTGATAA ATGAGTGATA AACAAACGAT TCGTATATGC 3 06 0 

ACGTTTAGTT AATTGATTGA GTAATTTCCA ATCACATTCT TTTTT CTTAT GATAGCTTAA 312 0 

TTCATCACGT TGATAACTTA TTAACGTTTC AACTTGATTA TTTAAATTGA AAATATTTTT 3180 

ATATG CTTTT TCGCTTTTAT CAGATTGCAG TCTTGATAAG ATAAGTTCTT GGCAGCGATT 324 0 

GTAAAATAAT TTATACATCA AGGCATCTGT CTTACTTAAT TTTTCTTCGA CCTGACCATA 33 00 

ATACTTAGGT GGAAACACCA TGAAGTTAAT TAAACCTGAT GTCACGAGTC CAATAATTGC 33 60 

TGTCAATGTT CGAGACAAAA AGTTGAATAT GTAGGCATCA TGAATACCTG GAATCATAGC 3420 

40 TAATGATGTT AGTACAGCGA CATTCGTACC AACTTGCAAT TTGAGTTTTG TACAGAATAA 34 80 

AATCGTGAAC GTTGCACTCA ATGCATATGT AAAAGGTGAT TGATCGCCGA ATAAATATGT 3 54 0 

AAATAATACT GCAAAGCCTG CACCAATTAC CGTAGCAGGT AATCTACGAT AACCTTTAAT 3 6 00 

AAGTGATGCC TTGGCAGTTG GTTCAATTGT GACTACAC-CT GTTAAAATGG CATAGATGGG 3 56 3 

TGTTAAATCT AGTGCCATAC AAAAGACAGC TGTTAAAAAA ATGGCAATAC CAGTTTTAAT 3 72 0 

TGTTCTGGCA CCAATTAAAT GTTTATACCA TTGATCGTTC ATTTTTTAAC CTCTAATCAT 3 78 0 

CGTAAAATCT TAGCGAGCGC TTTATAATAA TAGTATCGTA CATTGGAAAA GTTCATGTAT 3 84 0 



25 



30 



35 



45 



50 



EP0 786 519 A2 



AAAATATGAA ATATCAAGTT TTGATGTGGC ACCGTTTTTA TATTTAAATA TCAATGATGA 42 0 

AAAATATGAT ATGAAAATTG CAGGTGACTT TAACGCTTAT AACGCGTTAC AGCATATACT 4 80 

5 

GTTTTAAGAG AGCTAGGGTT AAATGAACAA ACAATTAAAA ATGGCTTTGA AACGTATACA 54 0 

TCAGACAATG GTCGTATGCA GTACTTTAAA AAAGAACGAA AAGAAGCGAT GATCAATTTA 60 0 

GCTAAAAATC CTGCAGGAAT GAATGCAAGT TTATCAGTTG GTGAACAATT AGAAGGCGAA 660 

w 

AAAGTGTATG TTATTTCGCT AAATGATAAC GCTGCAGATG GTCGAGATAC TTCATGGATT 720 

TATGATGCAG ATTTTGAAAA ATTATCTAAG CAACAAATTG AAGCTATCAT CGTGACAGGT 78 0 

15 ACACGAGCAG AAGAACTTCA ATTGCGATTG AAGTTAGCAG AGGTTGAAGT ACCAATTATA 84 0 

GTTGAGCGTG ATATTTATAA AGCAACGGCA AAGACTATGG ATTATAAAGG TTTCACAGTT 900 

GCAATACCAA ACTATACATC ATTAGCGCCT ATGCTTGAAC AATTAAACCG TTCGTTTGAA 960 

20 GGAGGTCAAT CATAATATGC ATGAATTGAC TATTTATCAT TTTATGTCAG ATAAATTGAA 1020 

TTTATACAGT GATATAGGAA ATATTATTGC TTTAAGACAA CGTGCTAAAA AACGAAATAT 108 0 

TAAAGTTAAT GTCGTAGAAA TCAATGAAAC AGAAGGTATT ACCTTTGATG AATGTGATAT 114 0 

25 

TTT CTTTATC GGTGGTGGAA GTGATAGAGA ACAAGCATTA GCAACAAAAG AATTAAGTAA 12 00 

AATTAAGACA CCACTTAAAG AAGCGATTGA AGATGGTATG CCGGGATTAA CGATTTGTGG 1260 

AGGCTATCAA TTTTTAGGGA AAAAATATAT CACGCCTGAT GGTACAGAAT TAGAAGGGTT 132 0 

30 

AGGTATTTTA GATTTTTATA CTGAATCAAA GACAAACCGA TTAACAGGAG ATATTGTTAT 13 80 

CGAAAGTGAT ACTTTTGGAA CTATTGTAGG TTTTGAAAAT CACGGTGGTA GAACATATCA 144 0 

35 TGATTTCGGT ACACTTGGTC ATGTTACTTT TGGTTATGGT AATAATGATG AAGATAAAAA 1500 

AGAAGGCATT CATTATAAAA ATTTATTAGG TACTTATTTA CATGGACCAA TTTTACCTAA 1560 

AAATTACGAA ATCACTGATT ATCTGTTAGA AAAAGCTTGT GAACGTAAGG GTATTCCGTT 162 0 

40 TGAGCCTAAA GAAATAGATA ATGAAGCGGA AATACAAGCG AAACAAGTAT TAATAGACAG 168 0 

AG CAAAT AGA CAGAAGAAAT CTCGTTAACT CTGAACATCG CATCAATGGA TTTAATATTG 174 0 

ATAAACGATG AAGTTTAGTA ATTAATCATA TATGTATAAA CACACACATT ATTTTGGATG 1800 

45 

GAAACAACCA AATTGATGTG TGTTTTtTTG TTCTAGTGAA TAATTATTAT ACAATGAGTA 186 0 

TCTATCCTAG AATTATCAAT AGTAATGGTG ATTATGCAAC ATGAAAAAAT GAATGATGAA 192 0 

AGGAATTTGA CGATGAAGCC TACTAAAGTG ATATTAAAAG ATGCATCTTA TTTACATAGC 198 0 

50 

AAAACATCGA TAACATTTAT TTTAAAAGAT GTAGTTATCG AAGAAGATAA TAAAATTTAT 2040 

TATTTCGACA CTA QTGCAcT TTCGAAGATC AAGAAGTTAA ATTTGAATTT GCACTCTTTG 2100 
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TTGAAACTTG TTTCGCTGGC TTGTTATCAA AGCGGAAAAC ACGTAGTAAT GGTTTAGAAC 1020 

CAAGATTAGT ATGGTATATT AACACAGGTT GACCTTGATC GATAATACCT TTAAGATCTT 10 80 

CTAACGATTT ACCAGTGCCG TCTACGATAT TAGGATTGTA TTTTTGTAAA AATGGTACAT 114 0 

ATGCTTCTGG AAATATCGTT TGATGATAAT TGCCAAGCTT AATGAATAAG TGATGTCCAA 120 0 

CATAACCTTT ATGTGGATTG TTCGGATGTG TCGGCCAATG TCTCATAATT TCTGTAGCAG 12 6 0 

GGATATGTTG GTTGTTGTAT TGCAACATCA TGGCTGCGGA AACACCTTCA CACCCCATGA 13 2 0 

CCATAGGGAT AGGAAATAGC TGACTGATAG GTTTAACTGG TAATATTTTT CGGTTCATAA 13 80 

TATAGTCCTC GCATTGATTC AATAAATATT TAATATAATT ATATAGCGTC AATGCAAAAT 144 0 

GTCCTAAACA TATGTTTTAC ATGAGTGAAT AAAATTAATG GAGTGATAAA ATGGAATATC 1500 

AATTACAACA ACTTGCGTCG TTAACGTTAG TAGGTATTAA AGAAACGTAT GAAAATGGAC 15 60 

20 GACAGGCTCA GCAACATATA GCAGGGTTTT GG CAAAGATG TTATCAAGAG GGAGTAATTG 16 20 

CGGATTTACA GTTAAAAAAT AATGGTGATT TAGCCGGGAT ACTTGGCTTA TGTATACCTG 16 80 

AATTAGACGG TAAGATGTCA TATATGATTG CAGTTACCGG AGATAATAGT GCTGATATTG 174 0 

AAAAATATGA TGTCATAACA TTAGCAAGTT CAAAGTATAT GGTATTTGAA GCACAGGGCG 18 00 

CAGTACCTAA AGCAGTTCAA CAAAAAATGG AAGAGGTTCA TCACTACATA CATCAATATC 18 60 

AAGCAGATAC GGTAAAATCA GCACCATTTT TTGAGTTGTA TCAGGATGGT GATACTACAA 192 0 

GTGGAAAATT AATATTACCA GAAATTTGGG ATnCCTGTTA AAGGGGTGAT TGAAATAnGA 1980 

AnTG 19 84 
(2) INFORMATION FOR SEQ ID NO: 227: 

35 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6373 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 : 

4 * GATTCCACGT GTGTTAAAAG AAGTTACAcC TTCAATGATG GTATTTACTA ATTT CTTT AG 6 0 

AGATCAAATG GATCGCTTCG GTGAAATTGA TATTATGGTT AATAACATTG CAGAGACAAT 12 0 

TAGTAATAAA GGCATCAAAT TATTGCTAAA TGCTGATGAT CCATTTGTGA GTCGTTTGAA 18 0 

50 

AATCGCAAGT GATACGATTG TGTACTATGG TATGAAAGCA CATGCCCATG AATTTGAACA 24 0 
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GAACAACTTT GATTCTGTCG ACTCCAATGA TTTTATCAAA TTCATTTAAT ACGCCATCAA 54 0 

AGTCCTCTTT AACATTATAT CCAGCATCAT GCGTATGACA TGTATCAAAA CATACTGATA 600 

AACGTTCGTT ATTATGAACT CCATCAATAA TACGTGCTAA CTCTTCAAAT GAGCGACCAA 660 

TCTCTGTACC TTTACCTGCC ATCGTTTCAA GCGCAATACG TACATTATTG TCATTCGTTA 720 

AAACTTCATT TAATCCTTCA ATAATCTTAT TAATTCCGGC ATCAACACCA GCTCCAACAT 780 

GCGCACCTGG ATGTAATACa ATATCTTTAG CCCCTATAGC TTGCGTTCTk TCaATTTCTT 84 0 

GTTGCAAGAA ATCTACACCA AGATTAAACG TTTCTGGTTT GGTTGTAttG CAATaTTaAT 900 

GATGTATGGT GGCATGAACA ACAATATTAG 930 
(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



• (xi) SEQUENCE DESCRIPTION: J 
TGACGCACCA ATTTATAACG CAATTGACAA 
CTGGAAACCA TGATTTATGG GAAGTTGAAA 

30 

ATAAGAGTAT GTCACAGTGC TTGGTAGGAA 
T CAT AGGACA TACTGGCTGG TATGATTATA 
AGTTACAAAA AGGAAAACAT TATGGTGCGA 

35 

GCATATCAGA TCAAAATTTA TCTAAAATAG 
AAGTAGGAAA TAGACGAGTG ATTTTAGTCA 

40 TTCCTATGCC GCATCGTATA TTCGATTTTT 
ATCCTTTGTA TGCGATGTTC GATATACCAT 
AAAGTGTGAT AGATGATGGC AGATGTTATC 

45 GGCGTTCAGA AGATATTTAC CAGGAAATAA 
ATGCGCAAAC CTGACCCAGT TTGCGCATTT 
CTTACGTGTG TTTATTTTGT TGCTGATTTT 

50 

TGCACTTTCT TCCTGGTGAA TTAAAGCTGT 
TTTGTTTGCG ATTTGGAATG ATGGCAGGTA 



>EQ ID NO: 226: 

AACAATTAGA TATACCTGTG AAATTTGTAC 60 

GTATGACTAC GCAAGACATT TGGAATAATT 120 

AACCATTTAT AGTAAATGAA GAATGGGCAA 180 

GCTTTGCAGC ACAACGATTT TCATTAGATG 240 

CTTGGCAAGA TAAAGAACGA ATATCTTGGG 3 00 

CGGCTGAACA AGTGAAGAAA GATATATTAG 360 

CACATGTTGT GACGCACCCT GATTTCATTG 420 

ATAATGCATT TATTGGGACA AGTGATTTCA 4 80 

ATAGTATTAT GGGCCATGTT CATTTTCGTA 54 0 

TCTGTCCGTG TCTAGGCTAT CCAAGACAAT 600 

ATGAGACGAT ACAAATAATA GAAATTTAAA 660 

TATGTTTTAC ACACGCGAGT AATGTGTTTA 720 

CAATTGTATA TGAATGTGGT TGCACATAAA 780 

ATTCCATTTT CTCTTTACGA ATTTTAATAA 840 

AAACTAGGCC ACGACGAATA TGACTCCAAA 900 
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TACTTATGTG GAAGCACAAA AAGATGTTGC AAATGTTTTG GAAAATGTGG AGCAAGTAGA 54 0 

TGCGGTTGTT GGAGCAACTG ATACGATTGC ATTAGCTGCC TATAAATATT ATTCTGATAA 6 00 

AAAAGATGTT ATGAAACCAC ATCAAATATA TGGTTTTGGT GGTGACCCAA TGACACAATT 66 0 

AGTGTCTCCA TCGATAAAAA CAATTCATTA TAATTATTTT GAAGCTGGCC AATGCGCGAT 72 0 

GGaAGAGATA CAACAGATGC TTAAAAAGCA AGATATGCCA TATAGCGTCA CAGTAGATGT 780 

TAATATTTAG ACGCTGTATT TTTTAAAATA AATGTGGAAC C GAT AC CAT A TAACTATAAA 84 0 

TGGATAGGTT AAAAGTTAAA GAACGTAGGT AAAATTTGCT ATAATAGAAT ATAAATTGTT 90 0 

AACAGCATAA ATTATAAAAG GAGGACTGGG TAAATATTAT GACCGAATGG ACTAGAGAAG 960 

AACGTTATCA ACGAATCGAG GACGTTGATA CTGAGTATTT TAAAACATTA AAACAACAAG 102 0 

TTGATCAATC AAAATTTCGT CAACAATTTC ATATACAACC AGAAACAGGC TTATTAAATG 1080 

20 ACCCCAACGG ACTTATTTTT TATAAAGfiGA AGTATTATGT TTCACATCAA TGGTTCCCAT 114 0 

TAGGCGCAGT ACATGGCTTA AAGTATTGGT ATAACTACAC GAGTGATGAC TTAATAAACT 12 00 

TTAAAGCTGA AGGGCCAATT TTAAATCCAG ATACTAAATA TGACAGCCAT GGTGTATATA 1260 

GCGGTAGCGC TTTTGAATAT AACGGGCATT TATATTATAT GTACACAGGA AATCATCGAG 1320 

ATAATCATTG GCAACGACAT GCGAGTACAG ATGATCGCAC GATTGAAAGA AGACGGTTnC 13 80 

AGTTGGnAAA A 13 91 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 0 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

4 ° (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

ATTTATTTTA ATGTTTATAT TTTCTAACAC TTTTTTATGA TCATAGTAGT AATTGACATT 6 0 

TTTCAATTCA AAGACTGGTG TCATCGTATc TCACCTCGCA TTCAACTATA CAACTCCTAG 12 0 

TAACATATGT AAACAGTAAT GTTTACGACT CAAAATTAGA CAAAATAAAG AGATATGCCC 180 

CCTTCAAGTT TTATTTATCG CATTTCTTGA AGAGAGCATT ATCATTTTAT TGTTGCATAA 24 0 

CCTTATTTTT TAATTCTGGG TCAAATTGCT GTTGTTTTAA CATTTCAATT TCAAGTTTAT 3 00 

ATGGCGG TTT TTTATTTTTC TTATCTTCAC CAACATAAGG TGTTTCTAAG ATTTTCGGAA 360 
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TCCTTGTCGA TACCTATCAA CAGATGTTAC AAATAAAAAC CaCCCGTGTG AACGGGTGGT 312 0 

TTGTTCTGCG gCTATAAGCC TTCCTTACTG GCCaGCCCTA AAAGGGCACT GACAAGTCAG 318 0 

CCAACTGCAC TACTATTCCA GCAATCCTAA AGGTTTACTC r fT lTTCTIT CTTTTTTTAT 324 0 

TTTTCTCTCC AGTGAAAGGA TCTAAATATT CTTCCATTGA AATTTCATCT GCAACGATAT 3300 

CCTCTTGTAA TTGATTACGA ATATAATTTT CAATCACTTT TTTATTTCTA CCTACTGTAT 3 36 0 

CCACATAAAA TCCTTTACAC CAAAACTTTC TATTTCCATA TCTATACTTT AAGTTAGCAT 3420 

GTCTATCAAA TATCATTAAA TTACTTTTCC TTTTAAATAG CCAACAAATG ATGATACCCC 3480 

AAGTTTGGGT GGTATACTTA CTAACATATG GATATGATCT TTACATACCT CTGCTTCAAT 354 0 

TATCTCTACA CCTTTTCTTT CATATAATTG ACGTAATATA ATCCCTATAT CTTTTTTTAT 3600 

TTTTCCATAT ATCGCTTGTC TTCTGTATTT AGGTGCAAAG ACAATATGTT ACTTACAATT 3 660 

20 CCATTTCGTA TGTGCTAAAC TGTTTGTGTC AGATGACATT AAATAGCATC TCCTCGTGTT 3720 

GATTATTTTG GTTGGCTGAC CAATATTTAC TCTAACATGT AGAGATGCAT TTTTTTGACA 3 78 0 

ATGGTAGAAC CTTTTCTGGG GAGTGGGACA GAAATGATAT TTTCGCAAAA TTTATTTCGC 3 84 0 

CGTCCCACCC CAACTTGCAT TGTCTGTAGA AATTGGGAAT CCAATTTCTC TTTGTTGGGG 3 900 

CCCCGCCCCA ACTCGCATTG CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC 3 960 

CTGACTaGAA TTGAAAAAAG CTTaTTaCAA GCGCATT 3 997 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

GnGCGAGACA AACACACtTA TTGGTGCCAT TATmCcTAGA ATGaATTCaT ATGCAGTAGA 60 
TGAaaCAATC AAAGGATTGG CAAAACAATG CCAAAAATAT GAATCaCAAT TAATTTTAAA 120 

45 

TTACACAGGT TTAAATATCG AAGCAGAAAT ACAAGCGCTT GAAACATTAG CACGCaGTAA 180 
AGTAGATGGT ATTGTTTTAA TGGCTACAGA CATAACAGAG AGACATATTG AAGTCATTAA 240 
TAAAATGAAT GTACCAATCG TTATTGTTGG TCAACAACAT GAACAACTTC ATAGTATTGT 300 

50 

GCATGATGAT TATAAAGCAG GTCAAATTAT AGGCGAATGG ATTGGTCAAC AGGGATATCA 360 
ACAAGTTGAA GTGTTTAGTG TAAGTGAAAA AGATATTGCA GTTGGTATAC ATAGAAAACG 420 
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ATAACCGAAT TCTACCATAC CAGGGTCACA GACAATCATC ACTTTTTCAA TCTTGTCCAT 13 2 0 

TGTTGTTAGA CTCATGATTG CATTTTCTTC AAAATAAATT TGAGCAGGCA CCTTGAAAAT 13 80 

TTGAGTATTA TTACGTCGTT TAGCAATCGT TTTAATGTTT AATAAATCTG TCGCACTAAC 144 0 

ATTATGTGAA ATTGAGTTTC TACCGTAGaA CCACAACCTA ATGTTAAAGA CGGAATCAAT 1500 

TCGTTATACA TATCACCAAT ACCTCCAACC GCTGATGGTG TATTTACAAG TACACGACAA 156 0 

GCTTTCATTC TTAGTCCAAA AT CTTTTTGT AATGTTTCAT CTTCTGTATG GATAACGGCT 16 2 0 

GTGTGTCCTA ATCCACCAAA ATGTAGTGTG TCTTCACAAA TTTGAAATGC TTGTTTTGTA 16 80 

GATTGGGCTT TTACTAAGGC TAATACTGGA GATAATTTTT CACGAGATAA CGGATAGTCT 174 0 

GAACCTACAC CGCTAATTTC GGCTATGATA AGTTTTGTAT TTTCGGGGAC AGGTATACCT 1800 

GCTAATTCAG CTATTTCAAC TGCAGATTTA CCGACAATAT CAGGCTTAAT ACCTGTTTTT 18 60 

20 TGTTCATTcA TAATTGCATT TTCTAAGCGT TGTAATTCAT CTTTTTTAAC AAAGTATGCT 192 0 

TGATGTGCTT TAAATTCATT AGTAACATCT TTATAAATTT CTTTATCAAT GACTACAACT 1980 

TGTTCAGAAG CACAAATCAT ACCATTATCA AATGTTTTTG AACCAATGAT AT CATTTACT 2 04 0 

GCACGTTTAA TGTGTGCTGT TTTTTCAATG TAAGACGGCA CGTTACCTGG TCCCACACCT 2100 

AATGCCGGTT TGCCAGTTGA ATATGCAGAC TTAACCATGC CCGAACCACC TGTTGCTAGA 216 0 

ACTAATGCAA TACCTTTGTG ATTCATTAAT TGTTTTGTTG CTTCGATAGA AGGCACTTCA 222 0 

ATCCACTGAA TAATATCTTT AGGTGCACCT GCCTTCATTG CCGCTTCTAA TACAACTTCT 22 30 

GCTGCACGCT TCGACGATTC TTGTGCACTT GGATGGAATG CAAAAATGAT TGGATTTCCT 234 0 

GTCTTAATTG CAATCATCGC TTTAAAAATA GTTGTCGACG TAGGATTTGT TGTTGGCGTA 24 0 0 

ACACCACAAA TAACACCAAT TGGTTCCGCT ACATACGTTA ATCCTTTTTC TTTATCTTCA 24 60 

CCAATAATCC CTACTGTCTT ATTGTCTTTT ATTGAATTCC ATATATATTC AGAAGCGTAT 2 520 

40 AAATTTTTAA TCGCTTTATC TTCGTATATA CCTCTTCCAG TTTCTTCATG TGCTAATTTT 2 58 0 

GCTAGCACCA TATGTTGATC AACAGCTGCT AAGCTCaTTT GATGAACAAT ATGATCAATT 2 64 0 

TCTTCTTGTG ACTTTTTAGA TAATGCTTCT AATGCTTTTT TCCCTTTGTC AGCTAGAGCA 2 700 

45 

TCAATCATAA TTCCCACTTC TTGTTCTTTC GATCCACGAT TTTCTTTTTC AGGTATAGTT 2/bu 

AACATATACA ACCACTCCTT TATACTTTGT GAATTATTTC ACAAACATTA TAGTACATGT 2 82 0 

CTCTCAGGAT ATAAAGAAAA TTCTATACAA AAAAGTTTAA TTTCGAATAT TATTTGAACA 2 88 0 

AATATCAAAT TTTAAAATAA ATGTTTTCAT GAAATCATTG TTATTTCGGT GTTTTTAGAA 2 94 0 
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ATTTTCTTCT TTTGGTTTAA ATTGATTTAC TTTTTCTTCC GGCAATGTTT CAAATCGTAT 102 0 

ACCGACCGTT TTCGCACCGT TTTCTTTTAA TGCATTAACA ACAGCCATCT GTAAATCTAA 1080 

aTTGCGtGCA CCACCTAATT GTGCCATTG 110 9 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3997 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
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TCTTTATTTA 


AAAAAATGAT 


TGTCTAGTTT 


GTATCTCTCT 


GAAGATTTGG 


CAATAAATAA 


60 


AAGCCGATAA 


CCGTATAATG 


ATTATCGACT 


TAAAGTTTAT 


GTGGCATTTT 


TTACTTTTGT 


120 


AATTTCAGGT 


GAGTTAGATG 


ATTATTATCA 


GATAGATTAT 


TGCTTATAAT 


CATATGATGT 


180 


TTGAATGATA 


TCTTTGATTT 


CACTGATTAG 


TGCTTCTTTA 


GGATTAGCAG 


TTGTACATTG 


240 


ATCTTCAAAT 


GCGAGCTCTG 


CCATTCTATC 


AATTGACTCA 


TTTAATTCTT 


CTTCAGACAC 


300 


ACCTTGTGAT 


TTCAAATTCA 


TTTCAATTCC 


GACTGATTGA 


CCTAATTCGT 


AGACAGCTTT 


360 


AgCTAATGAT 


TCTACGAGTG 


CTTCTGTCGT 


ATTACCTTTT 


AATCCTAAGA 


ATTTGGCAAT 


420 


ATCTGCATAA 


TCTGTATCTG 


CTCTGAAGAA 


CTCATATTTA 


GGGAATAATG 


CATGTTTTTG 


430 


CGGGTCTTTG 


GCATTATAAC 


GGATAATATG 


CGGTAGTAAT 


ATCGCATTCG 


CTCTACCATG 


540 


CGGAATACCA 


TATTCGCCAC 


CAATTTTATG 


CGCAATTGAG 


TGTGCAATGC 


CTAAGAATGC 


600 


ATTTGCAAAT 


GCCATACCAG 


CCAAAGTTGA 


TGCGTTATGC 


ATTTTCTCTC 


TTGAAACTTT 


660 


ATCACCCTTT 


TCAACAGATG 


ATTTTAAATA 


TTCGAACGTC 


AATTTAATCG 


CTTGTAGACT 


720 


CAAACCTCTT 


GTGTAGTCTG 


AAGCCATTAC 


AGATACATAT 


GATTCCATTG 


CATGCGTTAG 


780 


TACATCCATT 


CCTGTATCTG 


CTGTAACGCT 


TTTTGGCACA 


CTCATCACAA 


ATTGAGGGTC 


840 


AATAATTGCA 


ACGTCAGGTG 


TTAAAGCAAA 


ATCAGCCAAC 


GGATATTTTA 


CATTTGTTTC 


900 


ACTATCTGTG 


ATAACTGCAA 


ATGGTGTTAC 


TTCTGAACCT 


GTACCTGATG 


TCGTAGGGAT 


960 


ACAAATGAAC 


GTCGCATTTT 


CAGGCATGCC 


TATTTTATAA 


GTACGTTTAC 


CGATGTCTAG 


1020 


GAACTTTTGT 


TTAGCACCGA 


AGAATGATGT 


CTCAGGGTGT 


TCAAAGAACA 


TCCACATTGC 


1080 


TTTTGCAGCA 


TCCATCGCTG 


AACCACCACC 


AAGTGCAATG 


ATTGTATCCG 


GTTGGAAATC 


1140 


AACCATCATT 


TCCAGACCTT 


TATATACTGT 


ATTAGTTGAT 


GGGTTCGGTT 


CGACTTCGCT 


1200 
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ACCTCTGGAT TATATGATAT ATCTCCATCT TTATAATTCA TTAAATCTTT AAAATTGCTA 103 80 

TATTGCGCAA AAAACTTAAA GTTTTCGATT TCTTTTTTTA TGTtTTCTTC TTTAACTTCC 10440 

5 

TCAGTAGAAA TGAATTTATT ATTAATCATT TTAACTGGAT ATTTTTTTTG ATTATCCTGA 105 00 

GCTACTTCGT ATTTCTCCGT CTTTAtTTCA TTAGTATAGT AAAAtCCTTT TGCACTTCTT 10 560 

GTATTTCTAT CTATCTTCAA AAGCATGCCT TTTATTTTTA GAGCTTCTCC TTTATTTTGA 10620 

ATTGCCATTT GAGAATTTAC AATCCATGTT CCCTTATCAT TTTTATCAAA TTGATCATCA 106 80 

CGATATCCTT CTTTATCGTA TAAATCCTCT AGATTTTTAA TCGGATACAT ACTCAATGTT 10740 

15 TTTTCAAAAC TTTTCTTT 10758 

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 base pairs 
20 (B) TYPE; nucleic acid 

<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

25 

■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
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nTaTCaACTT 


TGG aATTTAA 


AgTCAATAAC 


TTTTTTAAAA 


ACTTTTTGTG 


TTCACAACCC 


60 


GCTTCTTTTT 


CAACGCGTTT 


ATTGCTTAAC 


ACAAGAACTT 


ATTTTACCAG 


CATTCCAAAA 


120 


CAAATCAACA 


TAAAAACGTA 


CAAAATAAAA 


GTAATTTTGT 


ACGTTTAGCA 


TATATTATAC 


180 


CTATTTATTT 


GTAGCAGCTA 


TAACTTTTTG 


TGCAATCGAG 


CTATAAATTT 


TACCTAGACG 


240 


ATCATCTGAT 


TGATATATTG 


ACGGTGCAAA 


ATCTTTTGGA 


TTCCAAGATG 


GTTGCTCTAA 


300 


AGGTAATTCC 


CCAAGTAATT 


GAGTATTAAG 


TTCATCAGCT 


AACTTAGTAC 


CGCCACCTTT 


360 


GCCAAAGACA TATTCTTTAT 


TACCCGTCTC 


TTTACTTTCA 


AAATAACTCA 


TGTTTTCAAT 


420 


TACGCCAAGA 


ATAGAATGAT 


CCGTATGTTT 


TGCCATCGCA 


CCTGCGCGAg 


CTGCAACAAA 


480 


TGCTGCTGTA 


GGATGAGGTG 


TCGTTACAAT 


AATTTCCTTA 


CTTGAAGGTA 


ACATCGTATG 


540 


AACATCTAAA 


GCTACATCTC 


CTGTTCCAGG 


TGGAAGATCG 


AGTATTAAAT 


ATTCAATGTC 


600 


TCCCCATTTA 


ACTTCTGTAA 


AGAAATTCGT 


CAACATTTTA 


CCTAACATTG 


GCCCTCTCCA 


660 


TATAACTGGC 


GCATTTTCTT 


CCACAAAAAA 


GG CCATTGAT 


ATAACTTTAA 


CGCCATGACG 


720 


TTCAACTGGA 


ATTACTTCCT 


TCCCTTTAAT 


TCCAGGCTTT 


TCATCAATAC 


C CAT CAT AT C 


780 


TGGTACACTA 


AATCCATATA 


TATCGGCATC 


TACTAATCCG 


ACTTTTTTCC 


CTTCACGAGC 


840 
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TTCGCATGCT CAATTCGCGT CGTATTTAAG AAATGATGGA ATCCTACACC TAGCGATTCT 8580 

GTAAACTTTT TAGACAGATG GCTCTCTGAC CACCCAACGT ATTCGCTTAA TTCTGAAAGG 864 0 

CTTAAATCTT CATGAAAATG TAACTCGATA TAGTCGCATA CTTGATTCAC TTTATCATCA 87 00 

TTTAAGATAC TTTGGTTCGA ATGATATGTA CGCGGGACAT AATGAATCAT ATGCATAAGC 87 6 0 

AACTGAATCA CAAGTTGTTG CTCAGTCAAT TTAGACAACT CATTATGTCG GATATGTGTT 8820 

GAAACCAGTC TTGCCATTAT ATTTCTCAGT TGATGTATAT TCTTTGTTGT GGTCGCATCT 8880 

GTTAAGTGAA AATATAGACA ATGCACATCA TCAAACTTGT CTGCTAAATA TTTCATTTGG 8940 

AATTGGATAT AACATATGAT GCCATCTTGT TGAAGTTGAA ATCGATACAA GTCGCGGTGG 9000 

TTAATGATGA AAATGTCGCC ACTGTTGCAT TGCGTCATAT TATTTTCATC ATAAATGTGT 9060 

GCCTCnCCTT TAATAACAAA ACCAATCATT AAACTATTGA GCCTTTTGAA ATCTGACATA 9120 

20 CTCTCAGTTT CTACTCGAAT TAAATAATCA CGTTGCATAC TATCCCTCAA TTCAGTAATA 9180 

TGAATACGTT TATTTTACAT TATTTTACAG CAACATATTT GAATTTCATA TTGAATCGTG 924 0 

TGTGTGGATG ATTATTTATC CTCACTCGGT TCAAGATGTA GACTATCAGT AAAAAAAGTA 93 00 

TTTTCACCTT TTTTCTCCAC AAAAGTAAAT TCAATGTCTT TATATCCAAC TGrTGaACCT 936 0 

TTTAAGTCTC CCgAACCTTT CaACaATAAC TTTGGTGCTT TATTCGTTGG TATTTTATAT 9420 

CTTTTTCGTA ATTGTTTTAC ATTATAGTCA TCATTAGTTA ATTGATATTT TGCTGAATAA 9480 

CTCGGTACCT CTGGATTATA TGATATATCG CCGTCTTTGT ACTTCGACAA ATCTTTAAAG 954 0 

CTGCCATATT GCGCGAAGAA CTTAAAATTC TCGATTTCTT TTTTTATATT TTCGTCTTTG 9600 

ATACCTTTAG TTGGAATGAT TTTATTGTCT ACCATTTTAA CGGGATATTC TTTATCTTTA 9660 

CTCTTAGGTC TACCATCTTC ATCATGAAGT GTTTCACTCA CTATATACTT CCCGGTTGTA 9720 

GTCTTAGTGT TTCTATTCAT ATATAGAACC ATACCTTTTG ATTTCATACG TTCCCCTTTA 97 80 

40 GGTTGAACAA CCATTTCAGA ACCAATAATC CATGTACCTT TATCATTTTT ATCAAATTCG 984 0 

TCATCACGAT AACCTTCTTT ATCGTATAAA TCCTCTAGAT TTTTAATCGG ATACATACTC 9900 

AATGTTTTTT CAAAGCTTTT CTTAACTTCC GCTTCTTTAC CTATGCCACA ACCAGCAGTG 9960 

AAACTAATGA CTAATATCAA AAAACTAATA TACAATACCA ATTTGTTTAA TCGTTTCATA 10020 

ATTTCACAAT CCTATTCTTC TTATTATCTT TCCTGGATTG ATTTCATATT TTGATCGAGT 10080 

CATGATTATT TATCCTCACT TGGTTTAAAA ATTAACCCAT CACTAAAGTA AATGTTCTCT 1014 0 

TCTTTTTTCT CTACAAACGT AAATTCAATG TCTTTATATC CAACTGATGA ACCTTTTAAA 10200 

TTCCCTGTAC CyTTCAACAA CArCTTCGGy GCTTTATTTG TTGGTATGTC AT AT CTTTT A 10260 
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ATCATTTTGC CTTCTTTAGA TTTTAAAGTT TCAATTAATT GTTGATTCAT ATCCTTCATC 6 780 



TCCTTAGTTA CATCATAAAT GATTAATCAT TATTTATATT GCCAACAACA GAGATGTTAA 6 84 0 

5 

CCATTAATTT TTTGCAATTT TAG CTTTGAA TATAAAAAAT CACAAATTAT GTATATCAAA 6 900 



ATTTGTGATT TGTGATCATT TTATGAACTT GGGTAACGTT TTACTTCAAT TAAGTGAATC 6 96 0 



CCATTCGTAA TCATTTTAAT GTTTAATGCC AGTGTGTCCG TGATATCTAT ATCATATACT 702 0 

10 

TCTAATTTCG GAAAACTCAT TCGATTAACG TAATCTATAG AGTCCTTGTC CATGCCATGT 7 080 



ATCGTATGAT GTTTGCGCCA AAGATTAAAT AACGCACCAT TTTCTTTATC TAAGGTAAAA 714 0 



TGTTTAATCT TATACATACC TTCTTCCAGG GCATTAATGT TCAAATGAAT CATTTCCGTC 7200 



GCACGCATAT TCATTTGATT GTCCAACGCT AAGTACGGAT TAAAATGCTT TGCATCATAT 726 0 



AACAATATTT GAAAATTTGA ATCAGTCCCC GTGACAATAC ATGTATCATC AGAATACAAA 73 20 



ATATTGCTTG TTAATTTATT AAATAGCAAT GCCGTGAAAT AGACCGGACG TTTTCCATTA 



TATTGATGAA ATAGTTCAAT AGAATTCATA TAATCCCGTT CATTTTTACA ATGACTGACG 744 0 

TGCAAATCAT AATTCAACCA ATACCCGATA CCCTCTACTT TAGAACTTAA TTTTAATAAT 7500 

25 

TGCTCAATGA TGATACCACC TCTAAAATAT TCGCCGTTTG TAATAAATGT ATCACCCGTC 7560 

AATGTATTCC AATTGAGTAA AATGAGTGGA CGCTTTAGGC GATGACGATG CATTAAGTCG 762 0 



ATAAGGTAAT TCGTTTTATT AATAATCATT TGACTCGCGG TTTTAAATTC ATCATCATTC 76 80 

30 

ATTTTATTAA AATCAACAGC GTCATTTGAA TTGGCATCAA ATACAAAATG GTCGATGTGT 774 0 



GGCTCAAGTC GTTTCAATAA TGGTAGATGT CTTTCCGTAG CTTGATCTAA GTGAATGTAC 7 800 



AAGCCACCAT TAGGGAATAA TGCTTTAAAA TAATCAATCA TTTCAATCAA AGACGTGTGC 7 860 



AATGTCGTCA CATACAAGTT GAACTTCAAA TCTTTTCTAT GACTGACATG CAGGGCAACG 7 920 



TGATGGATAA AAATTTTAAA TGCATCGATA TAATCACGTG AGTCATACTG ATCCAAATGC 798 0 



ATGGTCAAAC TAAAGTTATG ATCTAATAAA AAGTCTAAAC ACAAATCAAT ATCATAAAAT 8 04 0 



ATATTCGAAA TTTCTGCATC ATACGTGAAT GGCGCATTGA GCTTTTTCAT GATATATGGA 8130 



ATCACATCAT ATGCTAATAC TTCATTGACT TGAAAATCAT GATGACATGT AAGCAACTGT 816 0 

45 

GATTGATACT GTGTATTGAG CAAATTCCTC AAATAGCCCA CTTGAATAAT ATGATTAAAT 8220 



TGATTTAGTT GGTGATTGGT TGGTTGAAAG GCAATCTCTT TATAGTTCAT CTTTTCAATA 82 8 0 



TCTTCAATAA AATGATTCAT TTCTTCAATG TAGTCATTTA AAAGTAATAT CAATTCACGG 8 34 0 

50 

TCGTGATAAT CATGTTGTGC CGATTGCTGG TTTTCAGTGA TTGCTGGACG ATCACCTCGA 84 0 0 
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AACCAAGTGC TGCACCTTTA GTAATGACAA CGTTTAAACT TAGCAACATA A CT ACT AG AA 4 980 

CAATTAGCCC TGCAACGTCA AATTTATGTG TATTGGTAAT TTCTGATTTC GTTTCAGGCG 504 0 

TCCCTTTGAT GAGTAACATT GAAAGTACGG CAACGATAAT TGAGAAGATG AAAATCCATC 510 0 

TCCAACCCAT AGTTGTCGCA ACTGCACCAC CGAAGAGTGA ACAGATACCA CTGCCACCCC 516 0 

AAGAACCGAT AGACCAATAA CTTAAGGCAC GCTGACGTTC AGCACCCTGA TAATAAGTTT 5220 

TCATAATGGC CAATGTAGAA GGCATAATAC ACGCTGCTGA TACACCTTGT ATAACACGAC 5280 

CTAAAATTAA TAATGCCGGT AAATTCGTAA TAATAATTAA TGCTGAACCA ATAATACTTA 534 0 

ATAATAAACC GATATTCGTC ATTTTCACGC GCCCAATTTT ATCTGCCAGA CCACCTGCTC 5400 

CAACAACAAA CATGCCTGAA AATAGTGCAG TTAGACTGAC CGCAATACTA ATTGTCCCCA 54 60 

TGTCTGTACC AAAACTTTGT TGTAAATTCG GTACAACATT TACAAGTGAT TGTGCAAACA 5520 

20 ACCAAAATGT AATAACACCT AATACAATAC CTAAGATTAA CTTGTTGCCC CCGCGATACG 5580 

TTTCATTCAT GTTAGTTATC TCCTTTAAGG TAATCTAAAA CAACTGTCCC TACTGCTTCT 5640 

GCAGAAATAA GTAATGATTT TTCTGAAATG TTAAATTTAG GATGATGATG TGGGTAAATT 5700 

TCACCATTTT CCACCGCTGC ACCTGTATAA ATAAAGGCAC TTGGGCGTTC TTTAGCATAA 5760 

TATGCAAAGT CTTCTGAAGG TGGTTGTGGT TCACACATTT CAACACCAAA ATCAAGGTTT 5820 

GCTTCTTTCA ACGTCTTAGC CACGTACTCA GTAAACTCTG GATCATTATA TAATGCTGGA 5880 

TAATCATCGT TATATTCTAA GGTGCAAGTt ACACCATACA TATCCTCTAA TCCTTTTGAT 594 0 

AAACGTTTAA TTTCTTTTTC AATTGTTGCT TTTGTAGCAT CTGTTAATCC ACGTACATCA 6 0 00 

CCTTCAATTT CAACAACATC TTTAATGACA TTGAATTGAC CTTTACCGTC AAATGAACCG 6 06 0 

ATTGTGACAA CACCGGTTTC AAATGGACTT AGTCGTCTAG ATACAACTGT TTGTAACGCT 612 0 

GTGACGAAGT AGCTACCTGC AACAATGGCA TCATTGGCCA TATGTGGTGA TGAACCATGA 6180 

CCACCTTTAC CTTGAACTTT CAATTTGAAG AATGCGCGTC CTGTTTGAAC ATAACCAGGT 624 0 

CTGTAATACA CTTTACCTGT TTTCATTGTG CTCATGACGT GTACACCTAA TACATGATCA 63 00 

ACACCGTCTA ATACACCATT TTCAATCATT GTTTTAGCAC CACCTGGTGG TACTTCTTCA 6360 

GCTGGTTGAT GTAT CACAAC GACTTTTCCT GTAAAACTAT CTTTCATTTC AGCAAGCGTC 64 2 0 

TCTGCTAATA CAAGCATGTA TGCTGTATGT GCATCGTGAC CACATGCGTG CATAACACCT 64 80 

TTATTTTGTG ATGCAAAAGA TAATCCTGTA TCTTCAGTAA TGGGTAATGC GTCAAAGTCT 654 0 

GCACGGATTG CTAATGTTTT ACCAGGTTTC CCTGAATCAA TCGTTACTTT AATTCCACGT 6 6 00 

GGTCCGACAT TCGTTTCTAC TTCCACATCT TTACCTTTGT AAAATTCAGC GATGTATTTC 666 0 
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AGACATTCGC GAAATTACAA AAGACGATAA 
TCAAGCTGTT GAAACAATCA ATGATAGCAT 

5 

AGCCATTACG AAGGCAGACG TTGAGCGTTT 
TTTAAAAGTA GCAGAGCTAA CAGAAGAGTA 
AGATATTCGC ATTACAGAAG ATGCACAACG 

10 

TGAGTCATTT GATAAAGCCA TCGACATGTT 
AATTGTAGAA CGTAGTAGAG AATCATTTAA 

75 TAAACGCCTT AATCGTGGTG AATGTACAAC 

TGGTGTTCTT GAACGTATCG GTTATCATTC 
TAACGATGAT GTACCTACAG ATGAAGAAAT 

20 TTATTTATAT TCATATTTTT TTAAAATTAO 

ACATTCATGG GTTGGCTTTT TTGTTTAGCA 
CTGATATAAT AATGCTTCAT TAGTATGCGG 

25 GACCCCGAAA CGTTTAATAT ACACTTTTAC 

ACCTTCACAT TATATATAGT TCTTTCCATA 
AAATAACCAT CCACCTAACT TATCAAAAAT 

30 

ATTTATATTA GTGTTAATC C AATCATAGAT 
TCATTTAATT TGCTTTACTT TCATTTATAT 

35 ATTATTTCGC ATCTTTGACA CGTTTATCAT 

ATGCCATGAT TCCCATTAAT ACGTTAACCC 
TCGTTGCAGC AACTGCACCA GCATATACAG 

40 CAAGTGATGA AGCCATTTTA TAAATACCTG 

TCGAAATAGC TGTATCTGTA GAAGGTGTTG 
AATATCCTAC GACACAACTG ATAACATAAA 

45 

GTGCAATACC AATGACCACA ATGAATGTAC 
TTTGTAATAA TTTTTCACCA ACTCGAATCA 
ATAAGTATCC TGCCTGCAAT GCTGTATAAC 

50 

CAATTAATGT ACCTGCAAAA CCGTTTAATA 
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AAAATTGATC AAAAAGCTTG AACAAAAGCA 3180 

TCGAAATTAT TTAGTTAGAA TTTCTACAAA 324 0 

AGCAGTTATG TTTGATGTCA ATCGCTCTAT 33 00 

TGTCGCTCAA TTAAAACGCC AACATGATGA 33 60 

CGGTATGGAT AAATTATTCA ACCATGTTGC 34 20 

AGATGTTTAT GACAAAACGA AAAAAGATGA 34 8 0 

TATTGAACAT AAACTACGCA AAGGTCATAT 3 54 0 

AAAAGGCGGA TTACTATATA TCGATATGAT 3600 

ACGAAATGTT TCTGAAGCAC TTGTTGGCCT 366 0 

TGCAACAACT GAAATTTAAT TTTTACTGTC 372 0 

AGATTCAQAT GCATGTAAAA ACCCAATCCA 3 780 

AAATTTATTA TCTTAAATCG GCTATAAACA 3 84 0 

TAAGCATGAC GGACACTGTT CTCGGAGTCT 3 900 

ACGTCGCCTT CATTGAAGCG AATTGCCATA 3 960 

TAAATGTCCA AATTTTTAGA ACAACGCAAT 4 020 

TTAAGTGGAT GGTTTTTCAT TTTCATTTAT 4 0 80 

TTATCTATAT GCACTGCTCT ATACATTTCC 414 0 

CATTATCAAA ACACTTGGCG TGTCATCGTT 4 20 0 

CATTAGGAAT CGCGAATAAA ATTGCGATAA 4 26 0 

AAAGTGCAAT CATCGCACCT GTATGAATGC 4 32 0 

CACCACTAAT TGCGACACCG AATGCGCCAC 43 80 

AAGCAACGCC AACTTTATCT AACGGTGCAT 444 0 

CATAAATACC TAAGCCTAGT CCGAAACATA 4 50 0 

ATATGCCTGG TAAGAATACT AATGAAATAA 4 56D 

CTAATAACAT TGGTCGCTTA GAACCCATTT 4 62 0 

TCAATAACAC CATGATTAAA TAAGTAATTG 46 80 

CTAAACCTTG TTGCACGAAT GTATTCGCTA 4 74 0 

AGAAGTTCGA AATCGTTGCA CCTGTATATG 4 8 00 



AATACGATTC CTGTTTATAT GCCATATATC 
GACAGACCTT TAGTCGTCCC GCATCAATCT 
5 GAAACAGAGC GAGACACTGT ATTTACAACA 

GTTTATCAAT TACTAAATAT AGATCGTGGT 
CTTCGCGTCT TAATGGATGC CATATACGAA 

10 

ACTAAAGATT CGAAAATGCA AAAACTCGCA 
ACGTACATTG AGTCATTATT AAAAGAACAC 
TTTTATTTGG TGATTTCAAA TCATGAGACT 

15 

TTTCGTTGTT CCACTCTCAT GATTTTTTTG 
ATTTTGTTAA ACAAGTGATT GCAAACCTGC 

20 GACGATATTT TACAAGTCAT ATACAAATAA 

TAACATTAAA TTTACAATTA TAAGCGATAA 
AATGGAAATG TCGGTTACAG AAGTCATTTT 

25 TTACGGCTTA AAAATCATGG GAGACGGGCT 

TATTTTAAAC AAATTTACAT CAAATCCAGT 
TATTTTAATA CAAAGTAGTT CAGGTACGAC 

30 

ATTTATGACA TTGAAACAAG CCATTGGAGT 
AACTGCATTT ATTATCGGTA TAGATTTAGG 
TGCATTCTTA ATCTTTTTCT TTAAACGCTC 

35 

cggtttcggt tcactattct tcggtctaga 
atcattagat ggatttaagc aattaatgct 

40 cattgtcggc gcagggttaa cagcactagt 
acaagaattt tatcaacaag atttaattag 
cgataacatt ggtaccacga ttacagctat 

45 aaaacgtgcg gcg cttgtac acgtcatctt 

TTTCTTGCCA GTTGTGATTC ATTTGATTAG 
AGCGATGACG ATTGCAGTAT CACATGGTAT 

50 

ACCATTTGTA GCAGGTTTAG CATGGATTGT 
TGATGACTAT AAACCTCAGC ACTTAAACAA 

55 
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ACAT CTTATT TCATGACGCG TGCTATCGGC 13 8 0 

CAGAACTTAG CATTTATTGG TAACTTTGCA 144 0 

GAATATTCGG TTCGTACTGC CATGGAAGCT 1500 

ATTCCAGAAG TCATCAATAG TCCATTTGAT 1560 

CTGAATGACC ACCAAGATTT GCGTGAGATT 1620 

TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 16 80 

AAATTGTTAT AACGAAAACC ATTAATAGAT 174 0 

GGGACAGAAA TGATGTTTTC ATAAAAATTA 1800 

ATGAAACATA ATTACATGAT TGATTGCATC 1860 

CATTTCACAC TGAAAATTTA CATAATAAGT 192 0 

CATATATTGT TAAATAATTT TACCTAATCT 1980 

TCTAAATATA AAGCTTATTT GAGGTGAAAT 204 0 

CTCCTTTTTA GGTGGTTTAG GTATTTTCCT 2100 

TCAAGCATCA GCAGGAGACA GGCTACGAGA 2160 

ATTAGGTGTT ATTGCAGGTA TCGTTGTAAC 2220 

AGTTATCACA ATCGGACTGG TAACAGCTGG 22 8 0 

GATAATGGGT GCTAATATCG GAACAACGGT 234 0 

CGAATATGCA ATGCCAATTT TAGCATTAGG 2400 

TAAAATCAAT AACATTGGCC GCATACTATT 2460 

ATTTATGGGT GATGCCGTTA AACCTTTAGC 2520 

TGATATGTCT ACAAATCCAA TACTCGCTGT 2580 

TCAAAGTTCA AGTGCGACGA TTGGTATTTT 264 0 

CTTAAACGCA GCAATCCCTG TGTTACTAGG 2700 

CTTAGCTAGT TTAGCCGGCT CAATCGCTGC 276 0 

TAACTTAATC GGGGTAATTA TCTTCACAAT 2 82 0 

TTTGTTACAA GATTTATGGC ACTTAAAACC 2880 

CTTCAACATA ACAAATACTT TGATTCAATT 2940 

TACAAAGCTT GTCCCAGGTA AAGATATTGC 3000 

AGATCTTGTT TATCACGCAC CTGGTGTTGC 3 060 
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TTTGAATCAC TTAATGAATC AGATTCACTC ACGCTTTCTG AACTTCTTAG TGACGTCGAT 13 020 
ACACTTAATG ATGACGAATC GCTTGTGCTT ACTGAATCG 13 059 

(2) INFORMATION FOR SEQ ID NO: 221: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10758 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 221: 

AGGGATGGCC TTACCTAAAA AACCGGGnAA ACCCTCCAAA ACCCATTAAA AGGnTGGnTA 60 

CCCTTTAAAA TGGTAGCATT TAACCGCCAC CCGCCAAGGT GGGTGGTTTA TTCTTCCGTT 120 

20 ATTTAAATTA GTACACCATG CAGATTCTGT AGTTGAGGGA TATTTTAACG AAAC-CTTATT 18C 

AGCAACTGAT AAAAAAATAC GTCCTAAGGC ATATATTGCT TCATGGAAGG ACATCGAGCC 24 0 

GGCTAAGAAA ATAGAATTTA AAATTAAAAA AGGTATTAAA TGGCATGATG GTAATGAATT 300 

GAAAATTGAT GATTGGATTT ATTCAATTGA AGTCTTAGCT AACAAGGACT ACGAAGGTGC 3 60 

TTATTATCCA AGTGTAGAAA ATATCCAAGG TGCGAAAGAT TATCATGAAG GAAAAACTGA 420 

T CAT ATT AG C GGATTGAAGA AAATAGATGA CTACACTATG CAGGTTACAT TTGATAAAAA 4 80 

ACAAGAAAAT TACTTAACAG GATTTATTAC TGGACCTTTA TTAAGTAAAA AATATTTATC 540 

AGATGTACCA ATTAAAGATT TAGCGAAATC AGATAAAATC CGAAAATATC CTATTGGTAT 6 00 

TGGACCGTAT AAAGTTAAGA AAATCGTTCC AGGTGAGGCT GTTCAACTCG TTAAATTTGA 66 0 

TGATTATTGG CAAGGTAAGC CTGCACTAGA CAAAATCAAT TTAAAAGTTA TTGATCAAGC 72 0 

GCAAATTATT AAGGCAATGG AAAAAGGCGA TATTGATGTT GCGAATGATG CTACCGGTGC 78 0 

40 AATGGCAAAA GATGCTAAGT CATCTAATGC TGGTCTCAAG GTATTATCTG CGCCAAGCTT 84 0 

AGACTACGGT TTAATAGGtT CGTATCTCAT GATTACGATA AAAAAGCTAA TAAAACTGGT 9 00 

AAAGTGAGAC CAAAATATGA AGACAAAGAA TTACGTAAAG CAATGCTTTA TGCAATTGAT 96 0 

AGAGAAaAAT GGATCAAAGC GTTTTTCAAT GGTTACGCTA GTGAAATCaA TAGTTTTGTA 102 0 

CCATCTATGC ATTGGATAGC AGCCAATCCT AAGGACCTAA ATGATTACAA ATATGATCCT 10 80 

GAAAAAGCTA AAAAAATCTT AGATAAGTTA GGTTATAAAG ATAGAGATGG TGACGGATTT 114 0 

AGAGAAGATC CTAAAGGTAA TAAATTTGAG ATTAACTTTA AACATAATTC AGGTTCTAAT 12 00 
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GAAATCAGCA TCAAGATAAT CATTGATGTT AACCACGGAC CTAAtCCTAA AGTGAAAATG 11220 

TTTAAAGTAT TAACGTCTCC ACCCATATTA GAAATAGCTA TTTTAAAAAA TGACTCATGT 112 BO 

TTTACTTGCA TATCGTTaTA GGAAACGATG GAAATGTTTG TGCCTAATAT ATAAATAaAC 1134 0 

AAGATAAAAC ATGTGTATAG CATACGTTTA TATATAATTT TATATTCGTA TTGTTGTAAA 11400 

AGTTTTAACA TGTTGCACCT CTTTTATATC AAAAACATTA AAAAGACTAA GGGTTCATCA 11460 

CTAATTATTA AAATCCTATA TCGATTTTTC TAGTGATTGG TGCCTCAGTC TTTTTAATTT 11520 

TAGCCAGCTA TAAATTCAAT TTATGCTTGA GAATCATCTT GATCATTTTC ATCTTTCTTT 11580 

TTCTTTCTCT TCATTAAACC TAAACCAACT AATAATGTCA TAACGCCACC TAGTAATCCA 11640 

TTTTGTTTTA TTGAGTCACC TGTATCTGGC AATCTTTTTT CACTTTGTGC TGGTGTGCCA 11700 

TTATGTTTAG TCACTTCAGA TGTTGCACTT AATGTAGACT GAGATTCACT CGTGCTCGTT 11760 

GTTGCTTCAC TTGATAAGCG AGATGTGCTC GTGCTGTGAG TATGATGCAT ACTCATTGAG 11820 

TCTGACGGAT GCATTGAGTT AGATTCAGAT GTACTTGTTG AGCCGGACAT ACTTGTTGAT 11880 

GTTGAGTCAG AAATGCTTTG TGAAC CAGAC ATAGATGTAC TCAGTGATTC GGATGTG CTT 11940 

25 GTCGAATCGG ATGTGCTCAA TGACGTTGAT GTGCTTGTTG ACACTGATTC TGAGTCACTA 12000 

ATTGATGTTG AGTCGGATTT GTCTTGTGAC ATTGAAACAC TCGATGAATT AGATTCACTC 12060 

ATTGATGTTG AGTCAGATAC GCTCGTTGAA CCTGAACCAG ACGTACTTAA TGATTCAGAT 12120 

ATGCTTGTTG AAGTTGAACC ACTTGTTGAG TCCGATGTAC TTGTCGATGT CGAGTCTGAA 12180 

TCTGATGTAC TCAATGATTC TGAGTCACTG ATAGAAGTTG AATCACTTGT AGATTCTGAT 12240 

TCTACTGTAC TTTGTGAACC ACTGATACTT ATTGAAGTAG AATCACTGAT ACTGTCTGAT 123 00 

GTTGATAATG ATGTCGACAC CGATGTGCTT TGTGATGACG ATGTACTAGC ACTCATTGAC 12360 

ATTGATGTTG ATATCGATGT ACTTAAGGAA CCAGATGCAC TTGTACTTGT TGACTGGCTT 124 20 

TGTGACATTG AATCACTTAA TGATGTAGAT GTGCTTGTTG AGCTCGAGTC ACTTACACTT 12480 

GTTGAACCTG ATATTGAGTC ACTTAAACTT GTCGATGTTG AAACTGAtwC GcTTCCGCTC 12540 

ATTGAGTCAG ATGTTGAAAG TGATGTACTC GTTGAATTTG ATCCACTGAT GCTAGACGAA 12600 

TCACTTGTAG ACATTGAGTC GCTTTCTGAT GCACTGATGC TCATAGAGTC AAATTGACTA 12660 

TTACTTGTTG AGCTTGACTG CGAATCGCTC ACACTTGTTG ACGTTGATTC TGATCCACTC 12720 

ATACTTTG CG AGCTACTCAA TGATTTTGAA TCACTTAATG AATCCGAAGT GCTAAGACTT 12 7 BO 

GTGGAACCAC TTAAAGATAT TGATCCACTT AATGAGTCGG AGTCACTTGT ACTAGTAGAA 12840 

TCACTCATTG ATATTGAATC ACTTAGcGAG GTAGACTyGc tTACGCTTTC TGAACCACTT 12 900 
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TTAAATGGGC ATTCTTTACG ATAGATTGAT ATT CCT CATC TGACACAGTT TCATTTCTAT 94 2 0 

TTTTAAAAAA TGAATAACTT AATGATTTCG CTGGAATATG ATTGGCTATT TGTCGATTGT 94 80 

5 G CCT AG CATC TGAAGCCACA ATCACATGAT CATCTTCATG TATTTGTTGT GCAATCATTG 954 0 

CTTGAAATTT TTCTTCAATT AGTTGAGCCA TATTGTTATA TTCTGTTTGT TGATAGTGAT 96 0 0 

GTTGATATCT TTTTGAAACA GTGACTCTGC CATTTTTCAA ATCTTCATGA AGTACACAAT 966 0 

w 

CTCCATTAAT CGTTAAATAT TCTTGGTAAG AAGCCTCTCC CTGATCATCA AAATAACGTA 972 0 

TCGCTGATAA ATAACCTCTG TCATCAAAAA TATAACGCCG TTGTAACTGA TCTCTTTCAA 9780 

/5 ATTCTTCAAA CCAAATTGAA TACCCTTCTT GACTAAAATA AATATTTGTA TAGGTCTGTT 984 0 

CACTCGTCAC ACATTTTAAT AAATACGG TG TGTACACAAA CTCAACATCA TCCGGCCATT 9 90 0 

TTAAGTGATG ATAATTAATC GCTTGTGGCG CATGOTGACT GAATCCTTGA ATTTCATCAA 9960 

20 ACACAGACGA ATACTTTGTC TCATATAAGT CATATCGATG TAAAAATGTT rTTAAATTTG 10 020 

GTGCATGATT GAGAACAATC AGTTGATAAT CTAAGTCATT TTCAAGGTGC ATTCCCATTA 10 080 

AACTAATCAT ATCGTCAAAT TCCGTCTTAT TTTGTAGTTG ATAATACGGC ACAGTCGTGT 1014 0 

25 CTTGCCACCA TCGTTGGTCA TCGTACCAAG CTGGAATAAA GTATTTCATA ATTACCTCCT 102 00 

TACCAATACT GGTTTAAAAA TGGCTTATAT TTATCAAAAT ATAAATATGT ACGAATTGTT 10260 

TCTGCAATAT TAATACTGAT GTAAACTAAT ACAATCAGTT GTACTGAGaA ATAAATTTCA 10 320 

30 

GTAGATAAAT G CGGTACAAA CAATGTGAAA TAAAGCGGTA TACCAATAAT GACTGTAACT 10 3 80 

AATGCCAATC CAAACCAACA TACGCGTCGT GCTTGATAAT TTAAATAACG TTCTGTATCC 10440 

35 TTACCAGGTT TaACTCCTGA AAAATAATTG CCACTCTTTA AGAAATCTTT GGATTTTTGT 105 00 

TTAGTATTGA TTAAAAATCT CGATAAAAAA TAACCCAATA ACATTTGAAT CACTAAATAT 10 560 

ACTGAAATAC CTACTGGACT ATCAAATGTC AGCATTGGCA TGTCATCTGA TATGCTTTTA 10620 

40 TTAAACATAG ATAAAATAAA ATGAATGCCA CTTTTTAAGA AAACAAAAGC TGAAATACTC 106 80 

ATCATTAAAG TAATACTGCC TGCAGGGTTA ACTTTCCAAG ATAAATAAGA TTTCATATTT 10 740 

GTTGCGGAAA CGTTCATTAA ATCGATATAT GGTATTCTCA CTTCTACTAA TTCAATAAAT 10 8 00 

45 

AATAAGATAA ACAATGTGAT TATCACAAGG ATGATTAACA ACGCAATCAC AATATGACTT 10 8 60 

GCATCTATAT ATTCCATTTT TTGATGCATC ATTGATTTAA TAATACTAAC CATTACAATC 10 920 

GGCATTGGTC CTGCGATGCC GTAGCGACTA TTTTTGTCAG CTAACCAAAC TAATAACATC 10 9 80 

50 

GTTCCAGTAA CCAAAATCAA TATTGTTAAG TAAATATTGT CTTGATGAAC ACGTTCTTTC 11C40 
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TAATTCATGT GATTTAAAAT TCAGCTTTTC TAATGTCTCG TCAATAACAT TGATAATACC 7620 

TTGTTCATAT TCAGATGAAC CGATATAAAA ACTACCACCT TCAACACGAG GATCGCCGAT 7680 

AAGTAAAAAC GGTGCATTCA TACGTTTCAT CATATAATAT CCTTCGAAAC CTTCCGCTGT 774 0 

TCGATAACCA CTAAAATATA CGTTTAGTGG CGGTTTCATA TCACCAGGGT GGAAATAATA 7800 

AATAAATTCC TGTCGTTGAC TATCTACGAA ACGACTACCA CCAAGTAAAA ATTGACCCAT 786 0 

GTCTAATCTA GACCATCGTT TGTGTATAGG TCCTAAATGT ACCGTCCCGT TCCCACGCGC 7 920 

CTTAACAGTT ACACTTATAT AAGCATCAAA TGGTTTCGCA GGTATCTCTA AAGGACTGTC 7980 

15 TAACATATCA TCAGTCAATA CGATTTGTTC AATTAATGCA CCATCAGCGC CAGTCTGAAT 8040 

CAATCTAAAT GTATATTGCA ACTCGACCGC ACCATCAATA TCAAATTCTG GCCATATTTG 8100 

AATGACTTTA TCTTTATCGT AAACGAGATT ATTTTGCCAA GATGCGATAG GTTTAAATTC 8160 

20 TTTCCCAAAT TCTCCACTCA ATGTGAGCTC TGAATTACCT TGGTAAACGA CATCTCCTTT 8220 

AAAATTCGGA TGCACAAGTG CTAACTTAGG AGAAACCTTA TCTCCATACT GTCCTGAGAA 82 80 

GCTAACTGCC TCTAATTTAT TATTACGTTC TTCAATATTC CGGTAATGTA ATGGTTGAAC 8340 

AACGTATTTT TGGACATTTT CGTCTTGTTC ATATTCAACT GACCAAAATG ATTCATCAAC 84 00 

ATACGTATTG TATGGTTCGC TTATCATTTG TAATAAATTC GTTAATGTCT CCGAGTATGG 84 6 0 

TGCTTGAATA TAGATAAAAT CAAAGCGCCC TTCTGCTTCA ACAATCGCTT CAATAGCCTC 8520 

TACATAACCA CTATCAAATT CAAACAATCC AATATCGAAG TAATCCCAAC TCACACCTTT 8580 

TTTGTGTTGA AAAATAGGTT CTAAATCGTC TCCTCCAATT TGCAAAACTC TAAATTTACG 864 0 

TGGCATCATT TTCACCTTCT ATTAACTCAT CGAGCTGATT AATAATATTC TTAGAAGCAT 8700 

ATGCATCTAT TAATTTTAAA GAATAGGCGT ACGCATAATT CCAATTTTTC AAATAAAATA 8760 

AATAATAATT TAACGCATCA TCTAATTCAT CAACTGTATT TATAATACGO CCATTGTCAT 882 0 

40 AATCAGAGAC GTAATCTGTT TGTTGACCAT TAATTTGTGG AATCCCAGCG CTAATTGCAC 88 8 0 

TAATTTGTAA ATACAAGTCA GGTTCTTTTG ACATATCTAT CACAAGTCGC AACGTCCGCA 8 94 0 

ATGCTTCTAC AACATCATGT TCAGCATGTA TCGTCTTAAC AGCAATGATG TCATCTTGAT 9000 

CTTCAGGTGT CATTAATGCT GAAACATTAA CATCCGCATT CTGTTTAGCT TGGTATTCCT 906 0 

CATTTACCGA CGTAATACAT TCACGAAGCC ACATCGGTAT GTCATTTTGA TGGCGCGATA 912 0 

ATAAAATTAA ACGGTAATAA TCTTCCTGTG CGATATAATC CACAAGTCGT TGCATCATTT 918 0 

GTTGCAAATC AGCGTCACTC ATACCATCTA TCCATACACC TATAAATGTT TCCATCAATT 924 0 

GACTACTTAT ATTAGGTGAT TGTCTCGTTT CAAATGGTGT GATTCGAATC ATTGTATTCT 93 00 
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TAATCATTCG TTGTAATTAA ATATGTTCCT TTTCCCGAAA GAGCATTTAA ATATAAAGGC 5 82 0 

ATCGTTGCCG TTAATGTTTT ACCTTCGCCT GTTTGCATCT CCGCAATGTT ACCTTCATGC 58 80 

5 

AATACAATCG CTCCGATTAA CTGAACTTCT TTAGGATACA TACCTAATAC TCTCCAGCTC 5 94 0 

GCTTCACGTG CCACTGCATA AGCTTCAGGT AACAATGTAT CTAGTGTATC AACTCCTGAT 6 0 00 

GCTAAACGTT CTTTAAATTC TATTGTCTTT TGTTTTAACG CATCATCAGA ATATGATTTA 606 0 

10 

ACTTCATCGC TCCATGTATT GaTGsGTTcA CTATTTTTCT AATCGACTTT AGTCTTAATT 6120 

CGTTTATCGT AACATCTAGT TTATGTTTCA TTTACTTCCC CACCATTCAG TTTCGATACA 6180 

15 TCTAAGTAAT CTAAAAATCG TACTGGATTC ATTAAACGTG ACATATAATT TAGATGTTTG 624 0 

TCTTGCTCTT CTTTAAAATA AACCTCGACA TTTGTATCTT TTAGTTCATG ATTTCCTGGG 63 00 

ACATGTTCTG TAAGCCATCC TTTTAAATCA TCATCTTCAT GGCTTGTACG ATACACTTTG 63 6 0 

20 CAACCCAAAT GCTGAGCGAC ATAAGTTGCA AAAACATTTG ACTTTGACCC ATAACT AAT C 54 20 

AAATTAATAG CCTTTAGGGT ATCTTGACTT TGCAAATCAT TCTTTAGTTG CTTAATATTT 64 80 

CCCTCGATAT TGTCGTCCAT CCAACGTTCA ACGAGCCAAA CATGACCAAA CAGTTTCAAA 6 54 0 

25 

AAATCATTCG AAATAGTTGG ATAGGTGTCA GATGGTTCTG CAATAATGAC ATTG AT CAT A 66 0 0 

TCATTTCCAT ATTGGTCATC GCCTATCTTC GTCACCCGCA TGCTTTTATA CTCTAAATCA 66 60 

TATTGATGCG TCATCTCTGT GATTGTTAAA CATCTAAATA TAAGACTCGT CGATGCTGCA 672 0 

30 

TTCATCATTT TTATTTTATA AGCATAGGCk TCATCAGGAT ATTGAATCGT AATACTATTT 678 0 

GACTTTACAA TCTCAGTACT TAGTTTTGTG CCATTTTTAT TATAAAAAAT GATGATAAAA 684 0 

35 TACACTGAAC CAGCAGGCGT TGCATCAAAA TCAAAATGCA ATTTATAATG CTGTCCTCTA 6 900 

CGCAAAATTG GkAAACTTGG CGcACTTTTA TATTTTGAAA ATTGCTTTAA CATCAACCAC 6 960 

TCATGAATCG GTAATCCAGA GGGCATCAAA GGATTTATAA AAGTCACTTC ACCATTTGAA 7020 

40 AATGATACTT TAGAGCCATA CATAAATGTA GTTTGTGAAA TATAATTCCA AGTAACTTTA 70 8 0 

AATGTTTTGT TTTTCAGCAT GTTGAACTCT CCCAAACTTG TCTTCCAAAA TAATGTTGTA 714 0 

AAAATTAACA AACCAACTTG CAATGGTAGG TGAATCATCA TTATGTCGCC CAGGAATACT 72 0 0 

45 

GCGATTCATC ACTCTTGCTT GGTGTGCTGT CAATACAGGT AATAGCTCTT GAAATG CATG 72 6 0 

TGGATCATAA T CAT CATG TT GCATATATGC TATGGCAAAA ACAGTTTGTG ACAATGATTy 73 20 

CTTTTGAAAT GTTTGCCAAA ATTTTTGATT TAATGCCTGT ATCGACGCTT GAGATGTATC 73 80 

50 

ACCTTCATTA GACACCAGGA CGTCTAATGC TGTACCGAAC TCTTCTGGTC TAAGTAATCG 744 0 
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GACTTGTTCT AACCAACATG AATCAATTGC TTTCAAAAAG ACTTTTTGAA CGAAAATATT 4020 

ATAATAATAT GCACTTTGCA TGTTTTTACG ATTCAAAGCT AATTGCTTTT CAAATTGCTC 4 080 

TAATAAAAAT GTCACTACTG CTTGCTTATC TTTAAAATTA ACACAAGCCA CATCTTTATT 4140 

AAATTGGAAA CTTAAATTTT GATAAATATA CTCGACAACA CGCGATTTTG TTAGCACCTT 4200 

TTCCTCATTT ACAAACATTT CAAATACATC TTTAGCTAAC GCTTTAAAAT CTTGATTCTC 4260 

AGCATCATCT ATTTCTAAAA CTCGATTGCG TTCCTCGTAT ACAAGATCTC GCTGTATACT 4320 

AATGCTTTTT TCAAATTCAT TAGCCATTTC ACGAGCTTTA ACCCCTTGTT CTTCCGAGAt 4 380 

aCGcTGCGCT TTAACTACAA TTTGCTTAAC TTTGCGATTA AACAAATTAC TTTGCGATAA 444 0 

TCGTTGTGCA TCTAATGAAT ATAATTGATT ATTTTCCGCT AAATTACTAT CGCTCCATCG 4 500 

CTTAACTAAA TAATCATCTA GTGAAATATA TATACAAGAT GATCCCGGAT CCCCTTGTCT 4560 

ACCAGAACGA CCACGTAATT GCCTGTCTAC ACGGCTATTT TCCATATGTT CATGAATAAT 4620 

AACAG CTAAT CCACCTAATG CTTCGACACC TTCACCAAGT TTAATATCTG TGCCTCGACC 4680 

TGCCATACTA GTCGCAACAG TCATGGAACC AATTTGCCCT GCTTCAGCTA TCATCTGCGC 4 74 0 

TTCTTTTGCA ACATTTTGCG CAATGAGTAA ATTATTAGGA ATATC CATTT GGAATAATAC 4 800 

TTTCGAAAAG TATTCAGCCG CTTCAGCAGT TCTCGTTATG AGTAAAACCG GTCGCCCCGT 4 860 

30 TTCATGAAGT TCAACTATAT CATGAATCAT CGCGATGTTT TTCTCATCAA CTGAACGAAA 4 92 0 

CACTTTATCT GGTTCATCGA TACGTTGAAT CGCTTTATCA GTTGGTACTT GTACGACTAT 4 980 

TTTTGAATAC AAATCAAAGA ACTCTGATTC GCCTAATTTT CCTGTAGCTG TCATACCTGA 504 0 

AAATGATTCA AAAAGTTTAA ATAAATTCTG GAAGGTAATT GTTGCCATAA CACTTTTATC 510 0 

TGTTGAAACC TCCATACCTT CTTTCGCTTC AATAGCTTGG TGAAGTCCAG CTTGCAACTT 5160 

AGTTCCCGGT AACATACGAC CTGTAATACG GTCAATTAAA ACAATATCAC CATTATATAC 5220 

AAAGTAATCG ACATTAGATT CAAACAAATA TTGTGCGCGC AGTGCTAAAT TAATATTACG 52 80 

CACTAGGACC ATCGCTTGTT CGCTATATAA ATCTTCAACA TTAAAGTATG ATTGTGCCGC 534 0 

TTCAATACCT TGATTTAACA GCCATATTTC TTTTTTGGTC TTCTT CATTT TAAAATGCAC 54 00 

GTCTTCAATC AATGTATCTA CAAACTCTTT CACAATATGA AATAGATTTG ATTGTAATCT 5460 

TGGTGCACCC GAAATAACTA ATGGTGTTTG AGCAGCATCT AAAATGATTG AATCCACTTC 552 0 

ATCAATAATA CCGTAATTTA ATTGTGGTAA AAATTTCCCT TCCGCACTAT CAGCCAAATT 558 0 

ATCAATTAAA TAATCAAAAC CGAGACGTCC ATTAGTTGTA TATATAATAT CATGTTCATA 564 0 

TATATTACGT TTTTCCCCTT TTTGATACTC ATAATCCACA ATATCAACAA AACCTAATGA 5700 
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TAAATCTTCT TCCAACTTAC TGGCTTTAAA AGACTCATAT AACTTTCGTG AATGATCGTT 22 20 

AAAGTAATCA AATAATTTAA TCATGTAGCA CCTCTTGaAC TAATGTTTCC CATTTTAAAA 22 8 0 

5 

TAATATCTTG AGTCATAAAT TGCTGTGCCA CTTCATAAGA GATGTCATGT GGTGTCTGGG 23 4 0 

GACCATTGTT AAAATACATT ACAATGGcAT GAGCTAGTTT TGCGATAACA TCATCCACAC 24 00 

w TATCTTCGTC GGTATCAAAA GGTACCAAGT AGCCATTTTC CCCATCTCGA ATAAAGGTTG 24 60 

GG TTACCATA ATTCACATTT AATCCAATCA TACCTAGTCC TGAGCCTACC GCTTCCATTA 252 0 

GTGTTAACCC AAAACCTTCG CTAGTTGATG CAGAAAGAAA TAACTCATAA TCATTATAAA 25 8 0 

15 TTTCATCAAG TTTAACATGC CCTAGTAAAC GAATATAATC TTGTGCGCGG TGTGTATCAA 2 64 0 

TAATTTTACG CAGTCGCGTC TTCTCGCTAC CTTCTCCATA AATATCAAAT GTTAATTCTG 2700 

GCACTTGTCG TTTAGCCACG ATAACCGCTT TGACAAGCCA ATCAATATGT TTCTCATTCG 2760 

20 

CTAAACGAGA TGCACTAATC ATCGCATATG GC-lTTCTTGA TAATGTTGGA TATGATAATG 2 82 0 

CATCAATGCT TCCCACAGGA ATAGTATAAA CACGTGGGCG ATAACCTTGA TATTGCTCAA 2 880 

ATTGTCGACA AACCATATGA TTTTGAATAT CTGTTGCTGT AATAAAGAAA TCAATGTATT 2 94 0 

25 

TAGCTTTTGA AAATTGATAT T CAT AAT AAT TGTTCCATAG TATATGCTGC TCACTCATCA 3 000 

TATTATTACT ATAATGATCA GCATGAATCA CAACACCAAC TTTACTATCA CCTTTATGCT 3 06 0 

30 GCAAAACAGC CTGACCAATA TCAGAAGCGC GGTCTAATAT GACAATATCG TCTCGGGTTA 312 0 

AATTCAATCG TTGTAAAAAG TATGCAATAA ATTCCGTTTT GTTATACAAC ACCGCATCTT 318 0 

CAAACACATA TATAGAGCTG TCTCCATCAA TATATTCGTT ATAAGCGATG GAACCATCTT 3 24 0 

35 GATTATAAAA TTGTCGCATA TATAATTTCG CTTTATTATC AGCTGGTGCA TAATACTCAG 3 3 00 

AAAATATGCG CGTATAACTA TAAAAATCTT TACGTACTAA CATACTATTA ATTACAAATT 3 36 0 

CTGCACGATC CACAATATCT TTTTGTTCAT TTTGCAGATA ACATGTTACA AATGATGATT 34 20 

40 

TCCCATTAAA ATATAGGCGG ACTATCTTAC CATTTCTTTC TCTAAAACTA ATGTCATGAC 34 8 0 

CAAGCTCACG TTCAATGTCA TCTAACGTGT ACGTTGTTGG TGCTAAAGAA ATATCACTAA 3 54 0 

45 AATACTGATA CAACCAAATA ACTTCTTGAT CTTTAAACCC AATGTTTTGC GTTAATGTCT 3 60 0 

GTATGTTCTC TGACTGTATA AAATCTAAAA ACACAAATTT AGTGTCTTGA TTTGTACGTC 3 66 0 

TCAATAATTT AGCACGGTAA GCTTGTGCAT ATTCAACACC GCTACTCGCC CAGCCTATAC 3 72 0 

50 CAAAGTTTAT ATTATATATT GTCATGCGCT ACCCCTTTTC ATTTATGGAA AATGTATAAC 3 78 0 

TGGCATACCC TCTTTATCAA ATGTAATCAT ^TTTHA^AA ATATTTT~~A „,_^ T ,^^_^ , ^ - - 
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TTCATCTATT TGCGATGCTG TTTTCGCTTC ATTTAGTTGT GCTTTATAAT GTTCTTTAGA 4 20 

TGAAGCCGAT AACTGTTTTA ATTGCTCAAT TTGACGAATT GCCTTGTCAA CTTTGTCTAA 4 80 

TAAATCTTGC TTAGATAATA TCTCTTTTGT AATTTCAGTA TCCTTTTCAG ATGCAGCTTG 54 0 

GGCATCGTAC GGCAAGATAT TCGTTAAAAT GATACTTGTC GCCATCATTG TCGAACACGA 600 

TAACTTTACA TATAATTGAA ACGGTTTCCC TCGATATTTA GCCATCAACA TACTCCTTCC 6 60 

TCACTTACTT CCTTCAAAGA ATTACATACT ATTATATACC TGTTTACAAG AAATTTACAC 720 

TTATCTATCT AGTTATTGTT GTTAGTAATT ATCTACTTAT TACTTAGCTT ATATTTAAGT 78 0 

AAACAAAACA AGCATGACGT AATATCATAT TGTCCATGTC GCTAACATCA TATTACGTCA 84 0 

AATCTTTTAT ATTAAATGAT GTTTTATTTT AGACTGCTTT TTCCTTTTAG CTTTCGAGCG 900 

CCTGTTTAAA AACTTGCTCG AATTGTTCAC GCGAGATTTC GTGTGCATGT GCTTTTTGTG 960 

CTAATAAAGC ATCTCGAAAC TGTTGTTGAT CTTTCAAACT TTCTAACATT TGTATTAATT 1020 

GGTCTTTACT TTCCATTGTT ATCTCATCAT TATGCTCAAA TAAGTGCTCT GATAATGTTA 10 80 

CTTTAGCATG GTGTGCGGTT TGACGATAAC CTAAAATCAA CAACTCATAG TCAAACGCTT 114 0 

GTTCCACCGC ATTTAAAATT TCATTACCCT CATTGATATC AAGATAAATA TCACATAACT 12 00 

GGTATAGTTC ATTTACCCTG TCAATATTAA TAGATGGGTA TAAATGCACA TTAGCATATT 1260 

30 GATCAAGTTG CATTAGCTTA TCAGACATCT CTGTAATAGC AGCGATGTGn AACTTAAAAT 1320 

CTGGTAAAGt TyCAACCAAT ACCTTGATGT TACGAatTGa TCCgAGTTAG TTAATATTAC 13 80 

AATTTCTTTA GTATATCTAT TACGACTACG ATAGTTATAT AGATATCCGC CTTGTAAAAT 144 0 

ACGAGATTGA ACCTTTGCGT CTGCTATATT GAGCATCGTT TCATATTCGT TTTTATCTGG 1500 

AATAATAATA TTACAATGTC GTTTCATATC ACCTTTACAC ATCAATTGCA TATTTCCCGG 156 0 

GACATTACCA TTACAGTGTT CTTGCCATAC CAAAACATCA CTACCTTTTG ATGGCAAATT 1620 

ATATAACACT GAAAATGGTA GGGCTAGTGA GTTAATAACG AAATGATGTT CCGTAATTTC 1680 

AAGTTGCTTG ATAAAAAATA ATGCGAATGC GAGCTTTGAA GGGAAAAAGT AAGACTTCCC 174 0 

TTGCCAATCC AATATGACAT CAGATGTTAC AAAATTTTCA TAAATCACTT CTTTACCTTC 1800 

TGCTGTCATA TATTTCTTCA AGATCGCTTT ACGATTTAAA TCGTAAACAG TTTGTGCAAA 1860 

TTTAATACCA TTCTTAGAAT AATAATCGAC AAATCGGACA CGTTGTTGGT CATCAAACCA 1920 

50 TTCGACACGA CTAACAATTC TAGGGCGCTC TCCACTTTGA yAAAATATTT TACCTCGyAG 1980 

ACGTCCCATA TCATTaATTG TAGCCGAATT GTTGTTACCT TTAATTTCCC AAAAAGCTGG 204 0 

TACAGTAACC TGATTAAAAA ATCGTGGTTT CATATTTTCT GTATTATGAT TATCTGCAAA 2100 
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AGTGATTTCG CCAGATTCAA AATCAGGGTT ATCATCTTGA ACTACAGCTT GGTGAATATT 5 34 0 

TGGATCAAAT GCTTCACCTT CAGTTTTAAT AACTTCAAGA CCATTATCTT TTAGTGCGTT 54 0 0 

5 

AATCAAACTT TCATGCACCA TTTGTACACC TTTTTGAAGA GATTTAAAAG TCTCATCATC 54 6 0 

ACCTTCAATT TGAAGTGCAC GTTCTATATT GTCTATTGCT GGTAAAATAT CTGTTAACAC 5 520 

10 ACGTTGTGCT TGATATGTTT TGTTTATTTC ATTTTCTTTT TGAATTCTAC GCTTATAATT 5580 

TTCAAACTCA GCGTAGAGCC TTAAATATTT CTCTTCGTTT TCATCTGCTA ATTGTTGAAG 564 0 

TTCATTAATT TTTTGATCTT TTGGATCTAT TTCTTCAATA ACATTCTCGT CAGACGTTTC 5700 

15 TTCTATTGCT TCATCTTGTA AATGACCTTT ACTTTCTTCA GCTTGTTCAA CTGAATCATC 5760 

AATATTTTGT TTGACGTTTG TTTCTTCAAC TGTTGATTCA GTGTTTTTTT CAACTGATTC 582 0 

GTCTTTATTT GTCATTTTCT GTCCTCCAAT ACTTTCTAAT CCATCATTAC CAAATTCTAT 588 0 

20 

TTAATAATTG AATGACATTT TGATAATGCA TAGCTGTAGG TCCAATCACA GCGATTTGAC 594 0 

CTTTTAACGT TTCATCAAAA TGATATTGAC TTGTTACAAT TGAAATATCA CTTAAGCTGT 6000 

CATCAATTTC ATTACCAATT TTTACATTAA TATTTGGTGA AGATATATCT TGTAATAATT 6060 

25 

CTGCAATTCT ATTTGATTCT ATATATTGTA GAATGGGCTG AATTGAAGAT ACATTACTTT 6120 

CATTCAATGC ATCAATAAGT TTAACCTTTC CACCCATATA AATGCTATTA CTTTGATTAG 6180 

30 AAATATGATT ATTCATCGTA TTTAAGAATT TATTGATAAA AATTTCTTCC TGCTCTGATT 624 0 

GAaCAAAAGA GACAATATCA TCTTGTAAAT TCTGATTAAA CTCAGTTAGT TTGTTTGTAA 63 0 0 

CAAAATTTGA TATTGTATTT AGTTTGTCAT TATTAA 63 36 

35 (2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13059 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

4b 

TTCATGATTA TTATCTGTTG TAGACACTGC TGGATCTTCC GATGTATCTT TCGATGCATC 6 0 

TTTCGATTTG TGTATTTGCT GATTCAAATG GTCTAGGTCT TCTAACGCCT TATTTACCAT 12 C 

50 TGCTTCATCA TTTTTATCAT CTTTTTCTCC ATGTTTTGTT GTAGCCGTTT GTGACATATC 18 0 

ATTTTTCATT ATT AAOA^ "^^rpTr^^^ A rwT T! , -T' T, -. n rmryfrT^frb ^ n ^^^.^ ^ Tx » - « 
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TATCTTCTAT 


ATCTTGACCT 


TCTAAAGCAG 


TTTTAAGAGC 


GTCTTTTTTC 


TCTTCAGCAG 


3540 


ATTTTTTATC 


TTCTTCACCG 


ATATTTTCGC 


CTAAATCAGT 


TAAAGTTTTT 


TCAACTTGGA 


3600 


ATACTAGACT 


GTCAGCTTCG 


TTTCTTAAGT 


CTACTTCTTC 


ACGACGTTTT 


TTATCTGCTT 


3660 


CAGCGTTAAC 


TTCAGCATCT 


TTTACCATAC 


GGTCGATTTC 


TTCGTCTGAT 


AATGAAGAAC 


3720 


TTGATTGAAT 


TGTAATTCTT 


TGTTCTTTAT 


TTGTACCTAA 


GTCTTTTGCA 


GTTACATTTA 


3780 


CAATACCGTT 


TTTATCGATA 


TCAAACGTTA 


CTTCAATTTG 


AGGTTTACCA 


CGTTCAGCTG 


3840 


GTGGAATATC 


AGTCAATTGG 


AATCTACCAA 


GTGTTTTATT 


ATCCGCAGCC 


ATTGGACGTT 


3900 


CACCTTGTAA 


TACGTGTACA 


TCTACTGATG 


GTTGATTATC 


TACTGCTGTT 


GAATAGATTT 


3960 


GAGATTTAGA 


TGTAGGAATC 


GTAGTGTTAC 


GTTCAATTAA 


CGTATTCATA 


CGTCCACCTA 


4020 


AAATTTCAAT 


ACCTAAAGAT 


AGTGGTGTTA 


CGTCTAATAA 


TACTACGTCT 


TTAACGTCAC 


4080 


CTGTGATAAC 


GCCACCTTGG 


ATTGCAGCTC 


CCATTGCCAC 


TACTTCGTCC 


GGGTTTACTC 


4140 


CTTTGTTAGG 


CTCTTTACCG 


ATTTCTTTTT 


TGACAGCTTC 


TTGTACTGCT 


GGAATACGAG 


4200 


TTGATCCACC 


AACTAAGATA 


ACTTCATCGA 


TATCTGAGTT 


TGTTAAGCCA 


GCGTCTTTCA 


4260 


TTGCTTGGCG 


TGTAGGTTCC 


ATTGTTCTTC 


TAATTAATGA 


ATCTGATAAT 


TCTTCAAATT 


4320 


TAGAACGAGT 


TAAGTTTACT 


TCTAAGTGTA 


ATGGACCGTT 


TTCACCAGCT 


GAGATAAATG 


4380 


GTAATGAGAT 


TTGAGTTTGT 


GATACACCTG 


ATAAGTCTTT 


TTTAGCTTTT 


TCAGCAGCAT 


4440 


CTTTCAAACG 


TTGTAATGCC 


ATTTTATCTT 


GAGATAAGTC 


TACGCCATTT 


TCTriTITGA 


4500 


ATTCTGCAAC 


TAGGTAGTCA 


ATAATTACTT 


GGTCAAAATC 


ATCACCGCCA 


AGTTTGTTGT 


4560 


CACCGGCTGT 


TGATAGTACT 


TCGAATACAC 


CGTCACCTAA 


TTCTAGGATA 


GATACGTCAA 


4620 


ATGTACCGCC 


ACCTAAGTCA 


AAAACAAGAA 


CTTTTTCATC 


TTTATCAGTT 


TTGTCTAAAC 


4680 


CATATGCTAA 


TGCTGCAGCT 


GTTGGTTCAT 


TAATGATACG 


CTCAACTTCT 


AAACCAGCAA 


4740 


TTTTACCAGC 


ATCTTTAGTT 


GCTTGACGTT 


CAGCATCGTT 


AAAGTATGCA 


GGTACTGTAA 


4800 


TTACAGCTTT 


GTCAACTTTC TCACCTAAaA TAgTTTCAGC 


tgtatttttt 


AAGTTTTGTA 


4860 


AAATCATAGC 


TGAGATTTCT 


TGTGGTGTGT 


ATGATTTACC 


TTCAATATCT 


ACTTTATAAT 


4920 


CAGTACCCAT 


ATGACGTTTA 


ATAGATTGAA 


CAGTGTTTGG 


GTTTGTAATA 


GCTTGACGTT 


4980 


TTGCTACTTC 


aCCAACTTGA 


GTTTCTCCAT 


TTTTGAAAGC 


TACAACAGAT 


GGTGTTGTAC 


5040 


GTGAACcTTC 


AGGGTTTTGA 


ATTACTTTTG 


GCTCATCGCC 


TTCTAATAcT 


GTnACACATG 


5100 


AATTTGTTGT 


ACCTAAGTCT 


ATACCAATAA 


TTTTACTCAT 


AATAAAATTC 


CTCCATTTAA 


5160 


TCATTAAATT 


AATTTAATTT 


TAAACAATGT 


CTTTTCGCCA 


AATTTAAGTT 


ATTGGTTTAC 


5220 
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GGAAATAGTT TTTCCATTCA TTTTCCCAAT CCGTCTCTGC AATAATTTGC TCACTGAATT 174 0 

GAACGTTATG TTGATCAAGT TCATCTAAAT TTAATAACTC ATCTTTAATT TGCTGTCGCA 180 0 

5 

ACTTATCATC ATAAGTCATT TCATTAAAAT AGGCTTTCAA TCTTACTCCC TTATCTGGAT I86 0 

AATCCTCTTT TTTCAAAGCG TAAATTTCAC CGTATTTATC TTCTGGTTGG TTAATTAAAT 192 0 

CATCTGAATC TTCTATCACG ACACCATTTG ATCCATGATT TTCAAGTATA TTGGTAGCCA 198 0 

w 

ATTCTACTGC TTCATGATTA ATAATAATTG AAAGCTCTGT CCAGTTCATA CTTTATTCTC 204 0 

CCTTAAAGAA TCTTTTTGCT CTATCTTTAA AATTCGAAGG TTGTTCATTA ATTTCTTCAC 210 0 

15 CATTTAATTG GGCAAATTCT TTCATTAGTT CTTTTTGTCT ATCTGTTAAT TTAGTAGGCG 216 0 

TTACTACTTT AATATCAACA TATAAATCTC CGTATCCATA GCCATGAACA TTTTTTATAC 222 0 

CCTTTTCTTT TAAGCGGAAT TGCTTACCTG TTTGTGTACC AGCAGGGATT GTTAACATAA 228 0 

on 

CTTCATTATT TAATGTTGGT ATTTTTATTT CATCGCCTAA AG CTG CTTGT GGGAAGCTAA 234 0 

CATTTAATTT GTAATAAATA TCATCACCAT CACGTTTAAA TGTTTCAGAT GGTTTAACTC 24 00 

TAAATACTAC GTATAAATCA CCAGCAGGTC CTCCATTCAC GCCTGGAGAG CCTTCACCAG 246 0 

25 

CTAATCTAAT TTGTTGTTCA TTGTCGACAC CTTCAGGTAC TTTCACTTCT AATTTAACTG 2 52 0 

TTTTATTTTC AGTACCTTTT CCGTGACATG TTGGACAAGC TTCTTCAAAT TCTTGACCAC 258 0 

3Q TTCCATTACA TTTAGGACAA ACTTGTTCAG TACGAACTCT ACCTAAAATT GTGTTTTGTT 264 0 

CTACAGCTAC ATGACCAGCG CCATTACAGT AACTACAAGT CTTTTTACTT GTTCCAGGCT 2700 

TTGCACCATC ACCATGACAT GTTTCGCATG TTACATCTTT ACGGATTGAA ATTTCTTTTG 2 76 0 

35 TTGTACCAAA TACCGCTTCT TCAAATGTTA ATGTCATTGT ATACTGAAGA TCATCACCTT 282 0 

TTTGCGGTGC ATTTGGATCT CTTTGTCTGC CGCCACCGAA GAAAGAGCTA AAGATATCTT 2 88 0 

CAAA5CCGCC GCCACCGAAG CCACTAAAAC CGCCAAAGTC AGAGCCATTG AATCCTTGTC 2 94 0 

40 

CACCAAAACC TTGTGGACCA TCATGTCCAA ATTGATCATA GcTTGCGCGT TTATTATCAT 3 000 

CACTTAAAAC TTCATAGGCT TCAGAAATTT CTTTAAACTT TTCATCTGCA CCTTCTTCTT 3 06D 

TGTTAATATC TGGATGATAT TTTTTCGAAA GCTTTCGATA CGCTTTTTTG ATTTCATCTT 312 0 

45 

TTGAAGCATC CTTACTAATG CCTAAAACTT CATAATAATC TCTTTTGGCC ACAGCTATCT 318 0 

CTCCTTTTCT TAATTAACTC ATATAGTTTA ACGTAATATG TCATACTATC CAAATAAAAA 3 24 0 

50 GCCAAAGCCA ATGTTCTATT GACTTTGACT TTTCAGATCA TGACAACATT CTAATTGTAT 3 3 00 

TGTTTAATTA TTTTTTGTCG TCGTCTTTTA CTTCTTTAAA TTCAGCATCT TCTACAGTAC ?3<^ 
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10 



15 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

GTCAATGTAA CCTAATAGTT TATGTCTATC TTGTGTACCA ACTACTACAT CGACACCAGG 60 

AATTTCCATA ATTTCAGCTG ATGAAGTTTG CGCATAACAA CCTGTTACAC AGATTACAGC 120 

ATCAGGATTT TGTCTTATTG CACGTCTAAT TATTTGACGA CTTTTTTTAT CACCCGTATT 180 

CGTTACTGTA CAAGTATTAA TAACAAATAC ATCAGCATTC GCTTCAAAGT CAACGCGCTC 24 0 

AT AGTTTG CT TCTTTAAATA ATTGCCAGAT TGCTTCAGTT TCATAATGGT TTACTTTACA 3 00 

ACCTAATGTG TGaACGCAAC TGTTGACATA AATATTCACC CCATTAATTC TTTTTCATAA 360 

CTTATTGCAC TTAACGCATA CAATGGCGCA GTTTCTGCCC GTAAAATTCT CGGCCCAAGA 420 

CCAACAACTG TACTAGTATT ACTAAATAAT GAAATTTCAT TTTCTGACAA ACCACCCTCA 4 80 

GGaCCAAAAA TCATCAACAC TTTATCCTGA GCATTGAATT GTTGTAAAGT TTGCTTGAAA 540 

TTGCTTAACT CACCATCTTT TGCTTCCTCT TCATATGCAA TAAGAATATA GTCATAATTA 600 

TCAATAGTAT CACAAATTAA TTTTAAATTC GACTCGAATT GAATAGATGG AATCACTAAA 660 

CGATAGCTTT GTTCAGCAGC TTCTTTAATT ATTTTTTGCC AACGCTCTAT CTTTTTGGCA 72 0 

ACTTTTGCCT CGTTTAATTT AACAATTGAA CGTTCCATGC TCACAGCTAT AAATGATGAA 7 80 

GCACCCAATT CAGTAGCTTT TTGTAGCAAC CACTCATATT TGTCAGCTTT GATTAGTCCA 84 0 

CTGCAAATCG TAACATCAAC TGGCAATTCT GTATTAATAT TTTGTTTTTC TTTTAAATCA 900 

ACTTCAATTT TATCACTTGT TATGTCAGCA ATTTCACATA AATAAACTGT TTGATCATTA 960 

35 AAAGTTAAAA TAATTTTACT ACCAACATCA TATCTCATTA CATTTGTTAT ATGATGAATA 1020 

TCTTCTTTTT TTGTAATAAA AAAACGCTGA CTTACATCAG CGTTTTGGnT CTATGAAATA 1080 

ACGTTGCACA TTATTCACTC ACTTTCTGGC CAACAAGACA AACCCAACCG TTGTCATGTT 1140 

GTTCTGAAAT AATTTTAAAA CCTACACGCT CCATATGTGA CTGTATACCT TCATACTTCT 12 00 

CTTTTATAAT ACCAGAAGTA ATAAAATAAC CGCCTTCATT TAGAGTATTA TAAGCATCTT 1260 

CAATCATTTC ATCAATAATA TGCGCTAAAA TATTTGCTAT TACAATATCA AATTTTTCTG 1320 

TTTCGTCTTT CAATAAGTTA CCTGGAACAG CTTCAATTAA CGTTTCACAA TGATTTCTTC 13 80 

TGAAGTTTTC TTTAGCTACA CTCACTGCCA TTTCATCAAT ATCCAACGCT TTAATACGTT 144 0 

TTACACCGAT TAGATGACTT GCAATACTTA ATATACCTGA GCCAGTACCA ACATCAATTA 1500 

CTGAATGCTG TGGCAATACA TATGTTTCTA TTGCCTTCAA ACACATACTT GTAGTCGGAT 1560 

GATCACCTGT TCCAAAAGCC ATACCTGGGT CGAGCTCAAT GCAAAGCTCT TCATCCGCTT 1620 
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AGTTTTAACA 


TTTGGTTGGG 


TTGGGCATAT 


GTTCCAGCCT 


TTTTTAATAC 


TTAAAAACTA 


780 


5 


ACGAAgTATA CTTGTGTGCA CAAATGGTTT 


TTATACAACA 


TTTTATAAAT 


TTATACATTT 


840 






UATACGATAG 


ATGGTTTAAA 


CCTTGTTAAC 


TGAGAAATTT 


TGATATGTAT 


900 




1L1 ILbAAAI 


ITAACTAAAT 


ATACGAAATT 


CAAGAAGCAC 


AATAATTAAT 


raTTTTTrrT 


Ci r r\ 


10 


A JL AL~AAAA(j I 


TCGTATGACT 


GCATTATAAA 


AGCATAAATT 


TATAATTTTT 


TTAAATGTCA 


1020 




1 I bAALb I\*A 


TAATGTGAAT 


GGATTGAGCA 


ATTTTGAAAA 


AGTGAAAAAT 


AACCTATGCG 


1080 




ACTTG CAATT 


AATTTTCAGT 


ACGTTATAAT 


GCACACTGTG 


CAAAATTAAG 


GAGGTCTATT 


1140 


15 


ATTCACATGA 


TGATGAaTAA 


AGAAGCAACA AAAATTGGAT 


TTGCCTACGT 


CGGCATTGTA 


1200 




GTGGGCGCAG 


gATTTTCAAC 


TGGACAAGAA 


GTTATGCAAT 


TTTTCACTAA 


ATATGGCTTG 


1260 


20 


TGGGCTTATT 


TAGGTGTTAT 


TATATCTGGT 


TTTATTTTAG 


CTTTTATTGG 


GCGCCAAGTA 


1320 


GCAAAAATTG 


GTACTG C CTT 


TGAAGCGACA 


AATCATGAAT 


CAACATTACA 


ATACGTATTC 


1380 




GGTGAAAAGT 


TTAGTAAAGT 


CTTTGaTTAT 


ATTTTAATCT 


TCTTCTTATT 


TGGTATAGCT 


1440 


25 


GTAACCATGC 


tAGCTGGTGC 


AGGCGCAACA 


TTTGAAGAAA 


GTTATAACAT 


ACCTACATGG 


1500 




CTAGGTGCTT 


TaATTATGaC 


ATTAGCGATT 


TATATTACGT 


TGCkATTAGA 


CTTTAATAAA 


1560 




ATAGTACGTG 


CACTAGGTAT 


CGTTACACCA 


TTTTTAATTG 


TTTTAG TTGT 


ATTAATCGCT 


1620 


30 


GGCGTTTATT 


tATTTAAAGG 


TCATGtTTCA 


TTAGCAGAAG 


TTAACCAAGT 


AGTGCCtGAA 


1680 




GCAAGTATTT 


GG AAGGGAAT 


CTGGTTTGGT 


ACAATATATG 


GTGGATTAGC 


TTTTTCTGTA 


1740 




GG TTTTAG T A 


CCATCGTAGC 


AATCnGTGGG 


GATACTGAAA 


AGCGTACAGT 


GTCAGGTGCA 


1800 


35 


GGCGCGATGT 


ATGGTGGTAT 


TATCTATACT 


GTATTACTAG 


CATTGATCAA 


CTTTGcATTG 


1860 




LAAb 1 Ca 3AT A 


TCCAACTATT 


AAAAATGCCT 


CAATTCCTAC 


ATTGACGTTA 


GCAAATAATA 


1920 


40 


TCCATCCTTT 


AATAGCAACA 


GTGkTATCTG 


TTATTATGCT 


GGCGGkTATG 


TATAATACTA 


1980 


TTCTAGGACT 


AATGTATTCA 


TTTGCAGCAC 


GTTTTACAGA 


ACCATACAGT 


AAAAATTATC 


2040 




ATATCTTTAT 


TATTATAATG 


ATGGTAGCAG 


GTTATTTATT 


AAGTTnCGTA 


GGATTTGCTG 


2100 


45 


AA7TAATTAA 


TAAGTTATAT 


ACnATTTATG 


GGATATGTAG 


GCTTATTnTA 


TTGTAGTAGC 


2160 




TGTAATTATn 


AAATATTTCC 


AAACGTAAAA 


ATGGCGGATA 


AAAAACATAT 


TGCTTTAATA 


2220 




TCATATGGAG 


GGG AT ATCCG 


AAACTTTACA 


ATTTGAATCA 


CTTTGGT 




2267 


50 


(2) INFORMATION FOR SEQ ID NO: 219: 









<i> ^FCtTENCF CHARACTERISTICS ■ 
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ATAAAGATAC TGAACAGTTA TATGGTGAAT TGATCACTGC TAATATATAT CGAATTAAGC 450 0 

AAGGCGATAA AGAAGTGACG GCATTGAATT ATTATACGAA TGAAGAAGTT GTCATTCCTT 4 560 

5 

TAAATCCTAC AAAATCCCCA TCAGCAAATG CTCAATATTA TTATAAACAA TATAAyCGTA 4 620 

TGAAAACGAG AGAmCGTGAA TTACAACATC AAATTCAATT GACGAAAGAC AATATAGATT 4 680 

10 ATTTTTCAAC AATCGAACAA CAATTACATC ATATTTCTGT CCATGACATT GATGAAATTA 4 74 0 

GAGATGAATT AGCAGAACAA GGCTTTATGA AACAGCGTAA AAATCAAACT AAGAAAAAGA 4 800 

AAGCGCAGAT TCAATTACAA CATTATGTAT CAACTGATGG CGACGATATA TATGTTGGTA 4860 

15 AGAATAACAA GCAAAATGAT TATTTAACAA ATAAAAAAGC TAAAAAAACT CACACATGGT 4 92 0 

tACACACAAA AGATATTCCT GGTTCACATG TCGTTATATT TAATGATGCA CCAAGTGATA 4 980 

CGACAATCAA GGAAGCGGCT ATGTTAGCAG GATACTTTTC AAAAGCTGGT AATTCTGGAC 504 0 

20 

AAATACCTGT TGATTATACA TTAATTAAAA ATGTGCATAA ACCATCaGGT GCAAAGCCTG 5100 
GGTTTGTAAC ATATGACAAT CAAAAAACTT TGTATGC 513 7 

(2) INFORMATION FOR SEQ ID NO: 218: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
3 o (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

35 GTTTTATCGC AGCAGTAAAG CTATCAATCG GCGGTTCAAT TGATGATGCA TTAGCAGAAA 60 

TCAnACAATC ATTTTAGTTA AAATTTACTA ATAATGAaAA ATGTAAACCT TTTTCAAATG 120 

AAACTTTATa AaAAATATGA TAGTATATAT GTAAATGTTT AATAAAATCT GGAGAAATAG 180 

40 

GAGGACATTG CCATGCAACA CCTTATAAAA AAACATGTAT TGAATGGCGA GTTTGATTTA 24 0 

GTACGACAAT TGATGTCCGA AACAGATTTT ATGGAATTTG AAGAAGCATA TATTTCAAGT 3 00 

GCGCATGAAG TAGAAAGTAT GATGTTTTAT ACATGTATTT TAGATATGAT TAAGTACGAA 360 

45 

GAATCATCTG AAATGCATGA CTTAGCATTT TTATTGCTTG TGTATCCACT AAGTGAATAT 420 

GAAGGTGCTT TGGATTCTGC TT ATT AT CAT GCAGACGCTT CCATAAAACT TACTGACGGC 480 

50 AAAGAAGTTA AAAGTTTGTT ACAAATGTTA TTATTGCATG CGATACCAAC ACCTGTTATT 540 

TCAGATAAGA AGGCTTTTGA TATCGCCAAG CAAATTTTAA AATTAGATCC TAATAATAAT 6 00 

GTTGCTCGTA ACGTCTTAAA AGACACTGCC AAACGTATGc gACAaCGTTG TTGTTGATAT 660 

55 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



TTTArGTGcT 


CAGCTTCTAC 


AATACATTGA 


ATTCtATTCy 


TCGcAAGtTC 


TACTTCAtCA 


2700 


TTAACTACAA 


cGTAAyCGTA TAAATTCATC ATTTCrACTT CTkTACGCGC yTCGTTAATA 


2760 


CGACTTTGTA 


i I 1 1L. 1CJAIL 


AGATT CTGTT 


CCTCTACCTA 


CTAATCGCTC 


TCTCAAGTGT 


2820 


TCTAAACTTG 


oALrtj 1 LjL 1 AA 


r~ , 7\7 k .T\7 k ,'T , T\7\TV r T' 

CjAAAAI AAA1 


AGCGCATCTG 


GAAATTTCTT 


TCTAACTTGC 


2380 


TTTGCACCTT 


/ T A /Mint » » rr^ 

U lAv_ I 1 LAAI 


TTCTAAAAAT 


ACATCATGAC 


CTtCGTCCAT 


TGT AT C TTT A 


2 94 0 


ACATATTGAA 


(_ lVj<jrl (j IACL 


A I AA TAC3 11 (J 


CCTACATATT 


CAGCATATTC 


TATAAATTGG 


3000 


TCATCTTTGA 


I I AAACjL I 1L 


a a a nor 1 a r P^/^> 
AAALvjCATCC 


CTAGTTTTAA 


AAAAGTAATC 


TACGCCATCA 


3060 


ACTTCACCTT 


CACG CATTTG 


ACGTGTTGTC 


ATTGAAATAG 


AATACTTATA 


TGATGTACTT 


3120 


GGATCTTCAA 


ATATnCGTnT 


m< »( ■ i it ik /"i ik m ik 

TCTAACAGTA 


CCTTTACCTA 


CTCCAGATGG 


TCCTGATAAA 


3180 


ACGATTAACA 


ATCCTTTTTC 


ATTATCCATG 


CCTTACGACC 


TCTCTAAGCT 


AATCTTCTAT 


3240 


TATTTAAATA 


ft ft ^P^^ ft ^* ft 


TTGTTCTTTA 


TATTGTATAG 


CATATTTGAA 


A'lTGCATGCC 


3300 


ATAATTTCTA 


TTAAGTCTAA 


CAATATCGTT 


ATATTGCACG 


ATTAATTTTA 


ATTAAATAAA 


3360 


TTGAATTGCA 


AACTTTTAGA 


TAATGTAAAA 


TGTATGGCAT 


AATGTATGGT 


TCAATAACTA 


3420 


TACTGAAAAG 


TTACAATCAT 


GTTAAAATGA 


AACGAATGAT 


ATGAAGAAGG 


TGGAAGATAA 


3480 


ATTATGGCTT 


ATG ATGG CTT 


ATTTACAAAG 


AAAATGGTTG 


AGTCTCTACA 


Aii"i"i"i'AACA 


3540 


ACAGGACGTG 


nwn^i JV ^ TV TV TV TV fT* 

TTCACAAAAT 


CAATCAACCT 


GATAATGACA 


CGATACTAAT 


GGTTGTACGT 


3600 


CAAAATAGAC 


AAAACCATCA 


ATTGTTATTG 


TCAATCCATC 


CAAACTTTTC 


AAGATTACAA 


3660 


TTGACTACTA 


A A A A A H** A T 1 /^ n 

AAAAATATGA 


T AAT C CATTT 


AATCCACCCA 


TGTTTG CG CG 


TGTTTTTAGA 


3720 


AAACACTTAG 


AALHj I LjO 1 A I 


1 A 1 COAA 1 CC* 


ATTAAGLAAA 


TTGGTAATGA 


TCGTCGCATT 


3780 


GAAATCGATA 


I AAALtAvjt 1 AA 


A l"" 1 A Tf* TV TA 7A TT 

AbAlbAAAI I 


CjCjC-tjArACTA 


TTTACCGCAC 


IXjTCATCCTT 


3840 


GAGATTATGG 


Lj I AAAL-A 1 Atj 


TAACTTAATT 


TTAGTAGATG 


AAAATCGCAA 


A A ^ A A MM) i/"*t 7v TV 

AATAATTGAA 


3900 


GGATTTAAAC 


AL 1 1AALALL 


aa a rp a. c*c* n 7\t 
AAA 1 ALbAA 1 


L-AL. I A I t-Cj I A 


LAb 1 AAT CjL L 


ACjCj Al TT AA 1 


3960 


TATGAAGCAC 


UALL IAL IUA 


ULALAAAA 1 A 


A J1TPPPT71 T"P 

AATCCGTATG 


ATATTACAGG 


TCjCAGAGGTG 


4020 


TTGAAATATA 


TCGATTTTAA 


CGCAGGTAAT 


ATTGCTAAAC 


AATTATTGAA 


TCAGTTTGAA 


4080 


GGATTTAGCC 


CTTTAATTAC 


GAATGAAATC 


GTTAGTCGTC 


GTCAATTTAT 


GACTTCATCA 


4140 


ACATTACCAG 


AAG CATTTG A 


CGAAGTAATG 


GCAGAAACCA 


AGTTACCACC 


TACTCCTATT 


4200 


TTTCATAAAA 


ATCATGAAAC 


AGGTAAAGAG 


GATTTCTATT 


TTATAAAGTT 


AAATCAATTT 


4260 


AATGATGATA 


CAGTTACATA 


CGATTCATTA 


AATGATTTGC 


TTGATCGTTT 


TT ATOA^^ ^ ^ 


* 1 *? ^ 
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ACAATTACAC GCACACCAGG TTGGATGACA GATTCGAGTT GTTCGGGAAT TATATAATCA 900 

AATTTATAGT CAACGCTCTT CGACGCGACA TCGACTATGA CTTTCGCTAT CATTATTGCC 96 0 

5 

ACCTAGTTTC TAGTTCATCT AAAATTTGTG CAGCTAATAC TACTTTTTTT CCTTTCTTGA 102 0 

TATTTACTTT TTCATTATTT TTAAAATGCA TTGTCAATTC ATTATCATCA GAACTAAATC 10 80 

CGATAGACAT ATCCCCAACA TTATTTGAAA TAATCACATC TGCATTTTTC TTGCGTAATT 114 0 

TTTGTTGTGC ATAATTTTCA ATATCTTCAG TCTCTGCTGC AAAGCCTATT AAATACTGTG 120 0 

ATGTTTTATG TTCACCTAAA TATTTAAGAA TGTCTTTAGT ACGTTTAAAA GATACTGACA 126 0 

15 AATCACCATC CTG C TTTTTC ATCTTATGTT CTAATACATC AACCGGTGTA TAGTCAGATA 132 0 

CGGCTGCTGC TTTTACAACA ATATCTTGTT CGTCAAATCG GCTTGTCACT TGTTCAAACA 138 0 

TTTCTTCAGC ACTTTGAACA TGAATAACTT CAATATCTTT TGGATCCTCT AGTGTTGTAG 144 0 

20 

GACCAGCAAC TAACGTCACG ATAGCTCCTC GATTTCGCAA TGCTTCAGCT ATTGCATAGC 1500 

CCATTTTTCC AGAAGAACGA TTGGATACAA ATCTGACTGG ATCGATAACT TCAATAGTTG 1560 

GTCCTGCTGT AACCAATGCG CGTTTATCTT GAAATGAACT ATTAGCTAAA CGATTACTAT 1620 

25 

TTTGAAAATG AGCATCAATT ACAGAAACGA TTTGAAGCGG TTCTTCCATA CGTCCTTTAG 1680 

CAACATAACC ACATGCTAGA AATCCGCTTC CTGGTTCGAT AAAATGATAC CCATCTTCTT 174 0 

30 TTAAAATATT AATATTTTGC TGCGTACGTT TATTTTCATA CATATGCACA TTCATAGCAG 18 00 

GCG CAATAAA TTTCGGTGTC TCTGTTGCTA GCAACGTTGA TGTCACCAAA TCATCAGCAA 18 60 

TACCTACACT CAATTTTGCA ATTGTATTTG CCGTTGCAGG TGCAACAATG ATTGCATCTG 1920 

35 CCCAATCACC TAATGCAATA TGCTGTATTT CTGAAGGATT TTCTTCTATA AAAGTATCTG 1980 

TATAAACAGC ATTTCGACTT ATTGCTTGAA ATGCTAATGG TGTCACAAAT TTTTGTGCGT 204 0 

GATTeGTTAA CATAACGCGA ACTTCATACC CAGATTGTGT TAACTTACTT GTCAAATCAA 2100 

40 

TTGCTTTATA TGCCGCAATG CCACCTGTAA CGGCTAATAA TATTTTCTTC ATATTCAATC 216 0 

TCCCTTAAAT ATCACTATGA CATTTACGCT TTACATCATC ATATGCGCAC AAATGCTCAT 222 0 

TACTTTTTTA TAGATACAAA TTTAGTATTA TTATAACATC AATCATTGGA TAAACTAAAA 228 0 

45 

AAACACACCT ACATAGGTGC GTTTGATTTG GATATGCCTT GACGTATTTG ATGTACGTCT 234 0 

AGCTTCACAT ATTTTTAATG GTCGAAACTA TTCTTTACCA TAATAATCAC TTGAAATAAC 240 0 

50 AGGGCGAATT TTACCGTCAG CAATTTCTTC TAACGCTCTA CCAACTGGTT TAAATGAATG 24 6 0 

ATATTCACTT AATAATTCAG TTTCAGGTTG TTCATCAATT TCACGCGCTC TTTTCGCTGC 2 52 0 

AGTTGTTGCA ATTAAATACT TTGATTTAAT TTGTGaCGTT aATTGGTTtA AAgGTGGATT 2 58 0 

55 
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ACCATTGTTA 


AACAAAAGTG 


ACAGTGAATT 


TTCAAAAGAA 


TTGTCAAATG 


TTAAGAAGCA 


1080 


5 


ATTAAAAGAT 


AAGTCTAAAG 


TTTCGGTAAC 


TACTACTCTA 


TTTAGTAAAA AAAAGAACTA 


1140 




TACTAAAAAA 


AGTAACAGTG 


AAAATGTAAT 


AAAAATGGCA 


GAAGAAATAA 


AAAAAGATAA 


1200 




AQ AG AT AC C A 


AACGGTATAG 


AGCTTAGTAT 


AAAATTTTCG 


GACAATAAAA 


TAAATACGGT 


1260 


10 


TAAACCAAAT 


TTTAACGGTG 


aAAGCACTTC 


AGAATATGGT 


GTGTTTGATC 


AAUAAI AAAA 


1320 




TTAATGATGa 


AAATTTAACG 


GAGAATAGTG 


TATATTGAGT 


AG AT CmAGAA 


1 AAAnnbn 1 r\ 


1380 




ATTCTACTAT 


TGTTGTGAAG 


GCAAATAAGT 


AGAAGATTTT 


AAGTGTAATT 


TCTCiCj I CjA 1 1 


1440 


15 


TAAATAATAA 


TATAnATGGn 


AGTACTGATA 


TAAnACTTTT 


TAAC CT ACT A 


GATTCTTATA 


1500 




ATTTGCTTTC 


CATTTTATGA 


CGA'ITITTAC 


TCCAATTGAG 


TGATAGAATC 


CAAAAAAGCC 


1560 


20 


ATCTCCAAAA 


ATTAATCC 










1578 


(2) INFORMATION FOR SEQ ID NO: 217: 








25 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 








30 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 








TGTTTTCCTT 


GGGTTAAAAC 


ATGCTTGCTA 


TGCGTTTGTA 


AATATGACTT 


GCTGTTTTnA 


60 




CCTGnATACC 


CGTCACACCA 


TGGAAGTAAA 


AATGTTTCTT 


GCTCTTGGCT 


TACAATTTTA 


120 


35 


GCTTTAATCG 


CTTCATATGC 


TTTATATTGG 


TCTTCTGTTA 


ATTGCTGTTT 


TGATTCTTGT 


180 




TCGAAAACAC 


GATCTTTAAA 


TGGGTCTCTT 


TCAACAACCG 


CGTCATATTT 


TTCAACATAA 


240 


40 


CCTTPTTTGA 


TAAGTCCATC 


TAAACTGGAT 


TTTGAAAAGC 


CCATATCCTC 


AA1AI<JA<jI 1 


300 


AAAAATATTG 


TTTTATGTTG 


TTCTTCAGAC 


AAGTAAGCAT 


ACAAATCGTA 


TTGTTTAATA 


360 




ACTTTCTCCA 


ACTTAGCTAA 


TACTTCATCA 


GGATGATACC 


CTTCAATGAC 


ACG AACAG CA 


420 


45 


CGCTTGGTTT 


TTTTAGTTAT 


ATTTTGTGTG 


AGAATCGTTT 


TTTCTTCAAC 


GATATCATCT 


480 




TTTAACAACT 


TCATAAGCAA 


TTGAATATCA 


TTATTTTTTT 


GCGCATCTTT 


ATAATAATAG 


54C 




TAACCATGCT 


TATCAAATTT 


TTGTAATAAA 


GCTGAAGGTA 


GCTCTATGTC 


ATCTTTCATC 


600 


50 


TTAAATGCTT 


TTTTATACTT 


CGCTTTAATA 


GCACTCGGAA 


GCATCACTTC 


TAG CAT AG AA 


660 












.AAnrTATTAA 




~t n 



EP0 786 519 A2 



10 



TGCTTCTGTT AATATAACTG TTGGCATGAT AACTCCTCCT TAAAAAATCC AAGTTTCTTT 300 

TATATGTGCA TATATATTTT GTAATAATTC TTCCGGCGAA TCACCTTCAA CAATATCACC 3 60 

ATTTACTAAA GCATACAACC CGGCTGAACA TATACCACAA TGTGTCAGGC AACCATACTC 420 

TAACACATCG ACATCTGGGT CATTTTCCAG TTGATTAAAA ACATAATCTC CACCTTTTGC 4 80 

CATGTTAGAG AGACAAAATT CTACGATCGG ATTCATACTT CACCTTCTTA TTTCATTTGT 54 0 

TACAATATTA TAGCATTTTA AAACTGGTAT TTTAACATGA TGTGCTCAAT TAGCAACAAC 600 

TGATGTTTCT TATCCCAGTT ATGTAATAGT GCCTTAGTTA GTAC 644 
75 (2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1578 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

25 

GAATGATGAA AGGAATAGAA AAGAAAAGAT AAATAATGTA ATAGATTTAT CCGAGAAAAT 6 0 

TGAAAGAACA AAAGATATGC CAATCAAGAA TACTATAACT ACTCAATTAG GAAATAAACT 120 

TATTGGCACA AAAAAAGCTC GTTTTGATGA TAAGAAAGTA GTGTCGTTTG GAGCATTTGA 180 

30 

AGATGAATAA AATAAATGAT AGAGATTTAA CAGAATTGAG TAGCTATAGG GTTTATCAAG 24 0 

ACATCAATAA AGATAATGAC TTTACAGTTA ACGAAAAACG ATTTAAGCAG GCAGATGTAT 300 

35 TTGAAGATTT ATATAGAGAG AAACTAAAAG ACACAAATAA ATTAAGAGAG TATAATTATT 360 

TACAAAATGA AACTTTTAAA AGCGCATAAA TAGGTGATGA GATATGCTTA AAAAAGCAAA 420 

ATTTATCTTA ATGGCAACGA TACTACTATC AGGATGTTCA ACTACCAATA ACGAATCCAA 480 

40 CAAAGAAACA AAATCTGTAC CAGAAGAAAT GGATGCTTCA AAATATGTAG GACAAGGATT 54 0 

CCAACCACCT GCAGAAAAAG ATGCGATTGA ATTTGCAAAG AAGCATAAAG ATAAAATTGC 600 

TAAGCGAGGC GAACAATTTT TTATGGATAA CTTCGGTCTA AAAGTTAAAG CTACAAATGT 660 

45 

TATAGGTAGT GGCGATGGTG TAGAAGTATT CGTGCATTGT GATGACCACG AyATCGTATT 720 

TAATGCGAGT ATT CCATTTG ATAAATCAAT wATTGAsAGT GATAGCTCAT TAAGAAGTrA 780 

50 GGAyAAAGGy GATGATATGA GTACTTTAGT TGGTGCAGTA CTCAGTGGGT TTGAATATCG 84 0 

AGCACAAAAA GAAAAATATG ATAAATTATA TAAATTTTTC AAAGATAATG AAGAGAAATA 900 

TCAATATACA GGATTTACAA AAGAAGCAAT TAATAAGACG CAAAATAGTG GTTATGAAAA 96 0 



55 
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AGGTGAAGAA TTTGATAAAA GTAGATCAAC ATTACTTTGA ATTAATAGAA AATTATCGCG 66 6 0 

AATGTTTTAA TGAAGAACAA TTTATTGCTA GGTATTCAGA TATTTTAGAT AAATATGATT 6720 

5 ACATAGTTGG TGACTATGGT TACGATCAAT TACGATTAAA AGGTTTTTAC AAAGATTCTA 67 6 0 

ATAAAAAAGC AGAGATGAGT AAACGTTTTT CAAATATTCA AGATTACATA TTTGAATATT 6 84 0 

GTAACTTTGG TTGTCCTTAC TTTGTATTAA GACATTTGTC TAAACAAGAG GTTAAAAAGT 6 90 0 

w 

TAATCGAAGA AGTTCATCCG TCTGATGTGA TAGATGACGA CAATAAACTT CAAGATGTGA 696 0 

AGATTAAGCC AACCATTCAA GATACTGAAC ATTAATAAAA CCCTTAGCTA GATTGAAAAT 7 020 

1S GGGAATCATG CAATTCAAGC ATGGACCTGT AATCTAGTTA GGGGTTTTTA TCTTTAATGA 7080 

ATGACTTCAT TTAAATACTC AGTAATTTCA TCGCCTTCTT CAGCATTTAC ACCTAAAATA 714 0 

TGAGCGATAT AGCCTTCTTC TTTTAAATCA TCAGTACCGA TAATACCGAA TTTATTTGTT 72 00 

20 TGCATATTAA GTACGAGTGT CTTACCATAA TGTCTATTTG TATGGACTAA CATCAAATCA 72 60 

TATCGACTAT GCTCGCCAAC AAAACCAACA AACTGAACTT GACTCTCTTC GTTGTCATCA 73 20 

TATAAATACA TATCAATCAT TTTGTAGCGA CTCCTTTTAA AAGTAGTAAA GTTAGTATAA 73 8 0 

25 

CGACAAATGA AGTATACTGC AAAATTATGA TAATATATAA GTGAGAGGTG ACAAGGAATG 74 4 0 

TATTTTGTAG ACAAAGATAA ACTAACTCAG AAATTAGCCT ATTTACAAGC ATTAACTGAT 7500 

GATTATCATG AGAGCAAGCA CAATCATTAT GCATTTGAAC GCATTGCTCA AATGTTGATA 756 0 

30 

gaatcatcgg tagatatagg gaatatgatt atcgatgcat ttattttaag ggatcctggt 7 6 20 

aattataaag atgtgattga tatattagaa ctagaaaatg ttattactaa agaaacacag 76 80 

35 caggcgatta ataaaactgt cggtattcgt aaacaattta catatgatta cacagcctta 774 0 

gatgttgaga ttatcatgcc aatgtttga 7 76 9 

(2) Information for seq id no: 215 : 
40 (i) sequence characteristics: 

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

so ACCGCCACCC ATTAATGATT GCTTAAAATC AATAGTCGTA CCATTTAATA CGGGTGCATC 6 0 

TTTTTTGTCT ACTAATACTT TTAATCCAAA GTATTCTAAG ACTTCATCAT TTTCACCAGG 12 0 



10 



15 



25 
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GCAgCTGCAA AGTATGGTAA TTATCTTGGA GAGGTTAATT TAACTTTTGA GGCACATAAA 4 86 0 

GTAGTACATA AAACTGCAAA GATTATTCCT TTAGAAACAT TACCTGAAGT TGAAACTTCA 4 920 

TTTGAAGAAG AAGGAAAAAC GTTAATGTCC AATTCAGTAA TTCAACATCC AGTAGTGCTT 4 9 BO 

AAGCGTAGTA TGAATCACAT AACTGAAGCT GCATACTTAT TAGCTCAAAG TGTTTGTGAG 504 0 

TATACACATG CACAATGTGC CATCATCAAT GCTGGCTTAC TCGTTAAAGA TATTGTAAAA 5100 

GATGAAGTGA CAGAATATGA CATTCATCAA ATGTTACCGC ATCCGATTAA TATGGTAAGG 5160 

GTTAGACTTT TTGGTGTGAA ATTAAAAGAG ATTATAGCTA AAAGTAATAA ACAAGAATAT 5220 

ATGTATGAAC ATGCACAAGG TTTGGGTTTC AGAGGGAATA TATTTGGAGG ATATATTCTT 5280 

TATAATTTAG GGTACATTCA TTCTACAGGG CGTTACTATC TGAATGGAGA AGAAATCGAA 5340 

GACGACAAAG AATATGTACT AGGTACGATA GATATGTATA CGTTCGGTCG TTATTTCCCA 5400 

20 ACATTGAAAG AATTACCAAA AGAGTATTTA ATGCCAGAGT TTTTAAGAGA TATATTTAAA 5460 

GAAAAATTAT TGGAATATTA AAAAGTAAGA TTATTGGATT TTCATTTGTC ATGAATTTCG 5520 

ATATAATGTT TAAAGATACA CTTAACAGGA GGGTATGTGT TGTTATGGCG ACAAAAAACG 5580 

AGGAAATATT ACGTAAACCG GATTGGTTGA AAATAAAATT AAATACCAAC GAAAACTATA 564 0 

CAGGACTTAA GAAGATGATG AGGGAAAAAA ATCTTAATAC TGTATGTGAA GAAGCTAAAT 5700 

GTCCTAATAT ACATGAATGT TGGGGTGCAC GTCGTACAGC GACATTTATG ATTTTAGGTG 5760 

CCGTATGTAC AAGAGCTTGT CGTTTTTGTG CGGTTAAGAC AGGTTTACCT AATGAACTTG 5820 

ATTTAAATGA GCCTGAACGT GTAGCTGAAT CAGTTGAATT AATGAATTTG AAACACGTTG 5880 

35 TTATCACTGC TGTTGCGCGT GATGATTTAA GAGATGCTGG TTCAAATGTT TATGCTGAGA 594 0 

CAGTACGTAA AGTTAGAGAA AGAAATCCAT TTACAACGAT TGAAATTTTA CCATCAGATA 6000 

TGGGCGGGGA CTATGATGCG TTAGAAACAT TAATGGCGTC AAGACCTGAC ATTTTAAACC 6060 

40 ATAATATTGA AACTGTTCGT CGCTTAACAC CGAGAGTTCG TGCGCGTGCG ACTTACGACA 6120 

GAACATTAGA GTTTTTACGT CGTTCAAAAG AATTACAACC GGATATCCCA ACTAAATCAA 6180 

GTATTATGGT TGGATTAGGT GAAACTATAG AAGAAATTTA TGAAACGATG GATGATTTAC 6240 

GTGCGAATGA TGTAGATATT TTAACGATTG GTCAATATTT ACAACCTTCA CGTAAACATT 6300 

TAAAGGTTCA AAAATATTAC ACGCCTTTAG AGTTTGGTAA ATTAAGAAAA GTGGCAATGG 6360 

ATAAAGGGTT TAAACATTGC CAAGCTGGAC CTTTAGTACG TAGTTCTTAT CATGCGGATG 64 2 0 

AGCAAGTAAA TGAAGCTGCT AAAGAAAAGC AACGCCAAGG TGAGGCACAG TTAAATAGTT 6480 

AATATTTAAC CATTAATAAG GCATAAAGGC TTAGTTTGTA CAAAACGAAC GTGTCATAGA 6540 
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AATCACAAAT GAAATTGCGT TTGTACGTTA TCATGGACGT AATCATTACG GTTGGACTAA 3060 
GAAAGATATG TCAGATCAAG AATGGCGCGA TGTACGCTAT TTATATGATT ATAATGAGCA 312 0 

AGAATTAATA GACTTGGCAC AAAAGGCACA AATATTAGCA CAAAAAGCTA AGAAAGTTTA 3180 
CGTCATATTT AACAATAATT CTGGTGGTCA TGCAGCAAAT AATGCCAAAA CATATCAGCG 3 24 0 

ATTATTGAAT ATAGAATATG AAGGGTTAGC ACCACAACAA TTAAAATTAT TTTAAGAGGC 3 3 00 

GACGACTATG TTATTAACAA TTACATTATT AGTTTTAATC GGAGGTTTGT CAGCGATTAT 3 3 60 

AGGGTCTATC GTAGGCATTG GAGGCGGTAT TATTATCGTT CCAACAATGG TTTACCTCGG 3420 

TGTTGAACAT GGATTACTAC ATAATATTAC AACACAAGTA GCGATAGGGA CGTCTTCAGT 34 BO 

CATTCTAATT GTGACAGGAC TTTCTTCATC ACTTGGATAT TTAAAAACAA AACAAGTTGA 354 0 

TATTAAAAAT GGTTCCATCT TTTTATTTGG ACTATTACCA GGTTCATTGC TTGGGTCCTT 36 0 0 

CATTAGTAGA TATTTAACAT TTGAGTCATT TAATTTATAT TTTGGTATCT TTTTAATTTT 366 0 

CG TAG CCA TT TTATTAATGG TAAGAAATAA GATTAAACCG TTTAAAATTT TCGATAAAC C 3720 

CAAGTATGAA AAGACTTATG TAGACGCTAA AGGTAAAACA TATCATTATA gTGTTCCACC 378 0 

ATTGTTTGCT TTTATTACAA CGTTTTTAAT TGGTATATTG ACAGGTTTAT TTGGTATTGG 3 84 0 

AGGTGGCGCA CTAATGACGC CACTAATGCT TATTGTATTT AGATTTCCAC CTCATGTAGC 3 90 0 

TGTTGGAACA AGTATGATGA TGATTTTCTT TTCAAGTGTC ATGAGTTCTA TAGGGCACAT 3 960 

TGCTCAAGGT CACGTAGCTT GGGGTTATGC AATCATnTTA ATTATTTCTA GTTATTTTGG 4 020 

TGCGAAAATC GGTGTCAAAG TGAATCAATC AATTAAGTCA GATACGGTAG TAACATTATT 4 080 

GAGAACAGTA ATGTTGTTAA TGGGTATATA TTTAATTATT CGTGCGTTGA TTTAATACAA 414 0 

CTTTAAAAGG AGGACGTCAA TTTGAGGCTT ACAATTTATC ATACGAACGA T ATT CAT AG T 4 200 

CATTTACATG AATACGAACG CATTAAAGCA TATATGGCAG AACATCGGCC ACGACTTAAT 4 260 

CATCCTTCTT TATATGTTGA TCTAGGTGAT CATGTAGATT TATCCGCACC TATAACTGAA 4 3 20 

GCAACTTTAG GTAAAAAGAA TGTGGCATTA CTAAATGAAG CAAAATGTGA TGTTGCAACA 4 3 80 

ATCGGTAATA ATGAAGGGAT GACCATTTCA TACGAAGCTT TAAATCACCT TTACGACGAA 4 44 0 

GCAAAATTTA TAGTGACATG TAGCAATGTT ATAGATGAAT CAGGTCATTT ACCAAATAAT 4 500 

ATCGTTTCTT CTTATATTAA GGACATAGAC GGTGTGAAAA TACTATTCGT TGCAGCGACA 4 560 

50 GCACCTTTTA CCCCATTTTA TCGTGCACTA AATTGGATTG TTACCGATCC ACTTGAATCT 4 6 20 

ATAAAAGAAG AAATTGAACT TCAACGAGGT AAATTTGATG TATTAATCGT GCTAAGTCAT 4^80 
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GTATTGAATA TCCAATTATA CAAGCAGGTA TGGCAGGAAG TACGACACCG AAATTAGTTG 1260 

CATCAGTAAG TAACAGTGGT GGGTTAGGCA CAATAGGCGC AGGTTACTTT AATACGCAGC 13 20 

AATTGGAAGA TGAAATAGAT TATGTACGCC AATTAACGTC AAATTCTTTT GGCGTAAATG 13 80 

TCTTTGTACC AAGTCAACAA TCATATACCA GTAGTCAAAT TGAAAATATG AATGCATGGT 144 0 

TAAAACCTTA TCGACGCGCA TTACATTTAG AAGAGCCGGT TGTAAAAATT AC CGAAGAAC 1500 

AACAATTTAA GTGTCATATT GATACGATAA TTAAAAAGCA AGTGCCTGTA TGTTGTTTTA 1560 

CTTTTGGAAT TCCAAGCGAA CAGATTATAA GCAGGTTGAA AGCAGCGAAT GTCAAACTTA 1620 

TAGGTACAGC AACAAGTGTT GATGAAGCTA TTGCGAATGA AAAAGCGGGT ATGGATGCTA 1680 

TCGTTGCTCA AGGTAGTGAA GCAGGTGGAC ATCGTGGTTC ATTTTTAAAA CCTAAAAATC 174 0 

AATTACCTAT GGTTGGAACA ATATCTTTAG TGCCACAAAT TGTAGATGTC GTTTCAATTC 1800 

20 CGGTCATTGC CGCTGGTGGA ATTATGGATG GTAGAGGAGT TTTGGCAAGT ATTGTCTTAG 1860 

GTGCAGAAGG GGTACAAATG GGCACCGCAT TTTTAACATC ACAAGACAGT AATGCATCAG 1920 

AACTACTGCG AGATGCAATT ATAAATAGTA AAGAAACAGA TACAGTCATT ACAAAAGCGT 1980 

TTAGTGGAAA GCTTGCACGC GGTATCAACA ATAGGTTTAT CGAAGAAATG TCCCAATACG 204 0 

AAGGCGATAT CCCAGATTAT CCAATACAAA ATGAGCTAAC AAGTAGCATA AGAAAAGC CG 2100 

CAGCAAACAT CGGCGACAAA GAGTTAATAC ATATGTGGAG TGGACAAAGC CCGCGACTAG 2160 

CAACAACGCA TCCCGCCAAC ACCATCATGT CCAATATAAT CAATCAAATT AATCAAATCA 222 0 

TGCAATATAA ATAATCGACC GCAATCCACA AAAGCACAAG CACCCCCAAA CATTATTTTA 2280 

GTGCTTGCCA TTTTTGTGGA TTGCGTTTCT ATTTTACCAA TTTAATCAAA CGAAAACATC 2340 

AAGCTGAAGA TCGCCGAAAG ATTTTAATCA AGCAAAAACA TCAAACTAAA GTTCGCTGAA 24 00 

ATGATTATGA TAAAAGTTAT ATGGTATGAT GACATTGGTG ATATATATGA TAAACATCGG 2460 

40 ATTAACAGGT TGGGGTGATC ACTATTCATT ATATGAAGAT TTAGAACGCC AAACCGATAA 2520 

ACTTAAAACA TATGCTGGAC ATTTTCCGGT TGTCGAATTA GATGCGACAT ACTATGCGAT 258 0 

ACAAC CGGAA AGAAATATAT TGAAATGGAT AAAAGAAACG CCTGATACAT TTGAATTTGT 264 0 

GGTCAAAATT CATCAAGCAC TcACATTGCA TGCAGACTAC AAAACATTTG CAGATACAAG 27 00 

GCAAGAACTA TTTGATCAAT TTAAGAATAT GTTAGAGCCC TTACATACAC AGAAAAAATT 2760 

AGCAATGGTA TTGGTTCAAT TTCCGCCATG GTTTGACTGC AATGCACAAA ATATCAAATA 2820 

TATTTTGTAT GTAAGACAGC AATTACAAGC ATTTCCAATG TGTGTAGAAT TTAGGCATCA 28 80 

ATCATGGTTT AGTGATGCAT TTAAAGAACA AACATTGGCA TTTTTAACAG AACATCAAAT 2 94 0 
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TATGTAyTAT TTTGATATAT TTTTATGGAA TTTACGGTGC TGCTATTGCG CGTTTAATTA 1860 

CAGAGTTTTT CTTGCTCATT TGGCGATTTA TTGATATTAC TAAAATCAAT GTGAAGTTGA 1920 

ATATTGTAAG TACGATTCAA TGTGTCATTG CTGCTGTTAT GATGTTTATT GTGCTTGGTG 19 80 

TGGTCAATCA TTATTTGCC 19 99 
(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

20 TCATTATTAA GACTATTATA TATAATGAAT TTTAACTGGT TTATTAAACG AGAACGTOOO 60 

GAATTAAGTA ACTACAATAA AAATAAGATA TGACAATAAG GAGACTACAC GCGTGATCAT 12 0 

TGCCATAATT ATATTGATAT TTATTTCGTT TTTCTTTTCA GGAAGCGAGA CGGCATTAAC 18 0 

GGCTGCCAAT AAAACAAAAT TTAAAACTGA AGCTGACAAA GGTGATAAAA AAGCAAAAGG 24 0 

CATTGTAAAG TTACTTGAAA AACCAAGTGA GTTTATTACA ACGATTCTAA TTGGGAATAA 3 00 

TGTCGCGAAT ATTTTATTAC CAACACTTGT TACAATTATG GCTTTACGTT GGGGGATTAG 3 60 

CGTTGGTATT GCATCAGCTG TTTTAACAGT TGTTATCATT TTGATCTCCG AAGTGATTCC 42 0 

CAAGTCTGTC GCTGCAACAT TTCCAGATAA AATAACAAGG CTTGTATATC CAATTATTAA 4 80 

TATTTGTGTC ATTGTGTTCC GTCCTATCAC ATTACTTTTA AATAAGTTGA CGGACAGTAT 54 0 

TAATCGAAGT TTATCTAAGG GCCAACCTCA AGAACATCAA TTTTCAAAAG AAGAATTTAA 6 00 

AACAATGTTA GCAATTGCTG GACATGAAGG TGCTTTAAAT GAAATTGAGA CGAGTAGGTT 66 0 

GGAAGGTGTC ATTAATTTTG AAAATTTAAA AGTAAAAGAT GTAGATACAA CACCTAGAAT 72 0 

TAATGTGACG GCATTTGCTT CAAATGCGaC ATACGAAGAA GTTTATGAAA CGGTTATGAA 78 0 

TAAGCCATAC ACTAGATATC CAGTGTACGA GGGAGATATT GATAACATTA TTGGGGTGTT 84 0 

TCATTCTAAA TATCTGTTGG CTTGGAGTAA TAAAAAAGAA AATCAAATTA CAAACTATTC 90 0 

AGCTAAGCCA TTATTTGTGA ATGAACACAA TAAAGCTGAA TGGGTATTAC GTAAGATGAC 96 0 

TATTTCTAGA AAACATTTAG CAATTGTGTT GGACGAATTT GGTGGTACTG AAGCGATAGT 102 0 

GTCACATGAA G A CTTAATTG AAGAATTATT AGGTATGGAA ATTGAAGATG AGATGGATAA 10 80 
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GCTTACAAGT ATA7TCATAA TTACATATTC AAGGTCCTTG CATGTGGTAT TTTGCTATGG 60 

yCtTTaACTA CAACGGGGTC TAAGACTGCG TTTATCATAT TAATCGTCTT AGCCATTtAT 120 

TyCTTTATkA AAAAGTTATT TAGTAGAAAT GCGGTAAGTG TTGTGAGTAT GTCAGTGATT 180 

ATGCTGATAT TACTTTGTTT TACCTTTTAT AATATCAACT ACTATTTATT CCAATTAAGC 24 0 

GACCTTGATG CCTTACCGTC ATTAGATCGA ATGGCGTCTA TTTTTGAAGA GGGCTTTGCA 300 

TCATTAAATG ATAGTGGGTC TGAGCGAAGT GTTGTATGGA TAAATGCCAT TTCAGTAATT 360 

AAATATACAC TAGGTTTTGG TGTCGGATTA GTGGATTATG TACATATTGG CTCGCAAATT 42 0 

AATGGTATTT TACTTGTTGC CCATAATACA TATTTGCAGA TCTTTGCGGA ATGGGGCATT 48 0 

TTATTCGGTG CATTATTTAT CATATTTATG CTTTATTTAC TGTTTGAATT ATTTAGATTT 54 0 

AACATTTCTG GGAAAAATGT AACAGCAATT GTTGTAATGT TGACGATGCT GATTTACTTT 60 0 

TTAACAGTAT CATTTAATAA CTCAAGATAT GTCGCTTTTA TTTTAGGAAT TATCGTCTTT 660 

ATTGTTCAAT ATGAAAAGAT GGAAAGGGAT CGTAATGAAG AGTGATTCAC TAAAAGAAAA 720 

TATTATTTAT CAAGGGCTAT ACCAATTGAT TAGAACGATG ACACCACTGA TTACAATACC 780 

CATTATTTCA CGTGCATTTG GTCCCAGTGG TGTGGGTATT GTTTCATTTT CTTTCAATAT 84 0 

CGTGCAATAC TTTTTGATGA TTGCAAGTGT TGGCGTTCAG TTATATTTTA ATAGAGTTAT 900 

CGCGAAGTCC GTTAACGACA AACGGCAATT GTCACAGCAG TTTTGGGATA TCTTTGTCAG 960 

TAAATTATTT TTAGCGTTAA CAGTTTTTGC GATGTATATG GTCGTAATTA CTATATTTAT 1020 

TGATGATTAC TATCTTATTT TCCTACTACA AGGAATCTAT ATTATAGGTG CAGCACTCGA 1080 

35 TATTTCATGG TTTTATGCTG GAACTGAAAA GTTTAAAATT CCTAGCCTCA GTAATATTGT 114 0 

TGCGTCTGGT ATTGTATTAA GTGTAGTTGT TATTTTTGTC AAAGATCAAT CAGATTTATC 1200 

ATTGTATGTA TTTACTATTG CTATTGTGAC GGTATTAAAC CAATTACCTT TGTTTATCTA 1260 

TTTAAAACGA TACATTAGCT TTGTTTCGGT TAATTGGATA CACGTCTGGC AATTGTTTCG 132 0 

TTCGTCATTt AGCATACTTA TTACCAAATG GACAGCTCAA CTTATATACT AGTATTTCTT 13 80 

GCGTTGTTCT TGGTTTAGTA GGTACATACC AACAAGTTGG TATCTTTTCT AACGCATTTA 144 0 

ATATTTTAAC GGTCGCAATC ATAATGATTA ATACATTTGA TCTTGTAATG ATTCCGCGTA 1500 

TTACCAAAAT GTCTATCCAG CAATCACATA GTTTAACTAA AACGTTAGCT AATAATATGA 1560 

ATATTCAATT GATATTAaCA ATACCTATGG TCTTTgGTTT AATTGCaATT ATGCCATCAT 1620 

TTTATTTATG GTTCtTTGGT GAGGAATTCG CATCAACTGT CCCATTGATG ACCATTTTAG 16 80 

CGATACTTGT ATTAATCATT CCTTTAAATA tGTTGaTAAg CaGGCAATAT TTAtTAAtAG 174 0 
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TGATTCGTTA GACGGTCTAC CTGTGGTTAA CCTTAAATCA AGCAACTTAA AACGTGGTGA 288 0 

ATTAATCACT GGTGACTTCG ACAAATTGAT TTATGGTATC CCTCAATTAA TCGAATACAA 2 94 0 

AATCGATGAA ACTGCACAAT TATCTACAGT TAAAAACGAA GATGGCACAC CTGTAAACTT 3 00 0 

GTTTGAACAA GACATGGTGG CATTACGTGC AACTATGCAT GTAGCATTGC ATATTGCTGA 3 0 60 

TGATAAAGCG TTTGCTAAGT TAGTTCCTGC TGACAAAAGA ACAGATTCAG TTCCAGGAGA 3120 

AGTTTAATAA ATAATTAGGA GTGGTAACAT GCCCGAAATC ATTGGAATTG TTAAAGTAGA 3180 

TTTTACAGAT TTAGAAGATA ACAGACATGT CTATATGAAA GGGCATGTCT ACCCTCGTAA 324 0 

AGGTTATAAT CCTACAGATG AACGTATCAA AGCTTTAGCT AGTGTTGAAA ATAAACGCAA 33 00 

CAAACAAATG ATTTACATTG TAAATGACAA ATTAACCAAA AAAGAACTTG TCGAAATAGC 33 60 

AAGTGTTGCT GGCTTACAAG TTGATGAAAA ACAAACAAAA GCTGAAATTA TCAATGCTTT 34 20 

TGAGTCACTA GAGTAGGTGG TTATATGACT ACGCTAGCTG ATGTAAAAAA ACGTATTGGT 2\ SO 

CTTAAAGATG AAAAGCAAGA TGAACAATTA GAAGAAATCA TAAAAAGTTG TGAAAGCCAG 354 0 

TTGTTATCAA TGTTACCTAT TGAAGTTGAA CAAATACCGG AAAGgTTTAG TTACATGATT 3 6 00 

AAAGAAGTTG CAGTTAAACG CTACAACAGG ATTGGTGCTG AAG t ATGACA TCAGAAGCGG 3 66 0 

TTGACGGACG TAGCAATGCG TATGAATTGA ACGATTtCAA GGAGTATGAA GCTATTATTG 3 72 0 

AT AATTA CTT TAATGCTAGA ACGAGAACTA AAAAAGGAAG GGCTGTGTTC TTTTGAGATA 3 78 0 

TGAAGATAGA GTTATTT7TC AATTAGAACA AGTAGCAACT TACAATCCTA AAACTAGCAA 3 84 0 

AAAAGAAAAC ACACTAATCA CTTATGATGC GATACCATGC AATATTAACC CCATTTCTAG 3 90 0 

J5 AG CAAGAAAG CAACTTGAAT TTGGTGATGT AAAAAACGAT GTAAGTGTTC TGAGGATAAA 3 96 0 

AGAATCAATA TCTTACCCTG TTAGCCACGT GTTGGTTAAT GGCATTCGCT ACAAGATAGT 4 02 0 

TGATACAAGG ATATACAGAC ACGAAACGTC ATATTATATC GAAGAGGTCA ATTGATGAAT 40 8 0 

ATAGATGGAT TAGACGCACT GTTAAACCAA TTTCACGATA TGAAAACCAA CATTGATGAT 414C 

GATGTAGATG ATATTTTACA GGAAAACGCC AAAGAATATG TAGTACGAGC TAAATTGAAA 42 0 0 

GCTAGAGAAG TAATGAATA 4219 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 
qQ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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GACGCTAAAA TTGGTGTTAA TTTCTATCCT ATGCATATCA ATTGTCGTTC AGATTGCGCT 1080 

TTACTACCTA AATCTATGTG GCCGAAAAAA CCAAGCAAGA AACGAAAAAC AAAATACTTC 1140 

GGAGGGAAAG TGAAAAGCGG TGATTGATTT AAAAGTGAAG TTTTTTAAAG GCAAGTTAGT 1200 

TTTGTATGAC AGTAAATTAA ATGTTTGGAG GATACTAATA TGAGTAATAC TGACAAATAC 126 0 

CTTAGAGACA TAGCAAGAGA ATTAAAAGGT ATACGTAAAG AGTTACAAAA GCGAAACGAA 1320 

ACAGTTATTA TTGATGCAAA CTTAGACAGT TTAAGGTCGG CAGTATTAGC CGATAAAGAA 13 80 

AAATCGAAAT ATAATGAACC TCTCTTTTAA TAGCTAGCAC TTAATTGTGT TGGCTATTTT 1440 

TTATGTCCAA AACGTGCTGA TGACATAAAA AGCACGCATG GAAAAACAGT CGACAGACTA 1500 

TAAATGGAGG TATATCTCAT GGAAGAAAAT AAACTTAAGT TTAATTTGCA aTTTTTTGCA 1560 

GACCAATCAG ATGATCCGGA CGAACCAGGC GGAGATGGTA AAAAAGGAAA TCCTGATAAG 1620 

20 AAAGAAAATG ACGAAGGTAC TGAAATAACT TTCACGCCAG AGCAACAAAA GAAAGTTGAT 1680 

GAAATACTTG AACGTCGTGT AGCCCACGAA AAGAAAAAAG CTGATGAGTA TGCAAAAGAA 1740 

AAAGCAG CAG AAGCTGCTAA AGAAGCTGCT AAATTAGCGA AAATGAACAA GG A TCAAAAA 1800 

GATGAATATG AACGCGAACA AATGGAAAAA GAACTGGAAC AATTACGTTC AGAAAAACAA 1860 

TTAAACGAAA TGCGTTCAGA AGCACGAAAA ATGTTGAGTG AAGCGGaAGT TGATTCATCA 1920 

GATGrGGTTG TCAATTTAGT TGTAACAGAT ACTGCTGAAC AAACTAAATT GAATGTTGAA 1980 

GCTTTTTCTA ATGCAGTAAA AAAAGCGGTT AATGAAGCGG TTAAGGTTAA CGCTAGACAA 2 040 

TCGCCATTGA CTGGTGGAGA TTCATTTAAT CACTCGACTA AAAATAAACC GCAAAACTTA 2100 

GCTGAAATAG CTAGACAAAA AaGAATTATT AAAAATTAAC GGAGGCATTT AAATGGAACA 2160 

AACACAAAAA TTAAAATTAA ATTTGCAACA TTTTGCAAGT AACAATGTTA AACCACAAGT 2220 

ATTTAACCCT GACAATGTAA TGATGCATGA AAAGAAAGAT GGCACGTTGT TAAACGACTT 22 80 

TACAACACCT ATCTTACAAG AGGTTATGGA AAACTCTAAA ATCATGCAAT TAGGTAAGTA 234 0 

CGAACCAATG GAAGGTACTG AGAAGAAGTT TACTTTTTGG GCTGATAAAC CAGGTGCTTA 24 00 

CTGGGTAGGT GAAGGTCAAA AAATCGAAAC GTCTAAGGCT ACTTGGGTTA ATGCTACAAT 24 60 

GAGAGCGTTT AAATTAGGGG TTATCTTACC AGTAACAAAA GAATTCTTGA ATTACACTTA 2 520 

TTCACAATTC TTTGAAGAAA TGAAACCTAT GATTGCTGAA GCTTTCTATA AAAAGTTTGA 2 5 80 

CGAGGCAGGT ATTTTGAATC AAGGTAACAA TCCGTTCGGT AAATCAATTG CACAATCAAT 2 64 0 

TGAAAAAACT AATAAGGTTA TTAAAGGTGA CTTCACACAA GATAACATTA TTGATTTAGA 2700 

GGCATTGCTT GAAGATGACG AATTAGAAGC AAATGCATTT ATCTCAAAAA CACAAAACAG 27 60 
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ATCTGTTGAA ATCCGAAAAG AGACATTTAC CATTCAACAA GCTACAGAAC AAGTGACATC 174 0 

GAGATTGAAA GATAAAGATC ATTTTAACTT CTTTAGTCTG TTTACGTTTT CTGAGCCAAT 18 0 0 

TGAACAAGTA GTCACTCACT TTTTAGCTAT TTTAGAGATG TCAAAAGCAG GAATAATTAA 18 6 0 

TATTGAGCAA CAACGTAATT TTGAAGATAT TAACATTATT AGAGGAGTGA ACTACCATTT 192 0 

TGGATAATCA TGGTATATTA GAGTCGCTTT TATTTACAGC TGGCGATGAA GGTTTAGATG 19 8 0 

AAAAACAACT ATTAGAAATA TTAGATATGT CGAAAGACCA ACTCGTTGAA TTAATTGAAA 204 0 

ATTATTCATC ACATGGATTA ATGATACAAC GATTTGGAAT GA 20 8 2 
'5 (2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4219 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 212: 

TCTATTCTCG TTCTTCCAAG ACCCTGaATT AGAAGTTAAG AAAATCGAAG AAGATGAGAA 6 0 

AGAATCTATT AAAAAAGCTC AAAAAGGTAT TTATAAAGAC CCTAGAGACA TCAATGATGA 12 0 

CGAACAAGAT GATGATACAA AAGATACTGT TGATAAAAAG GAATGATTGT AATTGCCTAA 180 

CAAAAACACT CAAGAATATT GGGAAGAACG CGGACGCAAA GCAATCGAGA ATGAGTTGAA 24 0 

GCGTGATAAA ACTAAAGCTG AAGAAATAGA ACGTATATTG AATATGATGA TTAAGCGCAT 30 0 

TGAAAAAGAG ATCaATGCGT TTATTGTCAA GTACGGAGAT TTTGCAGGCG TTACATTACA 36 0 

AGAAGCACAA AAGATTATTG ATGAGTTCGA TGTAAAAGCG TTTCAAGAAG AAGCAAAAAG 4 20 

ATTGCflCGAA AACAAGGAGT TTAGCGATAG AGCAAATGAA GAATTAAAGA AGTATAACAC 48 0 

GAAAATGTAT GTATCTAGAG AACAGATGTT AAAGATTCAA ATAGAATTCT TAATTGCTTA 54 0 

TGCAACAGCT CAAACAGAAT TATCGATGAG GGAATATTTC GAATCAACAG CTTATCGTGT 60 0 

GTTCAGTGAT CAAGCGGGTA TTTTAGGTGA AGGTGTACAA GTAGCTAAAG AAGTTATAGA 66 0 

TACAATCGTT GATACACAAT TTCATGGTGT CGTTTGGTCA GAGCGATTAT GGACTAATAC 72 0 

CGAAGCAATG AAACAAGAAG TAGAAGAAAT AATTGCTAAT GTAGTTATTA GAGGTCGACA 78 0 

TCCTAATGAA TATGTTAAAG ATATGCGCAA CACTTAAATA AATTCGAAGG CACAGCACGA 84 0 
rAAAAGA-T^ ^AGrAATTA A A^A^Tnr^ "ft '"a ^nr. — zv^-t * 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GATTTAAATA AAATTAATGG ATATCGTGAT CGTACGATGT TAGAACTTCT GTACGCAACG 60 

5 

GGAATGCGTG TATCTGAATT GATACATTTA GAGTTAGAAA ACGTGAACTT AATAATGGGA 120 

TTTGTACGCG TATTTGGTAA AGG CG AT AAA GAAAGAATTG TACCATTAGG CGACGCAGTC 180 

w ATTGAGTACT TAACTACTTA TATTGAAACG ATTAGACCGC AACTTTTAAA AAAGACTGTT 24 0 

ACTGAAGTCT TATTTTTAAA TATGCATGGT AAACCTTTAT CACGACAAGC AATATGGAAA 3 00 

ATGATTAAAC AAAATGGTGT AAAGGCAAAC ATTAAAAAGA CGTTAACGCC ACATACGTTA 360 

75 CGCCACTCTT TTGCGACACA TTTATTGGAA AATGGCGCAG ATTTAAGAGC AGTGCAAGAG 420 

ATGTtAGGtC ACTCTGaCmT ATCTACTACC CmaCTCTATA CmCATGTTTC GrAATCTCAA 480 

ATTAGAAAAA TGTATAACCA ATTTCATCCT AGAGCATAAA GTGAACAATA ACTCAAAAGT 54 0 

20 

CACAATACAC ATGACTAAAA ATGTCTGTGC TATTGTGGCT TTTTTAAATT GGTTGATTAA 600 

TTACGTCTAT GTTTTCTTAA TTGAATCGCT TCTTCTTTTG CTGCAATCAC TTCTGAACGA 66 0 

TCACGGCGCA TGTGATGGTC TACAATAAAA GGATCTGTTG CTGTTTCCTG ATTATAATCA 720 

25 

TAGTCTGGAT AGTTGGCCTT GATGATGCGT TCAAAGACTG GAGTTATTGG TAATATAACA 780 

GATGAAAAAG GCTTTGCTGC ATTCAATTTT GCAATCTGTT GCTCAATTAA CAACTGATAA 84 0 

30 TCATTTAAAT TAAGGTATAA CGCATCTCTA TCTTTAGCAT TTTGTATTAT TT CTTT AG AT 90 0 

TTATTAAAAG ACTTATAGGC GCCTTTTAAA TTATTGCGGC GATAATGGTA ACAAGCAGTT 96 0 

GCAAACAAGA TTAAACTAAC AACTGCATCT TGCTTACTGT AGTTATTTTC AGCTTTCCAT 102 0 

35 GCATCTTCTA AAATGTCATG ACATAGGAAA TAATGTTGCT TAGTATGAAA TTGATAATAG 1080 

AAA^TTATCA GTGCCTGTTG CATTTTGTTA TCACCCCAAT TTAAAAGTAA GTTATTTTCA 1140 

TGCTATAATA TTTTAGAGAA TTATGCACAT ATGACGCAAT ACGAGGTAGA TATTATGTAT 1200 

40 

GAAGTTAAAT T AG AT G CTTT CAATGGACCA TTAGATTTAT TGCTGCATCT TATCCAAAAA 1260 

TTTGAAATAG ATATTTATGA TATTCCTATG CAAGCATTAA CAGAGCAGTA TATGCAGTAC 1320 

45 GTTCATGCAA TGAAACAGCT TGAAATTAAT ATTGCAAGTG AATACCTAGT ATTAGCGTCA 13 8 0 

GAACTCTTAA TGATTAAAAG TAAGATGCTA TTACCACAAT CAACATCAGA TATGGATGTT 144 0 

GATGATGACC CACGGGAAGA TTTAGTtGGG CGTTTAATAG rATATCaAAA TTATArAGAA 1500 

50 TATACTG C t A TTTTAAATGA CATGAAAGAA GAAAGAGATT TTTATTTTAC CAAAAAGACC 1560 

GACAGATTTA TCtCATTTGG AAAcAGATGA ATCyTGGGAT CCaAATCATA CGATTGATTT 162 0 
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AACCATTACC ATTTTTAATT ACATAAGTGT AGTTATAATC TTTGGcAGCT GATGTAGTTG 7 80 

GTTTCACAGC AGTTGGTGCA GTTAAATCTT TTGCATTTAC CCAACCAGTG CGGTTATTAA 84 0 

5 

TAGTACCGTA TAAATAAACA TCTTTGCCTA CAGATACTTG TTTCGTTGCA TTAAATGTAC 9 00 

CTTGAGCAAT GTTATTGCCT GTTAAAATGA CTTGGTTTTT AGTACCCCAA GGAACCATTG 960 

10 ATAAGCCGTT ATTTGATTTA TTAACAGTAT ATTTTTGAGT CGTTTTAACT TCTTTGCCTA 1020 

AGTTTTGAAC ATTTAAGTCT TTTACATTGA ACCAACCTAA TGGGATGTTA TGGCTTGTAT 10 80 

TGTTTAATAA TACATACGTT TCATTACCAT GAGCACGCTC TTTTGTTACA TAGAACGTAC 114 0 

75 GGTCTGCATA TTTCGCACCG TTTTTCGCTG TTTTTTCATA AACAGAAGCA CGAATACCAG 120 0 

TGTTGTTTGG TTTAACTTGA GCAATCTTGC TAACTGTTTG AGTCGTTTGT GGTTTAGTAA 1260 

CAGTATAAGC TTTTACAGCT GTTTTTGGTT GTGCTACTGC TTTTTTAGGT GCAGCAGGTA 13 20 

20 

CAGCTAAATA TGCTTTACTT ACCCAACCAG ATTTACCATT TACAGTTCCA AATAAATAGA 13 8 0 

TAGATTTATC AATTTGTTGT TGCTTAGTCG CTTTAAAAGT TTGGTTACCT GTACCAGAAA 144 0 

25 CTGCACCAGC TTCTTGTTTA TAAGTGCCCC AAGGTACTGA ATATAATTTA GTGCCTGGgT 150 0 

TTACTGTATA TGTTTGCATT ACATTTACAG GTGATTTTGC AT t G t TATAA ATACGTCACC 1560 

TTGTTTAACC CAACCAATTA AAGTTGGACT ATTGTAATCT TTAACTAAGT AGAATTTGTT 1620 

30 TCCACCTAAA CTTGCTTCTT TTGTTACAGC AAATGTTTTT TGAACTTCTT TCGTTGGCTT 16 80 

ACCAGTTTTG TCATAAACTG TAGTGAATAA GCCATTGTTT TTAGCATTAA TTTGAG CAAC 174 0 

ACCGTTTAAT GATGAAACTG TTAATTTATT ATTTGTTGTA GGTGTTGATG GCTTAGGTGT 18 00 

35 

TGGTGTAGGC GTAGGTTTAG CAGTATCAAC TAAATATGCT TTACTTACCC AACCAGATTT 18 60 

ACCATTCACA GAG CCA TAT A AATAAATTGA TTTATCAATT TGTTGTTGCT TTGAAGCCTT 192 0 

AAATGTTTGG TTTCCAGAGC CAGACACACT ACCAGCAACT TGTTTAGATG TACCCCAAGG 198 0 

40 

TACTG TATAA AGTTTCGTAC CAGGTTTGAT TGAATATGAT TGATTTACAT TTACAGGTGA 2G4 0 

TTTAGCTGTG TTGTAAACCA CATCGCCTTC TTTAACCCAA CCAAATTTAT TACCAGAATT 2 10C 

45 GTAATCTTGA ACAAGATAGA ATTTTTGATT ACCTAATGTA GCTGTTTTAG ATACAGCAAA 2160 

TGTTTTTTGA ACTTCATTAG TTGCTTTACC AGTTTTGTCG TATACAGTAG TATATAAACC 2 22 0 

ACTATTTGTT GG 22 3 2 

50 

(2) INFORMATION FOR SEQ ID NO: 211: 
~ 7 RAND EDNESS : a ; >■ 
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TGTTATACAA ACGATGTAGT TAAAAATTTT ACAGCGAATG GTTTATTAAG TATTGGTGCT 1020 
AGCCCTGCAA TGAGTGAAGC TCCCGAAGAA GCTGAAGAAT TTTACAAAGT TGCACAAGCG 1080 
5 CTATTAATCA ATATCGGTAC TTTAACAGCA GAAAATGAAC AAGATATTAT TGCGATTGCT 114 0 

CAAACGGCAA ATGAGGCAGG CTTACCTATT GTATTTGACC CTGTAGCTGT TGGTGCTTCT 120 0 

ACATATCGAA AGCAATTTTG TAAATTATTA TTGAAATCAG CGAAAGTATC AG T AATT AAA 1260 

10 

GGCAATGCAT CTGAAATATT AGCGTTGATT GATGATACAG CAACTATGAA AGGTACAGAT 1320 
AGTGATGCTA ATCTTGATGC GGTTGCAATA GCGAAAAAGG tTACGCAACA TATAAAACTG 13 80 

1S CAATAGTAAT CACAGGTAAA GAGGACGTTA TTGtTCmAGA TAATAAAGCC TTCGTATTAG 144 0 

CTAATGGATC TCCATTATTA GCACGAGTAA CTGGAGCTGG TTGTTTATTA GGAGGCGTTA 1500 
TTGCTGGATT TTTATTTAGA GAAACAGAAC CAGACATAGA AGCGTTAATT GAAGCGGTAA 1560 

20 GCgkATTTAA TATTGCTGCT GAGGTAGCTG CTGAAAATGA AAATTGTGGT GGTCCTGGTA 1620 
CGTTTTCACC ATTGTTGCTT GATACGTTAT ATCATTTAAA TGAAACAACC TATC 1674 
(2) INFORMATION FOR SEQ ID NO: 210: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

35 ATGAGTTGCC GATGAATTTA GCACCACCAA CGATTGCnTT TGATACTGTG TCCCAACCAG 60 

CTTGTTTAGC ATATTTAATA CCTTCACGTA AAGGATCGTT ATCATATGCA GCAATACCAA 12 0 

ATACGTTATG GTATTTCGTG TTTGAGTTAG TTACAACTTT GTnTTGCACT ACATCTGCAC 180 

40 

CTTTCGCTAA TTGAGAAGTA CCGTTACCTG TTTCTAATAG GGCATGTGAG ATAAGATAAA 24 0 

CTTCATTAAT GCCATACATT TGAGCAGCTT TGTTAAATGC AGCACCTTGG TTTTCTAATA 300 

CACCTTTACC TTTTAAGAAT TGATTAATTT TATCAATAGA AATATTTTGT GGTTGGTCTA 360 

45 

AGCGTAAGAA TTGATATTTT AATGCTGGAT CTTGAGCTAA ACGCTTCGTA TCCATTGCAT 420 

GCTTAACATC ATTAAATTTA GCATCTGTCC ACTTACCTGG TACACGTTGT ACTTGTGGTT 480 

50 TATATTGTAA ACCAGCTTGT ATTTGAGCAA CTTGGTTTAA TGTCATACCT GTTTGATTAT 54 0 

ACTTAATTAA TTCTTTAGCT AAATCAGTTG ATTTAATCCA TGCtAATTTA CCGTTAGATA 600 

ATTTACCATA GTACCAAGTT TGTCCATTAA TGACTTGTTC TTTAACAACT GCGAATGGTT 660 
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CGTTGTACAA AACGTTTTAA TGGTCTTGCA CCGTATTGAG GTTCATAAGC TTCTTGACCT 4 80 

AGCCAAGCTT TAGCATCATC AGAAACTTCA ATTGAGATTC GTTGTTCTAA TAATCTTATA S4 0 

TTTAATTGCG TTAAGATTTT ATCTACAATC ATACTCATGT CATCAATAGA TAATGGTTTA 6 00 

AATAATACGA TAT CATC CAT ACGATTCAAA ATTTCTGGTT TGAAATATGC ATTTAAACTT 6 60 

GTCATAACAG CTTTTTCTGT TGATTCTGTA ATTTCACCAG TCTCTTTTAC GTTTTCTAAT 7 20 

AAAACTTGAG ATCCAATATT ACTTGTCATA ATAATAATAG TATTTTTAAA ATCAACGCTA 7 80 

CGTCCTTTAG AATCAGTTAA ACGGCtTCAT CTAAAATTTG CAATAATACA TTAAAGACGT 84 0 

CAGTAT 84 6 



15 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 74 base pairs 

(B) TYPE: nucleic acid 
20 <C> STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

nTGGGAACAG TAAGCCAGTA TTTTTAGAAA GTTGCCATAC ATGAGCGTCG ATTTTTCCAA 6 0 

TATGGCTATG ACACTAGAAC AATGGGAATT TGGAGGAAAA GTAAATGATT AAACCTAAAA 120 

30 TAGCATTAAC CATTGCAGGT ACTGATcCaA CAGGTGGTGC CGG CGTAATG GCTGATTTAA 180 

AATCATTTCA TTCATGTGGT GTATATGGTA TGGGCGTCGT TACAAGTmTT GTTGCTCAAA 24 0 

ATACATTGGG CGTACAACAT ATTCATAATT TAAATCATCA ATGGGTAGAT GAACAACTTG 3 00 

35 ATAGTGTCTT CAATGATACC TTACCTCATG CTATTAAAAC GGGGATGATT GCTACAGCAG 3 60 

ATACTATGGA AACGATTCGT CATTATTTAA TGCAACATGA ATCTATTCCA TATGTAATtG 4 20 

ATCCTGTTAT GTTGGCGAAA rCggTGATTC ATaATGGwTA ATGACaCAAg CaAAACTTGC 4 80 

40 

AGCATaCGTT ATTGC CATT A GCTGACGTAG TAACAC CGAA TTTACCAGAA GCTGAAGAAA 54 0 

TAACGGGACT AACCATTGAT AGTGAAGAAA AAATTATGCA GGCTGGCCGC ATCTTTATTA 6 00 

ATGAGATTGG TAGTAAAGGT GTCATCATTA AAGGCGGTCA TTCAAATGAT ACTGATATAG 6 60 

45 

CAAAAGATTA TTTATTTACT AACGAAGGTG TTC AAA CATT TGAAAATGAA CGATTTAAAA 7 20 

CAAnACATAC GCATGGAACA GGGTGTACAT TTTCAGCAGT TATAACGGCA GAACTTGCAA 7 80 

5Q AAGGTAGACC ATTATTTGAG GCTGTACACA AGGCTAAAAA GTTTATTTCA ATGAGTATAC 84 0 

AATATACGCC TGAAAT CGG C CGTGGTAGAG GTCCAGTGAA TCATTTTGCA TATTTAAAGA 900 
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AGCTGACTTA TTAGGTCAAC CACTTTACAA ATATTTAGGT GGATTTAATG GTAAGCAGTT 600 

ACCAGTACCA ATGATGAACA TCGTTAATGG TGGTTCTCAC TCAGATGCTC CAATTGCATT 660 

5 CCAAGAATTC ATGATTTTAC CTGTAGGTGC TACAACGTTC AAAGAATCAT TACGTTGGGG 720 

TACTGAAATT TTCCACAACT TAAAATCAAT TTTAAGCAAA CGTGGTTTAG AAACTGCAGT 780 

AGGTGACGAA GGTGGTTTCG CTCCTAAATT TGAAGGTACT GAAGATGCTG TTGAAACAAT 840 

10 TATCCAAGCA ATCGAAGCAG CTGGTTACAA ACCAGGTGAA GAAGTATTCT TAGGATTTGA 900 

CTGTGCATCA TCAGAATTCT ATGAAAATGG TGTATATGAC TACAGTAAGT TCGAAGGCGA 960 

ACACGGTGCA AAACGTACAG CTGCAGAACA AGTTGACTAC TTAGAACAAT TAGTAGACAA 1020 

15 

ATATCCTATC ATTACAATTG AAGACGGTAT GGACGAAAAC GACTGGGATG GTTGGAAACA 1080 

ACTTACAGAA CGTATCGGTG ACCGTGTACA ATTAGTAGGT GACGATTTAT TCGTAACAAA 1140 

CACTGAAATT TTAGCAAAAG GTATTGAAAA CGGAATTGGT AACTCAATCT TAATTAAAGT 1200 

20 

TAACCAAATC GGTACATTAA CTGAAACATT TGATGCAATC GAAATGGCTC AAAAAGCTGG 1260 

TTACACAGCA GTAGTTTCTC ACCGTTCAGg aAACAGAAGA TACAACAATT GCTGATATTG 1320 

25 CTGTTGCTAC AAACGCTGGT cAAATTAAAA CTGGTTCATT ATCACGTACT GACCgTATTG 13 80 

CTAAATACAA TCAATTATTA CGTATCGAgA TGAATTATTT GAAACTGCTA AATATGACGG 1440 

TATCAAATCA TTCTATAACT TAGATAAATA ATTTTCTnTA TAATCAAATG CTGACATAAT 1500 

30 TTTAGTTGAG GATTATTATG ACGGTATAAA TAAATAAAG 153 9 

(2) INFORMATION FOR SEQ ID NO: 208: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B46 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

GAATTTCTAT CTATCAATGA TGTGCATACT TCCAnTTAAA TTAAtCGAAA TGaATCAAGG 60 

TATATCATTC CTGCCTCTTT ATATAACaAC AAATAGTGAT TACAATATTT CGGTTATTAA 120 

45 

CACGAAAATT TTACAAGCAC CTATTTCATT TACATATATA TACAGCAAAA AAGAAAGCCC 180 

AGAAATATTG GTGTTTATTA AATCATTTAA AAAGTATATT GCCAATGAAC AATTATAATA 24 0 

5Q AATTTCAAAT CTAAAAAACC AAGAATGCGA TTAATCATCA CATTCTTGGT TCAATTTTAT 3 00 

TCATGAATTT TTTCAACATT AAACGTTAAG TTATTGTCTG AATTTAAATT AACTTTAATC 360 

55 
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GCTTTTTTAA AATCTATTTT CAATACAACT TTCGAGATTG AATACGTGAG TAaAATGACG 2 8740 

ACGACCGTAT TAATCATTAG CAAGATTGCT AACATCTTAG CACCTGTAAT ATCATATGAA 2 880 0 

CCTATACTTA TTGTTTCAAA CTGATCCTTT AGTCTAATAG CAATATATGA GGAGATTGAA 2 8860 

AATTCACCCA TCATGATGAT ACTGAACCCC GAAATCAATA ACATATAATT ACGGTCTTTC 2 8 920 

AAAACTAATT TATAACTGCG AAATATATTC ATTATTTGTA ATTTTTGATA ACGACTTGCA 2 8 980 

TGCCTCTTGT CATCACTTTG CTTTACTTGA TTTCGGTCTT GAGGTAACCA AATATATAAA 2 9040 

ATAAAGAGTA CAATTAAAAA TATACAAGCT GCTATTAAGA AAAGTAGTAA CATACTGTAG 2910 0 

CCATACATCA AGCCACCTAA CAATGCCCCA ATAGCTACCG ATAAGTTTGT CATCCAATAG 2 916 0 

CTAATCTTGT AAATATAATG TTCCACGTCT TCGGTAATTG CATCCATAAT TAATGTGTCC 2 9220 

ATAACTGGAA ATTGTAATCC CCAAACGATT GTAAATATGG CATATGCAAC ACAAAAACCA 29280 

ATAATTTGCC ACAATTGATG TGACCCAAAT ACGCCCATGA ACACAAGCAT TATCACCATC 2 9340 

GTCGCTTGAT AAATAAGTAC TAGCAACTTT tTCGGAAATA TCTCAATAAG GTAACCAGAT 2 9400 

ATAATGGACA ATGGAAATTT nAGAACCACT AAACCAACAA G AT AT AT AC C GACAATTGAT 29460 

TGACTTAACA TATCTGTTAA ATATAGTGCT ATAAACGGTA TAAATGCTGT CGTAATAATT 29520 

AGCTGTAAAA nATTGCTAAT CAATCGTACT TTCAA 2 9555 
(2) INFORMATION FOR SEQ ID NO: 207; 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

T (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

AAAAAAAAAA AAAAAnGGTG AATCTTTAAT TAAACACTAA TATTGTAAAA GATGTTAAGT 6 0 

AAACGCTTAA TGACACTTAT TTTTTGAAAA TAATAGTAAT ATCATTTTGT TAAATGAAAG 120 

AATAAAGCTA TAATmATTAT AGAATAACTA TTTAAAGGAG ATT AT AAA CA TGCCAATTAT 180 

TACAGATGTT TACGCTCGCG AAGTCTTAGA CTCTCGTGGT AACCCAACTG TTGAAGTAGA 24 0 

AGTATTAACT GAAAGTGGCG CATTTGGTCG TGCATTAGTA CCATCAGGTG CTTCAACTGG 3 00 

TGAACACGAA GCTGTTGAAT TACGTGATGG AGACAAATCA CGTTATTTAG GTAAAGGTGT 3 60 

T ACTAAAG CA GTTGAAAACG TTAATGAAAT CATCGCACCA GAAATTATTG AAGGTGAATT 4 20 

TTCAGTATTA GATCAAGTAT CTATTGATAA AATGATGATC GCATTAGACG GTACTCCAAA 4 90 
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CTAAAGATGT 


ATGTGCATCA 


TGTATTTGAT 


GTACATCCTC 


AATAACATCA 


TTACAACAAG 


26940 




ACATCATATG 


ATGATAACAA 


TGAAAAATAG 


TCTCTAGTGA 


AAGCGAAGAC 


TTCGCGCACT 


27000 


5 


GTAATTGCGA 


CATATTTTGC 


AACACACATT 


TATTAAAATC 


TGTTGTAAAA 


CGTTGACATA 


27060 




CATCTTTTGC 


ATCGATTGTT 


AACGTTAACG 


GCACAATATT 


TCCGTGTAAA 


TCATTTGGTA 


27120 


10 


AATGTGATGG 


TACATGTATA 


CCTAATGTGA 


CATCATGTTG 


TTGACTCATT 


ATATGATTAG 


27180 




CTAAATACAC 


ACTAACAGCC 


AACGATGCCA 


TATCTATTGA 


TGTCATATCA 


TCAATCAAAT 


27240 




ACGTTTGATA 


TAAAGCTTGT 


TCAAATGGAT 


GCTTAATTGG 


AAAATAACTA 


TCAATATGGA 


27300 


15 


TGTCAGAGTT 


ATTCTCTAAC 


CGAAAATAGT 


TTGAGTCTAA 


TGCTATATGC 


GATGCATCTT 


27360 




GATTGTCTTT 


ATCATCATTT 


CTATTTATAT 


GTGCATGCTG 


TCGAGTATTG 


TTAATAACAG 


27420 




TATTGCCACG 


ATATGCATTG 


CATAAATCAT 


CAAGAAnAAT ATCAATTTGA CTATCATCga 


27480 


20 


AAATGGaCAC 


ATGAAAATCT 


aATAGTATAT 


ATGcAGCATC 


AGCGAACTGm 


AACAATTTAA 


27540 




CTTTGAATAA AGGTGAATCA 


TTAAAATGGT 


AAGTACTTAA 


TTCTTGCTTA 


AAAAAAGCTT 


27600 




CTAAATCATA 


GTTTGCGGAA 


GAAGATGGAA 


CTTGTTTTAT 


CTCAATAAAA 


GGCAGAAATT 


27660 


25 


CATGAAGTAT 


CATTGTTAAA 


TTGTCATCGG 


TAGTAACATC 


AAAAAAATGT 


CTTATAGATG 


27720 




CATGTTGTgC ACAATTGTCG 


ATAATGCATA 


CATCATTTTA 


GTAGCTTCAA 


CATTTTTAGC 


27780 




GAGTTTAACC 


CAATACGCAT 


TACGGTGTGT 


CGTTGATTCT 


GTATTATTTT 


TGTATATACG 


27840 


30 


AAAATATTCC 


TGTTGAAATC 


TCAAATTACC 


CATAATCATA 


AAAAGTCCTT 


CTTTCATATC 


27900 




ATAATACTCA 


TTACTTACTG 


AAATTGCATG 


ATGATATGAT 


AACCGACGAA 


ATGTTAATTA 


27960 


35 


ACTCGTTATG 


TAATGaTTAA 


TATaAAACAC 


CATTCGCAAC 


ATATGAGCGA 


TATATTCTAC 


2S020 


CCTAAAATAC 


ATCTTGTATC 


ATCGTTACAA 


TTGGTATATT 


TTTCAATGTA 


AATTACATAC 


28080 




ATCTTCGATA 


AATAGCACAC 


TACAAATCGT 


TAATCACTTT 


CTGTTGTTCA 


CATCTCATTG 


28140 


40 


CAAACTCAAT 


ATTGTTGTTA 


CAAAATATCC 


ATGAAGCAAG 


TTTATATTAA 


ACAAACAACT 


28200 




CGCATAAAAC 


AATTGTTATC 


CTTAAATTTT 


AACAAATTCT 


TAATAAATTT 


ATCTCTATTT 


28260 




TAATTACGAC 


CAAATTAATA 


GGTTTTCCAT 


ATAAAAAGAT 


GCATAAAATA 


AATATTTAAA 


28320 


45 


TAAATTCAAT 


TTGTATTCAC 


TTGTTTTTGT 


CCCCCAAATA 


CACCAGCAAC 


AAGCATGCTA 


28380 




GCACCAATTG 


TTAAAACGAT 


AAACATATAC 


AGTCCCATTT 


GTAATGACGT 


TAAGAAAACA 


28440 




CCCAACACAA 


TCCCTAACCT 


AGCTAGTGTT 


TCTGAAAAAT 


GAATACCTAA 


TGCATTAACT 


28500 


50 


GCACTATATG 


TTCCTCTTTT 


AGCTTTAGGA 


ATAATTTTAA 


AGCGTTGTTC 


TGAAACTATA 


28560 




GGCGAATAAA 


TAATTTCACC 


TACAGTCGCA ATTATCATAA AAACAACTAA TAAGCCAAAC 


28620 
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w 



15 



20 



AATTATCATG AATACCTACT TGATTCACAT GTAAAATATC TGCAAAAATT TGGCATAGCA 2 514 0 

AGTGTTCGGT ATCTGTACTC GGTGCTACAT AGGCATCCGT ATCGACATAG TCCATGATAG 2 5200 

GCAATGCCTT CTTATCTAAT TTCCCATTAA TAGTAATAGG AATTTGCTCA ATATGCATGA 2 5260 

AATTAACTGG TATCATGTAC TCCGGTAAGG TCATACGTAA TTGTGATTTA ATCTTATTAT 2 5320 

GTGATAATGT ATGCATCGCT TCATAATAAG CAACGATATA CTGATCTTGA TCATGATTTT 2 5380 

GAACAATAAC AACTGCTTTA TTAATACCTT GTATACGCTC GAGCGCATGC TCAACCTCTG 2 5440 

ACAACTCAAT CCTAAACCCT CGAATCTTAA CTTGTTTGTC CTTTCGATAT AAATAATCTA 25500 

TGTTGCCATC GGGTAACAAA CGAACGATAT CACCACTTCT ATACATCAGC TGaTTTATAT 25560 

TTGAATCTTT GATAAATTTA TCTGCTGTCA ATTCTGGCTG ATTTAAATAA CCTGCAGCTA 25620 

ACCCAAAGCC ACTTGTACAT AATTCTCCAG GAATACCAAC GCCACACCGA CGCTCGCCTT 256 8 0 

GCATGATATA AACATGAGTA CCCAGAATCG GTTTACCAAT AGGAATACGA TTTGGAACTT 25740 

TGTTAGGTAT ATTATACGTC GTTGTAAATG TTGTATTTTC AGTTGGTCCA TAACCATTAA 25800 

TAATTTGAGG ATGCTTCGGT TTTTGATTAA GCAAATCCAC CCACTTAGCA TTCAATACTT 25 3 60 

25 CTCCACCAAT TAATAAATAC TTTAACGGTA CCAATACTTC TATTCGTTCA CTAGCAATCT 25 920 

GATTAAATAA TGAGGAGGTT AACCACATAG TATTAACGTC ATTTTCATTG ATTAATTGTT 2 5980 

CTACCGCTAT TGGATTTAAT AATTGTTCTT TTTTAGCAAC AATCAGCTTT CCACCATTGA 2 6 040 

GCAATGCACC ATATATTTCA AATGTTGCAG CATCAAAGGC TATAGTTCCT GATAACAAAA 2610 0 

TCGTCGTCTC TTCATTTAAT GGTACATAAT GATTTTGATG GACCAAGCGA ACAATACCTC 2 616 0 

GGTGCGGAAT TAGTGTCCCT TTAGGGTTAC CAGTTGTCCC CGACGTGTAA ATAACATAAG 2 6 220 

CATGATCTTC TAACGTGTTA CATTTAGAAA GATTATCAAT ATTTTTCCAC GCTATCTTAT 2 6260 

TCAAiTCAAT GTGATTAATA TTTTGTTTAC CATTTTCATA TAAAGCTTGG TACGTTATTA 2 6 340 

CAACTTTAGG CGTTACATCT TTTAAAATGT ACTCCTGACG ATCACTTGGA TAGTTCGGAT 2 6400 

CAATTGGCAC GTAAGCCCCA CCAGCTTTCA ACACACCTAT CATCGCTATT ATCATCTCAA 2 6460 

TACTTTTTTC AGCTATGACA GCGACACGAT CATTAGGTTC AACACCATAC TGGTTTCTCA 2 6 520 

AACGGTGTGC TAAATCATTC GCGCGTGCAT TCAATGTTTG ATATGTTATA AACAcTCCGT 2 6580 

CAAATTGCAC AGCGACATGA TTCGGCGTTG CTTCAACTTG TTGCTCAAAT AAGGTAACAA 2 664 0 

CTGTTTGCGC ATCATCTATC TCAGGCAAAC TTAAATTGAT ATCGTCATAT AATTGAATAT 2670 0 

50 CACGTTCTGT CATCAAATTA AGTTCATCTA CAGTTGTTCG TTTATTTCCA TTTTCTTCAG 2 6 760 

TAATTTGCAA ATAAATATTT CGAACTAAGT CACTCAGCGT CTCGATTGAG AGCAAATCAT 7^920 
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TAGCATCCTC CAAAATTGCA CCTTGTCGTT TATTCGGAAA ATCAATATCG ATAGGTATAT 23 340 

AAGATGCACC TACTTTAACT GTCGCCAACA TCGCCGCAAT CATTTCAAAA CTACGTTCTG 23400 

TAAACAAGGC AACCCGTTGA CCATTGCCCA CACCATTTGA TAGGAGCATG TGCGCAATGG 2 34 60 

CATCCACATA GTTGCGTAAT GTTTCATACG TCATTGTCAA ATCATTCATG ACTAGCGCAA 23520 

CATGATTACC TTGTCGTGAG ACAACTTCAT TAAAGTAACT TATGATAGAT TTATTTCCCG 23 580 

GGACATTAAG CATTCGATCG TTAACATGCG TATTGACCCA ATTTAGAAGT TCCTCCGTGC 23640 

CGTTTGGTAT ATCACAAATT TGTAGTGTAT CTTGATGCTT CAAAATATAA TCAATCATAA 23700 

TCATACATTG ATTACCCATG TGACGAACTG TTTCTGAGTG ATATAAATCG GTATTATACT 23760 

CGATATTGAT TGTATAGTCA TCGCGATCTT CTTCAATGAT GAAAGATAAA TCAAATTTCG 23820 

CCGTCACTGA TTTGGGTTGA ATGTGTGTTA ATTTACTATG CCCAAAATGA GCATGATTCG 23 8 80 

TTTCATTGTT TTGTAGTACT AACATGACAT CAAATAATGG ATTCCGTGAG GCATCATGTG 23 940 

ATTGATCTAA GTCATTTACT AAACATTCGA ATGGGTATTC TTGATGCTCG TATGCCTCCA 24 000 

AACTCATTTC CTTAACCTCT TGTAAAAACT GTGTCCACAT TTTATCAGGT GACGGTTGCC 24060 

CTCTATATAC CAACGTATTA GCAAACATGC CTAGCATTTG CTCCGTGCCT TTATGCATAC 24120 

GCGCACTCAT CACACTACCG ACAACAACAT CATCTTTTCG AGCATATCTA CTTAACAACG 24180 

TCATGACCAC ACTCATAAAG AACATAAAAT CAGTAATTTG ATGCTTTTCT ACATACTTTT 24 240 

30 GAAGTAGCTG TCTCATTTGT TGATTCATTG TAAATGACAT CATTGCTCCA TTTGTCGTTT 24 3 00 

TAATATTTGG TCTAACATAG TCTGTCGGTA AGCTTAAAAT AGGTACTTCA TCTTTGAATT 24 360 

GAGATAACCA ATATTGTCTA TGTTTCGTCA TATCACGATG CGACATCCAC TCACTATAGT 24420 

CTTTATATTG CAATTTAAGT GGTAACAATA ATTTATGTTG ATAAAGTGCG TTAAGATCAT 24480 

TCATTAATTG TATATTACTC ATACCGTCAT TAATGATATG ATGCGTATCT ATAAAGAGGT 24 540 

ATG CATGTAA GGGACTTCTA ATGTATCTCA CTCTAATTTG ACTTGGCTTT TCCAAATTAA 24 600 

AAGGTGCTAC AAATTGGCGC ATGATTTCTT GTTCATCCGT AAAATGCGTG TTAACTTCTT 24 660 

CAAAGTCAAC TGCAACATCT GCCACAATAC GTTGTCGAAC CTCATCATCT ACAACAATAT 24720 

ATTGTGTTCG TAAAATCTCA TGTCGCGCTA TCAAACGCTG CACTGCTTGT CGCAATTGAG 24780 

CTACATTAAG TTCTGATGAT AACCGCCATA AAAAAGGTAC GTTATACACC GTATCTTTAT 24 840 

GGTTTGATTT CCATAATAAA TACATACGCT TTTGTGCAGA GCTCAGCACA TAATCATCTT 24 900 

TAACTATAGT TTCTGGAATC ACTT CAT AGT TTTGTTCTTG AACCTTAGCA ATCGCTTGTG 24 960 

CTAGTTCAAA TACAGTTGGC TTTTGTAATA AATCACCAAT TTGTAATCGT TTCCCAGTAG 25020 
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CACCTGCATG AATAATCGTA TCCATGTTTT CTGGTAAAAC AACATCATCC ATACACTCGA 21540 

AATCACCAAC AATGACTTCA ATGTTTGATA ACATTATTTC AACCGTCTCT TCTGAAAAAT 21600 

5 

AATCATTTAA ATTCGTCATC AACTTATACC ATGCTATTTC CTCATTATCA GCACGTATGA 216 60 

AACAATAAAT GCGATGACTG TATCCTTGTA GTACTTCAAT CAGATAAGCA CCTAAAAAAC 21720 

CTGTCGCGCC AGTCAATAGT GTATTTCCTA GAGGTCGATG ACTTAGACTA TCCTCTAAAA 217 80 

w 

TACCCAAGTT ATAACGAGAC ATAACAATCT TTTGTAATTC CGAAAGATTA TCCGGTAATG 21840 

CAACTAATGA TTGTTGATTT TGGTACATAT AATTAACAAT CTGTCGCACG GTTTTATATT 21900 

1S GGTATAATGT CTG CATTGAA ATATGATGGC CAAATCGTTT TAAATGCGAG ACAACTAACA 21960 

TCGCCTCTAA TGAGTTACCA CCAAGTTCAA AGAAATCATC GTCAACACCG ACATCATTTT 2202 0 

GTTTCAATAC CTCTCCAAAT ACATCAACAA ATGTCTGCTC AATTTCATTA GAGGGTTCGC 2208 0 

20 TATACACTTT ATTAGACTGT TGTATAGGTG ATGGATTTGG CAAACGCGTA GTATCCACCT 2214 0 

TGTCATTCGT GGTTAATGGC ATACAATCGA TATGCGTTAT AGTCTTAGGA ATCATATACT 222 0 0 

TAGGCAGCTG ATCATTTAAA TATTGCTTCA AATCCTGTTC CACTTGTTGC TCTCCGACAT 2 2260 

25 AATAAGCATT CAATATATCA TGCGTATCAA AGTGACTTAC TGTTACAACA CAAT CAGAT A 2232 0 

TACCACGAAT AGCTAATATT GCATTTTCAA TTTCATCAAG TTCAATACGG TACCCGTTAA 223 80 

CTTTCACTTG TTTATCTATT CTTCCTAAAA ATTCAATTTG ACCATCAGAT GTATAACGTG 2 244 0 

30 

CTAAATCACC ACTATGATAC AACTTTCCTT TACCAAATGG ATT ATTTTG C CATTT AT CAG 22 500 

CCATTAATTC TGGACGATTA ATATATCCTA TCGCTAAACT ATCACCTGCA ATACACAACT 22 5 60 

CGCCTGGCAT AC CAAT AC CG CATAACAAAC CATCTGACAT AATATACACT TGGATGTTAG 22 62 0 

35 

ATAAGGGTTT GCCAATTGGA ATCGTCTCAG GTATCAAATC ACCACAATGA TGTGACCAAT 22 6 80 

ACGATGTGAT GACTGTTGAC TCAGATGGTC CATAGGCATT GAAATACGTG CCACAATGCT 22 740 

TCTCAATATA TTTAACAAAG GATGCCGTAC TAGTTGCCCC GCCTGTAATC AACTTTTCAA 2280 0 

40 

TATAAAAGTC TTCCATAACA CTACACATCT GTAACGGAAT CGACGCAACC GTCACACGAT 22860 

GCTTATTAAT GAGTTGTTGT AACTGTTCTG GATTAACACG TTCCTCTCTA TCTGGAATCA 2 2 920 

45 CAAGCGTATG AC CATTT AAC AAACAACAAT AAATCTCCAT AACTGATGCA TCAAAAACAA 22 980 

TATTTGCATG TTGCAAAAAT ACTTCATTGT CGCCTAATTG CAATTCAGTT GACCATGCAT 23 040 

GCACTAAATT CAACAAATTT CGTTGTCGTA TGGCAACCCC TTTAGGCATC CCGGTCGTAC 2 3100 

50 CAGATGTGTA AATAGCATAC ATCTCATTAT CTAACATCGC TGTGTTTTCA AGTTGATTGC 23160 

^AT^^^A^^ * i m ^*'rrvT ir "<^'" ^^t^t^ ^^^,r^p..^ ? , - * ft ^ ^ / ^^ rT ^ VT . » ^ ,^ „ . T »„^,„ .. « 
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AGTGCGACAA CATTTTGTCT ATCGCTATAA 
TGAGTTTtAA ATCATCGTTT GAGTATGATG 
5 ATGATATGAT TCCAGCCAAC TTACGTGAGC 

ATTATTTTCG GTAATTTCAA TATTGCTCAG 
TCGTCATGAT TTTGAATGAT GCCATCGTGT 

10 

ATCTAACAGT AAACATTTAG GCAGTATAGT 
CGTCATATTA ACTACTCAAT AACTGAAATA 
GTCCTAAGAA ACATACCTGT ACCATAAACT 

15 

CACGTGATTG ATAAAAGTGT GATTGTGTCA 
CTTCACCAAT CAATTTTGTA AAACTTTCTT 

20 AACTACATAT TTGATGTGCT TCGTTTGTAG 
AACGTTGTGA TATCTTTTCG ATATCAATAC 
TATAAGGATA ACTATATGAT AAGCTCACAT 

25 CAATATCTGC CTTACCTCGT GGCGAAATGT 
TGTCATGTTG AATTCCATAT TGCACTAAAA 
GATTGTATCT ATAGTTGACT GTACGCGGTT 

30 ATTCTTCAAT ACTTTTCAAG TTACTCTGTA 
CTACTTACTT ATTGAATATT GTTTTGATAT 
TTATCGTTGG CCATTTTTCA CTGATGTGAT 

35 

TCATTGCTAG TTGTTGTTCA CGGTCAACAC 
GTTTCTGTAA AATTTCATTA AATGATTCAT 
CGCATTCTAA CAAAGATTTC ACCGGCATTT 

40 

GTGGTGTGTT GACCTGTGCT AATGCGACAA 
AATCTACAGG CATTTCAGCC ATGCTAACCC 

45 TTACCATTGA AAAACGGTTA GTCTTTATAT 
TCGTCAAATT ACCAACACGT ACAATCCGAC 
CTTTTAATTC ACTATAAAAT TTGCTCCGTG 

50 AGACATCCGC TTsTGAAAAT GTCACATCTT 
TT AT CGTAGA CACATATATT AACCTTGCAT 



ATTAAATTAA TACACTGACT AACATTAACG 19740 

ATTGATGCTC ACGTTATTTT ATTAACTGAC 19800 

ATTAAAGTCT CAAATGCGTC GTAACAAACT 19860 

TATATTTTTA CCTTATCACT TACTTTAATC 1992 0 

ATTCACCTTT CATTTTTCCA ATAAAAAAAC 19 980 

TTAAAATCAC TGCGCAATGA TACTGTCAGA 2 0040 

CAGACACTTT TTTATAACCC CAGGGTGCCT 2 0100 

GATCAAAAAT AAATTGTTTG AACTTCACTT 2 0160 

TATCATAAAT GTCCAATCCT TTGATTAAAC 20220 

TTTGTGTCCA TATTTGATAA AAATCATTTA 20280 

AGAAACACGT CACTAACGTA CGCCAGTCTA 20340 

CAACTGGTTC TTTATCGACA ACACACACGA 20400 

AGATGGGCTG TCCATCACGA TTGTGTTGAA 20460 

GATAATGCCA TTCATGTGGT AATAAACCTG 20520 

TATCTCCCAA TCTGTGCATG AGTTTATCTT 20580 

TTTTATATGA CCAACGACTT TGTGATATTA 20640 

ATTGCATTAC AAATACTGTC ATAACTTTCC 2 0700 

ATTGTGCCCA ATGATACAGC CAATTGTTAG 20760 

TCATTATTTT TAATGTTAAT GTTGTATCTA 20 820 

TAGTTAATCC AATCGTTTCG TACATGTCTT 20880 

CGCTGACGAG TTCAATTTCT TTGCGCTTAA 20940 

TATTAGGTGA TAGCACATGG TAAATGATTT 21000 

TTTGTCTTGC AGTCGTATCC ACAAAAGAAA 21060 

CGATACAATC CAGTTGTAAC AAATCATTCA 21120 

TTCTCATATG CCATCTTCCA TTGTAAGGAT 21180 

CATCTAAGCC ATTATTTACA GCTTCTAATA 21240 

TATATGGTGA TGTTAGTAGT TGCCCTTTAT 21300 

CTGTGTCTAT ATCAAAATAA GTTCCCACAC 21360 

GATGTTGTTG TGCCAAACGT ATGACATCAA 21420 
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CTATCGTAAG TTCAAATCTA GTAAAATCAG 
GAATCATTAC AGCTATTGGC TACAGCGGAT 
5 AACTGAACCA AATACCTGTC GTTGATAAAG 

TGT CG AATAC ATTAACTCAT GAAAATGTCA 
CTGCAGCGAC TGCATACGGT TGTGATTTAT 

10 

ATCGAACATA AGGTGGTGTT GGTTAGATGA 
CACTTAGTGA CATGCATCCA TCAATTATTA 
TCTACCCCAT TATCGTTCAT GGCGGTGGCC 

15 

AAATCGAGCC ACACTTTGTT AATGGCCTAA 
CTAAACACAC GCTCATTGCA GACGTTAACA 

20 AATGTTCTGC AATAGGCTTA TGTGGTTTGG 

ATCAACAATA TGGATATGTC GGTGTTCCGA 
TATGTACTAA ATTTGTACCT ATCATCAATT 

25 TTTACAATAT TAATGCTGAC ACGCTTGCCT 

TTTATGTATT AAGTAATATT GCAGGTGTAC 
CATTAGTCGA TATTCATCAA TATATTGAAC 

30 

AAGTGCTAGA TGCCAAAAAT GCGATTGAAA 
GAAACAAGCC AAATATCATT GAATCTATTT 
TTAATTCATA ACTATGAAAT TAAGGCCTAA 

35 

CGATTATCCA TTGCG CTAAA ACATTTATTT 
AGT2TCGATAC AATATGAGAT GTAAAAAACT 
TATTTTAAGT TTTGCAAAGC TTTTTATTGT 

40 

TAGTAAATAG AGAGGAGTCC TTAAGTTGAC 
TGGAATTATC GGTGGCTTGT TATCTGGAAT 

j 5 ACCTCGCACA CCAGAACGTA ATGCAACGAA 

ATTTAGTAGT GAGTTTACGC ATCAAACATA 
AAGCTTTATT GTCCACTTTA GTTTTTCTAT 

50 TAAAAAATAC GCTTACTTAG CAATGGGACA 



CTATTTTTGG CGAAGATGCC AATTTTGGTC 17 940 

GTGAAATTGA TCCTAACTGC ACATATGTTC 1800 0 

GTATGGCTGT ACTATTTGAT GAGCAAGCTA 18 060 

CAATTGACGT TCAGCTTGGT TTAGGTAACG 1312 0 

CCTATGATTA TGTGCGTATC AACGCATCAT 1818 0 

AATTTATTGT CATTAAAATT GGTGGCAGTA 18240 

ACAACATTAA GCATTTACGA TCAAACAACA 18300 

CATTTATTAA TGAAGCATTA TCAAACCAGC 1836 0 

GAGTGACTGA TAAAGCAACC ATGACCATTA 18420 

CTGCATTAGT AGCTCAATTT AACCAGCACC 184 80 

ATGCACAGCT GTTTGAAATT ACATCTTTTG 18540 

CCGCTTTAAA TAAGGATGCT TTACAGTATT 18 600 

CGATTGGTTT CAATAACCAT GATGGAGAAT 18660 

ATTTTATTGC ATCATCATTA AAAGCGCCTA 18720 

TCATCAATGA TGTTGTTATA CCTCAATTGC 18 780 

ATGGTGATAT TTATGGAGGT ATGATTCCCA 18 840 

ATGGCTGTCC TAAAGTTATC ATTGCATCAG 18 900 

ACAATAATGA TTTTGTTGGC ACAACAATCC 18 960 

CAAGTTTTGA CACGCGAGAT GATTCCAGTT 19020 

ACCGTTCATC TCGTTAACAA TTTTGAATAC 19080 

AATAACCTTT TACAAATTTG TTTATCAAAA 1914 0 

GATTATTTTC ACAAAATACT ATAATGAGGA 19200 

GAAACGACAA ATGGGTATAT TCATTTATGC 19260 

TGTAAAATTA GGTTGGGAGG TCATGTTTCC 1932 0 

CCCACCTCAA GAGTTATTGC AACAATTAGG 1938 0 

TACATTTTCA AATATGGAAT TGCCTTGGGT 19440 

CGTCATTGCA ATTATTTACT GCATATTAGT 1950 0 

AGGTGCTGTT TTTGGTATTG CTATTTGGGT 1956 0 
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ACTTACAGTG GaTAATAATG ACTGTGATGT AATTTTCTTT GCGACACCAG CACCCGTAAG 1614 0 

TAAAACATGT ATCCCTCCCT TAGTAGAAAA AGGTATTCAT GTTATCGATT TATCTGGCGC 16200 

5 

ATTTAGAATT AAGAATCGTG AAATATATGA AGCATATTAC AAAGAAACTG CTGCAGCACA 16260 

AGATGATTTG AATCATGCTA TTTACAGCAT TTCAGAATGG CAATCGTTTG ATAACAATGG 1632 0 

AACGAAGCTC ATTTCTAATC CTGGCTGTTT CCCTACAGCA ACATTATTAG CATTACATCC 16380 

10 

ACTTATTAGC GAAAAAATAG TAGATTTGTC ATCTATTATT ATTGATGCTA AGACCGGCGT 1644 0 

GTCAGGTGCT GGTCGTTCAT TATCACAACG gTTcATTTTT CAGAAATGAA TGAAAATCTA 16500 

15 AGCGCTTATG CAATCGGAAA CCATAAACAC AAACCGGAAA TCGAGCAATA TTTATCTATC 16560 

ATTGCGGGTC AAGATGTATC AGTCATATTT ACACCACATC TCGTACCAAT GACACGAGGT 16620 

ATTTTATCAA CAATATATGT CAAATTATCA TCTGAATATA CGACTGAATC ATTACATAAA 16680 

20 TTAATGACCT CTTATTATGC TAATCAGCCA TTTGTCAGAA TTAGAGATAT TGGGACTTTT 16740 

CCAACCACAA AAGAAGTACT CGGTAGTAAC TACTGCGATA TCGGCATCTA TGTAGATGAA 16800 

ACAACGCAAA CAGCAATTTT AGTATCAGTG ATTGATAACC TTGTCAAAGG CGCAAGTGGG 16860 

25 CAAGCCATTC AAAATTTAAA TATATTATAT GATTTTGAAG TGACGACTGG CCTAAATCAA 16920 

TCACCAGTTT ATCCATAAGG GGTGTTAGAA TGAAACATCA AGAAACGACA TCACAACAAT 15980 

ATAACTTTTC AATTATTAAA CATGGCGATA TCAGTACACC TCAAGGCTTC ACGGCTGGTG 17040 

30 

GTATGCACAT CGGTTTACGC GCTAACAAAA AAGACTTTGG GTGGATTTAC TCATCGTCTT 17100 

TGGCAAGTGC AGCTGCCGTA TATACTTTAA ATCAGTTTAA AGCTGCACCA CTTATTGTCA 17160 

CTGAAGACAC TTTACAAAAG TCTAAAGGAA AATTACAAGC ACTTGTTGTT AATTCAGCTA 17220 

35 

ATGCAAATTC TTGTACCGGT CAACAAGGCA TAGATGATGC ACGACAAACA CAAACATGGG 17280 

TTGCTCAACA ACTTCAAATA CCATCTGAGC ATGTTGCTGT TGCTTCAACT GGGGTCATTG 17340 

GTGAATATTT GCCTATGGAT AAAATTAAGA CTGGGACCGA ACATATTAAG GATGCTAATT 17400 

40 

TTGCAACGCC AGGTGCGTTT AACGAGGCAA TTTTAACAAC TGATACCTGT ACAAAACATA 17460 

TCGCTGTATC ACTAAAAATC GATGGTAAAA CCGTTACAAT TGGTGGTAGC ACCAAAGGTT 17520 

45 CAGGTATGAT TCACCCAAAT ATGGCTACCA TGCTTGCTTT TATAACAACC GATGCATCGA 17580 

TTGAATCGAA TACACTTCAT CAATTATTAA AATCTTCGAC TGACCATACA TTTAATATGA 17640 

TTACTGTTGA TGGCGATACA AGTACAAATG ACATGGTATT AGTCATGGCA AATCACCAAG 17700 

50 TTGAACACCA AATACTTAGT CAAGACCATC CACAATGGGA AACATTTGTT GATGCATTCA 17760 

ATTTTGTCTG TACATTTTTA GCTAAAGCTA TAGCCAGAGA TGGCGAAGGC GCAACAAAGT 17820 
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GCA CATC ATT CTCACAAACT TTGATTAATT TCAGCCAAAA ATATTTACCA TTATCAGACA 14 34 0 

TTGGTATGGG CTGGGTTGTT CTCAGTTTGA TTGGTTTCAT TATCGGCTTC ATTATTTATA 14 4 00 

5 

AAATT AAG CA TCGTAAAATT CCACAAGCAT AATACTATGC CACAGT CAT A TGTTAAACAT 144 6 0 

ATGCTTGTGG CATTTTTTAT TCATACTACA TTAAACTGCA ATCGTATACA TACATATCAA 14 52 0 

TGATTATCCA CAAAAAATAT TAGTACTTTC ATTTTACAAA TCACATTAAT ACAAACACAA 14 580 

10 

CCTTATCTTT ATATTATTAA ATTTATATTT GACACTTATA TTGAACAACT GTAATATATT 14 64 0 

AATATTAATT CTTTAAAATG TATAAATATA AAGGAGGGAG ACCGATGaAT TCAATCATTG 14 700 

aATTAACTGA TTATTATAGC TCTAATAATT ATGCACCACT TAAGCTTGTC ATTTCTAAAG 14 76 0 

15 

GTAAAGGTGT CAAAGTTTGG GATACTGATG GCAAACAATA TATAGATTGC ATTTCGGGTT 14 82 0 

TTTCAGTTGC AAACCAAGGC CATTGTCATC CAACAATTGT TAAAGCGATG ACAGAACAAG 14 880 

20 CTTCAAAGTT GTCTATCATT TCA CGTGTCC TTTATAGTGA CAATCTCGGG AAATGGGAAG 14 940 

AAAAAATTTG TCATCTTGCT AAGAAAGACA AAGTACTCCC CCTTAACTCT GGTACTGAAG 15000 

CTGTTGAAGC AGCCATTAAA ATTGCTAGAA AATGGGGCTC TGAAGTTAAA GGCATTACTG 1506 0 

25 ACGGACAAGT TGAAATCATC GCTATGAATA ACAATTTTCA CGGTCGTACA CTTGGCTCAT 1512 0 

TATCACTATC TAACCACGAC GCATATAAAG CAGGATTTCA CCCCCTACTT CAAGGCACTA 15180 

CAACAGTAGA TTTTGGAGAC ATTGAACAAT TAACACAAGC TATTTCACCG AATACAGCAG 15240 

30 CAATTATTTT GGAACCAATT CAAGGTGAAG GTGGCGTTAA TATACCACCG AAAGGATATA 153 00 

TTCAAGCTGT GCGTCAACTA TGTGATAAAC ATCAAATATT ATTGATTGCA GATGAAATTC 15360 

AAGTTGGTCT TGGTAGAACT GGGAAATGGT TTGCTATGGa ATGGGAGCAA GTCGTTCCAG 15420 

35 

ACATTTATAT TTTAGGTAAG GCATTGGGTG GCGGCTTATA CCCTGTATCT GCTGTACTTG 15480 

CAAATAATGA TGTCATGCGT GTTCTAACAC CAGGTACACA TGGTTCAACA TTTGGTGGTA 15 540 

ACCCTTTAGC CATTGCAATA TCGACGGCAG CGCTTGATGT ACTTAAAGAT GAACAACTGG 1560 0 

40 

TTGAACGATC AGAACGCTTA GGTTCATTTT TATTAAAAGC GTTGCTACAA CTT AAA CATC 15660 

CTAGTATTAA AGAAATTAGA GGTCGTGGTT TATTTATAGG CATAGAGCTT AACACAGATG 15720 

45 CTGCACCTTT TGTGGATCAA CTGATTCAAC GTGGAATCTT ATGCAAAGAC ACGCATCGTA 15780 

CTATCATTCG ATTGTCTCCA CCTCTAGTCA TTGATAAAGA GGAAATCCAT CAAATTGTTG 15 940 

CAGCTTTTCA AGACGTTTTT AAAAATTAAC AATTAATCAT TTATATATGA CATAGGAGGG 15 900 

50 ATTCATGATG ATTAAAGTAG GTATCGTTGG CGGTAgcGGT TATGGCGCAA TTGAATTAAT 15 960 

TPGATTGTTA CAAACACATC CTCATCTAAP PATTOrAPAP AT^TA^^A^ bt^AAAAP- < r - — 
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AGTCGTTTCC ATCATTTATT AGACAAGAGA GATGACGATT TTGTCATAGA CAAACGACAT 12540 

TTTAGTGCAT TTGTAGGAAC AGATTTGGAC TTACAATTGC GACGTCGAGG AATTGATACG 12600 

s ATTGTTCTTG GTGGTGTCGC AACGCATATT GGCGTAGATA CGACAGCGCG AGATGCCTAT 1266 0 

CAATTAAACT ACAATCAGTT TTTTGTTACA GATATGATGA GTGCACAAAA CGAAACGCTA 12720 

CATCAATTTC CAATAGATAA TGTATTCCCA TTGATGGGAC AAACAATAAC TACAAACGAC 12780 

10 TTTCTAAATA TATTGAACTA AACATATACT TCCCCCCTTC GATCATGTTG AGGGGG AT CT 12 840 

TTATTTCACA AAGTATTAAT ACGTCGGGTT GTCTAACCTT CTATATTTAA CATATTCTAT 12 900 

ATCTGTTAAA TCGTTCTTAA CTTACGCCCC TACTACATAA AAAACAGTAT TTATTCCGGA 12960 

15 

ATTTTCAAAA AATTTAGTAT TTATTGCAAA ATTATGTATC ACTTTATGTT TAATTTTTGA 13 020 

TATTATCTTA ATTAAGTAGA TTTTTATAAG TTCTAAAAAG GAGAACAAAT ACATATATGA 13 080 

AGAAGAAACT AACATTTAAA GAAAACATGT TTATAGGTTC TATGTTATTT GGTTTATTCT 13140 

20 

TTGGTGCCGG CAATCTTATC TTCCCAATAC ACTTGGGTCA AGCTGCTGGT TCTAACGTTT 13200 

TTATCGCTAA CTTAGGATTT TTAATTACAG CAATTGGCTT ACCATTTCTA GGTATCATTG 13260 

25 CTATTGGCAT TTCAAAGACA TCTGGTTTAT TTGAAATTGC ATCGCGTGTT AATAAAACAT 13 320 

ATGCTTACAT TTTCACGATT GCCTTATATC TAGTTATCGG ACCATTTTTC GCCTTACCTA 13 380 

GACTGGCAAC GACATCATTT GAAATTGCAT TTTCGCCATT TTTATCACCA AAGCAAATCA 13440 

30 CTTTATATTT ATTTATTTTT AGCTTCGTCT TCTTTGTGAT TGCATGGTTT TTTGCGAGAA 13 500 

AGCCATCAAG AATTTTAGAA TATATCGGTA AATTTTTAAA TCCGGTATTC TTAGTATTAT 13560 

TAGCAATTAT TTTATTATTT GCTTTTATCC ATCCATTAGG TGGCATATCT GATGCACCTA 13 620 

35 

TTAGTAAACA ATATCAATCA CATGCCTTAT TTAACGGCTT TTTAGATGGA TACAATACCT 13 680 

TAGATGCGCT AGCGTCATTG GCATTTGGTA TTATCATTGT TGCAACGATT AAAAAGTTAG 13740 

GTATCGAAAA TCCAACTGAT ATCGCTAAAG AAACAATTAA GTCTGGTACT ATCAGTATCA 13 800 

40 

TTATGATGGG GATCATTTAT ACCCTACTAG CAATCATGGG TACATTAAGT ATTGGTCATT 13 860 

TCAAACTTAG TGAAAATGGT GGTATTGCCT TAGCGCAAAT TACTCAATAC TACTTAGGTA 13920 

45 ACTACGGTAT CGTCCTGTTG TCACTTATCG TTATGGTTGC TTGTTTAAAA ACAGCCATCG 13 980 

GTTTGATTAC GGCATTTTCA GAAACATTCG AACACCTTTT CCCTAAAATG AATTACCTAG 14 040 

CGATTGCAAC AGTTGTAAGC TTTATTTCGT TCTTATTCGC GAATGTTGGT TTAACTAAGA 14100 

50 TTATTATGTA CTCAGTCCCA GTGTTAATGT TCTTATATCC ATTAGCAATT GCCTTGATTG 14160 

TACTAACATT ATTTAGTAGC AAATTCCATC ATTCAAAACT TATTTATCAA TGTACCATTT 14220 
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ATGAATTAAA CGCAAGTTAC GCAGCGGACG GTTATGCCCG TCTTAATGGA CTCGCTGCAT 10 740 

TAGTTACTAC ATTTGGTGTT GGCGAATTAA GTGCCGTCAA CGGTATCGCA GGTTCATATG 108 00 

CTGAACGCAT ACCTGTCATT GCGATTACAG GTGCGCCGAC ACGTGCTGTT GAACAAGGCG 108 60 

GTAAATATGT ACATCACTCA CTTGGTGAAG GTACATTTGA CGACTATCGA AAAATGTTTG 10 920 

CACATATAAC CGTTGCACAA GGTTATATCA CACCTGAAAA TGCAACAACC GAAATACCAC 109 80 

GTTTAATTAA TACAGCAATC GCCGAAAGAC GCCCAGTTCA TTTACATTTA CCAATCGATG 11040 

TCGCAATCTC TGAAATTGAG ATACCGACAC CATTTGAAGT GACGGCAACT AAATATACGG 11100 

ATGCATCAAC ATATATAGAG TTATTAGCAA CTAAACTGCA TCAAGCGAAG CAGCCTATCA 11160 

TCATTACTGG ACATGAAATT AACAGTTTTC ACCTCCATCA AGAATTAGAA GATTTTGTAA 11220 

ATCAAACACA GATACCAGTA GCACAACTTT CATTAGGAAA AGGTGCTTTT AATGAGGAAA 11280 

ATCCATATTA TATGGGTATT TACGATGGGA AAATTGCCGA AGATAAAATA CGAGATTATG 1134 0 

TGGACAACAG CGATTTAATT TTAAATATTG GAGCCAAATT AACAGATTCA GCAACAGCAG 114 00 

GTTTTTCATA CCAATTCAAT ATCGATGATG TCGTTATGTT AAATCATCAC AATATCAAAA 11460 

25 TTGACGATGT TACAAATGAT GAAATATCTC TACCATCATT GTTAAAACAG TTATCCAATA 11520 

TTTCATATAC GAATAACGCA ACGTTCCCTG CGTATCATCG TCCAACATCA CCCGATTATA 1158 0 

CTGTTGGCAC AGAACCATTA ACACAACAAA CTTATTTTAA AATGATGCAA AATTTCTTAA 11640 

AACCAAATGA TGTCATCATT GCTGATCAAG GTACATCATT CTTTGGTGCT TATGATTTAG 11700 

CATTATACAA AAACAATACT TTTATAGGGC AACCGTTATG GGGTTCTATC GGCTATACAT 11760 

TACCTGCAAC ATTAGGTTCA CAATTAGCAG ACAAAGATCG TCGTAACTTA TTATTAATTG 11820 

GTGATGGCTC ATTGCAACTA ACTGTTCAAG CTATTTCAAC TATGATTAGA CAGCATATTA 118 80 

AACCGGTATT ATTTGTGATT AATAATGACG GCTATACGGT AGAACGACTT ATTCACGGCA 11940 

TGTATGAACC TTATAATGAA ATTCACATGT GGGATTATAA AGCTTTACCA GCTGTATTTG 12 000 

GTGGTAAAAA TGTTGAAATT CATGACGTTG AATCATCAAA AGATTTACAA GACACGTTTA 12 06 0 

ATGCAATTAA TGGTCATCCC GATGTGATGC ATTTTGTCGA AGT CAAAATG GCTGTCGAAG 1212 0 

aCGCACCGAA GAAACTCATC GATAtCGCTA AAGCTTTTTC ACAACAAAAT AAATAATTTC 12180 

ATCGTATACA GGGTATAAGT TTAAGCGAAT ACTTTATTAA ACGAATAGGA CTCTGATATA 12 240 

AGATGATTAA TTTTAATAAA ACCGCTTTAG TGTTAATCGA CCTGCAAGAA GGTATTCTTA 123 00 

AAATGGATTA TGCCCCATAT ACAGCTGAAA ATGTCGTTCA AAACGCTAAT AAATTAATAG 12 36 0 
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TATTAATTCC ATTCGGTCTA CATCACATTT 
CATGGAAAAA TGCAGCTGOT GAAATTATTC 
5 TTCGTGAAGG CGCACATTTG ACAGCTGGTA 

TGTTCGGTTT ACCTGCAGCA GCTTTAGCAA 
AAGTAGTAGC AGGTTTAATG GGTTCTGCTG 

10 

AACCATTAGA ATTCTCATTC TTATTTGTAG 
TTGATGGTTT ATCATTCTTA ACATTG TACT 

15 CAGGTGGTTT CATCGACTAC TTCTTACTCG 

TAGTCATTCC TGTAGGTCTT GTATACGCAG 
TTGTAAAATT AAAATACAAA ACACCAGGTC 

20 CTTCAGCAAC TGAATTACCA TATGCAGTAT 

AACATTTAGA CGCTTGTATC ACACGTCTAC 
ATGTTCCTGG TTTGAAAGAT TTAGGCGCAT 

25 AAGCAATTTT TGGTCCTAAA TCTGACCAAA 

GTCAAGTAGT AGAAAATCCT ACTACTATGG 
CAGAAGATAA ATCTGCAACA AGCGAATTGA 

30 

AAGTAACACC ATT AT CAG AA GTGCCTGATC 
GTATCGCTAT CAAACCTTCA CAAGGTGAAG 
TGATTTTCCC AACAAAACAT GCAATTGGTC 

35 

TCCACATCGG TTTAGACACT GTTAAATTAA 
AAGGTCAAGA AGTTAAACAA GGTGATTTAT 

40 ATCATGCAAA GAGTGATATT ACGCCTATTA 

ATTTTAAACA AGGTGAACAT GGCAACATTT 
AATGCTTACT ATAAACAGGT GCGTATACCT 

45 CTATTGTATT TTG CAG CATC ATTGATAGTT 

TCATCAATTA AAGCCCCCCT TCATACTCAT 
TTAGCGTTTT CACAACAAAG TCAACTTCCT 

50 

TTAAGTGTAA ATATAGAGAA AAGGTGGCTT 
TAATTGACGC TATTCATCGA GCAGGCGTCG 



TCCACGCACC GTTCTGGTTC GAGTTTGGTT 894 0 

ACGGTGACCA ACGTATCTTT ATCGAACAAA 90 0 0 

AATTCATGCA AGGTGAATTC CCTGTTATGA 906 0 

TTTATCACAC AGCTAAACCT GAAAATAAGA 9120 

CTTTAACATC ATTCTTAACT GGTATTACAG 9180 

CACCATTATT ATTCTTTATT CACGCaGTAC 9240 

TATTAGATCT TCATCTAGGT TATACATTCT 93 00 

GTATACTACC TAATAAGACA CAATGGTGGT 93 60 

TTATTTACTA CTTCGTATTC CGATTCTTAA 942 0 

GTGAAGATAA ACAATCACAA GCGGCTACTG 9480 

TAGAAGCTAT GGGTGGCAAA GCAAACATTA 954 0 

GTGTTGAAGT TAACGACAAA TCTAAAGTTG 9600 

CTGGTGTATT AGAAGTCGGC AATAATATGC 966 0 

TCAAACATGA AATGCAACAG ATTATGAATG 9720 

AAGACGATAA AGACGAAACT GTTGTTGTTG 978 0 

GCCATATCGT GCATGCACCA TTAACTGGTG 9840 

AAGTGTTCAG CGAAAAAATG ATGGGTGACG 9900 

TTCGTGCACC ATTCAACGGT AAAGTACAAA 9960 

TTGTATCAGA TAGTGGTTTA GAACTATTAA 1002 0 

ACGGAGAAGG CTTTACTTTA CATGTTGAGG 10080 

TAATCAACTT TGATTTAGAC TACATCCGCA 1014 0 

TCGTGACACA AGGAAACATT ACAAACCTTG 10200 

CATTTGGCGA TCAATTATTT GAAGCTAAAT 10260 

TCATAAGGTG ACGCGCCTGT TTTTTCTTTG 10320 

CGCTCTCCCC TTAAATTTTG AATTTTAAGA 10380 

TTCCTAAAAA ATATTAATTG TTCACTATTG 1044 0 

TGACCTTACA CTATATTCGA GGCTATCATT 10500 

TTTTTATGAA ACAACGCATT GGAGCTTACT 10560 

ATAAAATTTT TGGTGTTCCT GGTGATTTTA 10620 
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AAGTTGTTTT GCCTGCCTCA TTTCAATCAA TGAGTCTCCA ACTAACACTT CAGATACACC 714 0 

AGTTTCTTGT AATAATTTAG CTGCAACGAC AGGATGACTA TGTCTCGTTG CTTCAATTGT 7 20 0 

5 

TGGCAAGCCT TTATGCAAAG GACCTCGCAA ATCACTCCCT ACAATAAAAC CATATATTTG 7 260 

TGCCTTTGGA TTAAATTGAT AAATGAGTTC ATTTTTCTTA TTGACCAAGT CAACAGATAA 7 32 0 

TCCCGTATCT GGTCTTGGAT AATAGTTATG ACAAAATGAA AGTAATGTAA AATCATTCAA 73 8 0 

10 

TTGTTGATGT AAGCTTGTTA ACAATTCCCG GGAAATAATA CTTGCATTCA AACAGCACTT 744 0 

TAAACCCTGT GCCATTATCG CTTCGATTGC CTCAATTGAT GTACTATGAT CGATACGAAT 7500 

15 CATAAATTGT GCATCATATT GTCGAAGATG GTCATAAAAA GATGGTGTTA AAATAGATGG 7560 

ATTAGCATCT ATGAGGTAAG TCACTTGTTC ATGTTTTAAT AAATTGAGTA GTTTTGTGAA 762 0 

ATAATGATAT TTTGTCTCGT CATCTTCTTC TGGTATTTGT ACAGATGTAA AAATCATTTG 7680 

20 GTAACCTTGT TTAATCATTC GCTTAATATA CGCTTCATCT AAAGGTTGTC CTAAATACAC 7 740 

TGAAAAGCCT GTCAAAGTAG CCCTCCTTAA CAATATAATT ATTAGGAAAA TATAGTTGAT 7800 

TTGTGTAATC GCTTACATTT TACTATAAGA GAAAACACAT TACAATATTA AT CAGTTAAA 7 360 

25 

GCCTGTTCAT TGTAATAATC TTACATATTT CTGTCACAAG TTAATTATTA CACCATCAAA 792 0 

GATTATCCTT TCTTTTAAGT GCTGATAATA GCTGCTACTG CTGGATTATT ACAATAACTT 7 98 0 

TTATACATTT TATTCAGGAT TATCTTATAT TATGTTTTAA TAATAATCTG TGAACAATTA 804 0 

30 

AGAGATTTGA AATTGAATTT AATAATTGTA TTGAAAACGC ATACTTCACC ATGCTAAAAT 8100 

AGGAGTCGCA AACAAATAAG ATTCAATAAG ATGTGATGGT TACCAACACA GTCTATTTGC 8160 

35 TCGTGTCTTT TTTTATTGAA TCTTAAATAA TAAATACAAC TTTGGAGGTT GGACAAGTGA 8220 

GGAAGAAACT TTTCGGTCAA TTGCAACGTA TTGGTAAAGC GCTAATGTTA CCTGTTGCGA 8280 

TTTTACCAGC AGCTGGTCTG TTATTAGCTA TCGGTACAGC TATGCAAGGT GAATCATTAC 83 4 0 

40 AACACTACTT GCCGTTTATA CAAAATGGTG GCGTACAAAC TGTCGCTAAA TTAATGACAG 84 00 

GTGCTGGTGG TAT CATTTTT GATAACTTGC CTATGATTTT CGCATTAGGT GTCGCAATCG 84 6 0 

GATTAGCTGG CGGTGATGGC GTAgcAGCTA TCGCAGCATT CGTCGGTTAC ATAATCATGA 8 520 

45 

ACAAAACAAT GGG CGACTTT TTACAAGTTA CACCTAAGAA TATTGGTGAT CCAGCGAGTG 853 0 

GTTACGCTAG CATTTTAGGT ATCCCAACAT TACAAACAGG TGTGTTCGGC GGTATTATAA 864 0 

TCGGGGCCCT GGCAGCTTGG TGTTATAACA AGTTCTATAA CATTAACTTA CCATCTTATT 87 00 

50 

TAGGTTTCTT CGCTGGTAAG CGTTTCGTAC CTATTATGAT GGCTACAACA TCATTTATTT 87 60 



GCGTAATCCA AGCACCTGAA ATATAGCCTG 
TACCACCAAT TAATCCAGCT CCAATAAATG 
5 TTGATTTCAA TACTTTATTC AACTTACCAT 

TCGCCTTTGC TTTATCAGCT GCATATGATT 
GGTGTGGTAT TGGGTCACCT AGTTTAACAC 

10 

TATTGatGTA CCAGGTCCAA CCACAACTTG 
AC CATCAATA TGCCTTAGTT CTTGGTCATC 
CACACGTGTC ATACAGTTCA TGACACTATC 

15 

AATTCGTTCT GCAAGTTGTT GTTCTTTGGT 
CTGATTGCTC GTTTAAaATG TCACCATTGT 

20 AAATGCCACA CATACCCATA ACTGTCGCAA 

ACGCCATTGC TTCATCATAT GTGATAGCAC 
CGATCAGTTT TTGATTGGTT GCTTTAACAT 

25 CACCAACCAT TGTGATGGTT G AAAT CAT AT 

TTAAACGTGT TGAACCAGTT AATACTTCTG 
CAATTTCACT TATAACTGCA TGTTCATTGC 

30 

TGTTAGCAAA TGTTAAACCG CCTATAACAT 
TAACGACATC TTTTGATGTT AAATCTATAT 
GATCTTCCGC ACCTTCTACA GCCATCGTCA 

35 

TAATTTCATG AGGGTCAGTA TTGAATGTAG 
ACCTTCCACT TGTACCTGCA CCGATATAAA 

40 TTGTTTTTTT AATTACTTTT GTCAATTGTG 

GCTGATCTTC TTTATTCATC GTAATTAAAG 
TCGTCGCTTC ATTACGCGCT TCGGTCGTAC 

45 TTTTTGAAAA TCAAATGTAT CATTCGGCTC 

AATAATTTGT GCAaCAACAT TCACATTGTC 
CATTTCGCCT TGATAACGTC CGTTATTCAA 

50 

TTGCGTCGTA AACTGTGGTT GAATCGAATG 
AATGACATTT TCCGGATTAT CCGGGCGTAC 

55 
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CCACCATTAA GTTACTCAGT ACTGCTGCAA 5340 

CAGGAATCAA CGGTATAAAG ATATTGGCAA 54 00 

TCTTTTGTTT TGCTTTATGC GCTTCCTTAT 54 60 

TATAGTCCAT TTTTTCACTA TCATTGTGAT 5520 

CACTTAATTC CGCCATATGA TTAGCCACTT 5580 

AATGCGTTCA TCGTGTATAA CACCCATGAC 5640 

TACTTTATTC TCATCTAATA CTTTAATACG 5 7 00 

TATATTATCC ATACCACCTA CTGCAGCAAT 5760 

CATTTAAATC CCTCCTAAGG TTGTCTATCT 5820 

TTAATAACCG TCTTGTTGCT TCTTCCTTAG 5880 

CTTTCACATC ATGCTCAGAT ACCTGATATA 5 940 

ATATTTCTTG AATAATACGC ACTGAACGGT 6 000 

CAATCATGAG GTTATCGTAA ACTTTTCCGA 6 060 

TTAAAATTAA CTTTTGTGCt GTACCAGACT 6120 

GACCAACTTT AACTTCTACT GGATAcTGCG 6180 

ATGAAATAGA TACTGTTGTA GCACCGATTG 624 0 

ATGGCGTTTT GCCACTCGCG GCAATTCCTA 63 00 

TTTTCAAATC TTCTTCCGCT AATTTTTTGT 6360 

TAGCATGTTG TCCACCAGCA ATAATACCTA 6420 

GTACACACTC CGCTGCATCT AAGACACCCA 6480 

TCAATCGTCC ACCCTTTTTA TACTGTGCAA 6540 

GTATTGCCTT TCGAACTGCT AACGGGACTT 6600 

CCTCTTCCAC AGTCATTTCA TCAAGATGCA 6660 

TATTTTCCAT CACTTCTTAC ACTCCCTAGT 6 720 

GATACAACTT AACAGTGGTA AGTCTTCTIT 6780 

ATGTGCACTA AGCGTTTGTC TCACAATTTG 6 840 

ATTATCAACG GTTACTGAAC CAATGCGTCG 6900 

TGGACATATT TGTCTTGACG TTTCCGAACG 6 960 

TTTATGACAC ATATCGAAAA GGTAAGTCAC 7020 
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30 



35 



1 ^-<j^_AAvjAi 


GCTTCCTGTA 


ATTATCAAGT 


GCCATTTTCG 


ATTGGGTTAT 


ACAATCTAGA 


3540 


Al LOwil kjAl 


AATTT AATG c 


TACAAATCGA 


TAGTACAATA 


TATCTACCGT 


GAATAACTGT 


3600 




ACGI lGTAGC 


CGCCATACGC 


ATTTCATTTT 


CATCAGTTCT 


GCCATAAATC 


3660 


AA I VjCA 1 ALj 1 


CTGCAATTTG 


AGCCACTGGA 


TTATTAGCTG 


TACTAGATAT 


AGTTATGATG 


3720 


LjvjAA 1 AC I vj I 


AATGTGTGGC 


CACCTGTGCA 


ATTGACTGCA 


ATTCACTATG 


ACTACCTTGA 


3730 


1 1 CU 1 CALAA 


AAA IV r«r^ ^ * 

AAATCATGCA 


ATCTCTATCA 


TCATGCGTCG 


CAAATGTTGA 


CACAAGTAAA 


3340 


1 VjCCt III CAT 


GTAATAACCT 


GACATTTAAG 


CCAATACGAG 


ATAACTTTTG 


AAAAAGATCA 


3900 


LUAA 1 Atj i CA 


AAC I CGATGC 


GCCAAATCCA 


AATAAAAATA 


TTGTCCTGGc 


ATTTTTCAAC 


3960 


AC A I CACAAA 


TTGCATCAAT 


TTGCGCATCC 


ATAATATTAG 


TAG CT ACAAA 


TCGCATCGTA 


4020 


I 1 Cdl HjLIL 


T A O A A T^r^ A 

1 AGCAATCAT 


TTTAi^n~i"i'C 


AAAGTTTCTA 


CAGATTCATT 


TTCAATCAAT 


4080 


I C I AAA1\»TG 


GATTGGTTGC 


AATATCTTCG 


GGTAAGTATC 


GAGATATCGC 


AATCTTTAnC 


4140 


TCTTGAAAAC 


CTTGATGTGT 


CATTTTCCGA 


CTAAATCTAA 


CAATTGATGC 


TGTACTAACA 


4200 


TT CGTAACAT 


CTGCCAAATC 


ATTCACAGTC 


ATATCAATGA 


TTTTATGTGG 


ATTCTTTAAA 


4260 


ATGTAATCAG 


CGATTATCTT 


TTCTGTCTTC 


GTAAAATCAC 


TCAACTGCTT 


ATCAATGCGA 


4320 


I 1 A 'P A TV A A ^ A T* 

TATAAAATAT 


TTGTCATCAT 


TAATCACCCA 


ACAAATCTGT 


CTGTCGCATC 


GCCTTTGTCG 


4380 


I rCCAAATAA 


ATATGTACAA 


ACGAATCCAC 


CAGCATACGC 


AGCAAGTAAT 


CCTGCAATAT 


4440 


AAL C I AAA I A 


CATATTATCT 


GAGATTAATG 


GTAATAGTGA 


CACACCACTT 


/"* r*T< a 

VJVJVJ\-UiAi X\J 


4 5 00 


L1I IlKjCACC 


AATATGTCCA 


ATTCCACCTA 


TTACAGCGCC 


ACCAATACCA 


CCACCAATAC 


4560 


AAGCAGTTAA 


GAAAGGTCGA 


CCTAATGGCA 


AAGTCACACC 


ATAGATTAAT 


GGTTCTCCGA 


4620 


1 ALL. I AGOAA 


ACCAACTGGC 


AATGCACCTT 


TTAAAGTATT 


ACGTAATGTT 


GTGTTGCGTT 


4680 


1 AlA i C 11 AC 


C CAAAGTGCT 


AATGCGGCAC 


CTACTTGTCC 


AGCACCAGCC 


ATCGCTGCAA 


4740 


1 IWjUAAIAA 


^"T 1 A A /™**T* A O O A 

1 AAG TAGCA 


CCTGATTGGT 


TAATCATTTC 


TATATGAATT 


GGCGTAAAAA 


4800 


T 1 A TV™* A TT" A A 

1 A 1\jA JajAAG 


CCCTAACATA 


ACTAACGGTA 


GGAAGCTTGC 


ACCAATGATA 


AATC CACTAA 


4860 


a t 1 a c^r* C*C* a 
A 1 ALOLlALL 


AATACTAATA 


ATTCCGTTAA 


CTACTGAAAC 


TAAACTGTCT 


GAAACAAAAC 


4920 




a HnKun ± A 


AAGATAvjiTA 


AT AGT C CTAC 


AATCAACAAT 


GCAATAGTCG 


4 9 80 


GCGTTACAAT 


AATATCAATC 


GCATTTGGCA 


CAATTTTATG 


TAATCTCTTT 


TCGACAATAC 


5040 


ttaaaatcca 


AACGGCAAAA 


ATAACGCCAA 


TAATCCCACC 


TTGTCCAGGT 


TGCAATGGTT 


5100 


ctccagtgaa 


GACATTCATT 


AAAATATTTT 


TACCAGCAAT 


ACCCGTTAAT 


AACGTTGTAC 


5160 
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ACTTTATTCG TTTTATACTG TTCTAATAAA GGGGCATTCC CTTGAATCGC TTTAGCCATT 174 0 

TTTTGAATCG TCGTTACAAT AAGTGGCAAA CTTTTATCAT TTAGTTGGCG TACCAGTTGC 1800 

GAGGTATTAA AAGTTTTGTC TACAGCACCC TTAGCAAATT TATTAAATTC CTCTTCTGTT 186 0 

TGACTATCCA AGTCTTTACG GTCAACCAAA AAGATAACTT TCTTAATGTC ATCTTGCTGT 192 0 

GATAAAATCT GACTCGCTTT AAAAGAAGTC AACGTCTTAC CACTTCCAGT TGTATGCCAT 198 0 

ACATATCCAT TATTCCCTGT CTCAGTCGCT TGTTGAATAA GTGCTTCTAC CGCATACACT 204 0 

TGATACGGAC GCATTGCCAT CAGTATTCTA TCTGTTTCAT TAATAATCAT ATAGCGCGAT 2100 

ATCATCTTAG CTAATTGACA AGGTCTCATA AATGACTCAG CAAACGATTG CAATGTATTG 2160 

ATACGGTTAT TCTGTTTATC ACTCCAATAA AACATGTGAC TCTTCAATAG TTCGCTATCA 222 0 

TTATTAGAAA AGTATCGCGT TTCAACACCA TTACTAATGA TAAACATTTG TATGTAGCGG 2280 

AATAAGCCTG TGTAATTTTG TTTGCGGTAA CGTTTTACTT GGTTAAACGC CTCATTAATA 234 0 

TCAATACCTC GACGTTTCAA TTCAACTTGG ACAAGGGGTA GTCCGTTGAT TAATATCGTT 2400 

ACATCATAAC GTGCTTTATA TGTATCCTCG ACAGATACTT GATTCGTCAC TTGAAACTTA 2460 

25 TTTTTACACC AACTTTTCGT ATCTAAAAAC GACAAATAAA TCTCAGACTC ATCATCACGT 252 0 

CTAAGTGGTA ATTTATCACG TAAAATACGG GCACTCTCGA AAATACTTTT TCCATCAATC 2580 

ATCGTTAACA GACGTTGaAA TTCTTTATCT GTTAaGGGAT TGCCTTCTAA TTTGTCCGGA 264 0 

TGACGCTCAT TTAAAATCGT TCTAAAATTA TCAAGCAATT GCTTATTATC ACGTATCGTT 2700 

ACTCTTTCGT AACCCAATTG TTCAAGTTGA TTCATCATTT CATTTTCTAA TGCGTATTCA 2760 

CTTTGGTATG CCATTCATAT CCCCTTCCAT ACACTTTCTA TTGCTCTAAA TAT AT CAT AA 282 0 

ACTTTAATGA AAAATGTTTG TTTTTTATCT TCAAACGTAA ATTTATTCTA ATTTTATTGT 2880 

CTTAXCTTTT AATATTTGTC TTTGAGGTAA GTCGTATACT AAAATTTGAA TACAAATAAT 2940 

CAAATCATTG ATAAATTTTT TGTCTACGAT TAATGGAGGG ACTTGAATGG TGTTAATTAC 3000 

CTATCAAATC ATTTTATTTT TTATTATTAG TCTAAGTTAC TATTTAACTT TAAATCATTA 306 0 

CATGGCAGTC ACTGTAGGTA ACTTCACTTC AATATTCGGC ATGTTCGCAG CCATACTCTT 3120 

45 TATGTACTAC TACCTACTCT ATAAAAGTCC CGAATACAAT CAACGCAAAC GATTTAAACA 3180 

TTTCATTCAT ATCACTAATT TGATAATAAT TGCTTTTAGC ACCTTCGTAT TAGTTCATTT 324 0 

AGCATTAAAA TTATTCTTCA GCATTTAATT TCCATCTATG AAAAAAGCAA AGCTCAAATC 3300 

50 

TGAACTTTGC TTTAATTTGT CACGCCTTTA TCATTTTCAA AATAGCCTCT ATGCCAGTTT 3360 

TACAAACTTG TAGCAACAAT TTTTCATCAA GCAACTGAAT CACATCAAAA ACTTCAATTG 3420 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TnAGTTGTTT CTGCCACGAA AGATTCAATG GCTTTTCTTG CTTTACGCTT TTCTTTCAAT 6 0 

GGCAAATCAC CAATCATTTT TTTAAGGTGA TGTGGGTTTA CAACACCACT ATACTGGTAG 120 

TCATTTGAAn TTGTTTTTAG GGCTTGTTCA TCGATAGATC TCTCTCCAGC AAATCCTTTG 180 

AACTCCGCTT CTTTTTTAAT ACTTTCGAAA TTAACATATT CTTGATCGAT AT CAT CAT CC 240 

TTATTTAAAG AAGGTACAAC ATTGTCGATG AATTCTCTAA TTAGATCTCG TTTTAACCTC 3 00 

AATGtCGGAT CATCTGCATG ATCTAAAATG CGTCTAATTT GTTCTTGGTT ACGACGTTGT 3 60 

TCCGCTTTGT CTTCAAGATC AATTTGTCTC AATATATTCA TAATATAATT CACATTAATC 4 20 

GTATCATTAC GCATCATTTC TATTTCGAAA TCAATATCAT TTAAAATGGA TACTTTATTT 4 80 

20 TTCTCAOCCG TCGCTCTTTT TACTTGATCG TACACAGCTA AATATTTACT TTTATAGTCT S4D 

TCATTCTCTT GTTCATCCAT TCCAATTTCA TCAATTGTAA ACTCAAACTC GTCAAATGCT 600 

TTTAAACGTA ATATTATTTT AGCTAATAAA CGATAAGCtT CAACAAAGCG CTTTAGCTCT 660 

TCTTCATCtT GaATGtCATC AACCATGTGT GGTGTCGGCA CAATCATTTT AAGCTCACGA 720 

TAAGCGTCCA TAAATTCTTT TTTATACTCT TCATAACTGC GCATTAAAAT TGTATCCGTA 780 

TCATTTGTTT GTGAGAATAC TCTCAGTGCA TCGTCTGTCT CTTTTTTCAA GTCACGATAG 840 

TTTACAATTT TACCAAATGG CTTTGATTCT TTTTCAACCC TATTTGTACG TGAATACGCT 900 

TGAATTAAAT CATGATACAT TAAATTCTTA TCAACATATA AAGTGTTCAG TACTTTACTA 960 

TCAAAACCAG TTAAGAACAT ATTAACAACG ATTAAGATAT CAATTTTACT ATCTTTAACG 1020 

CCCTTTTTAA CGTTTTTtGA AATATGATTA AAATACTCAT TAGTTGtGGC TGnTGaAAAA 10 80 

TTCGTCTCGA ACTTTTTATT ATAATCACTA ATCATTATCT CTAATTTTTC ACGTGAATGA 114 0 

40 TATGGCACTT CACCATCACG ATCATCTTCA TTAGGTTTAA ACGTAAATAT ACCAGCTATC 1200 

GTTAACGGTT GTTCCAACTT TTTGTTAAGT CGCTTAAATG TCTCATAATA TTTAATAAGC 1260 

GCGTGAATAC TTTGGACTGT AAATATACTT GAATATTGAC GATTACGTGT ATATTTATCA 1320 

TGATTATTGA TGATATGTCG TGTTACTAAT TCCACACGTT TATCCGCTAA CCATACTTCT 13 30 

TCCGTATCAA TTGCTTCAAC CATGCtGTTA TCTTCTGCTT TTAAAGCTTT ATTTTTAAAA 144 0 

GTATTAATAT AGTCAACTGA GAAACCAAGT ACATTACCAT CATGAATGGC ATCTCTAATT 150 0 

AAATACGTAT GTAAGCATCT ACCGAAAATA TCTGCAGTTG TTCTACCATC TTGACTACTA 1560 
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CCTTGACGAG CACGACCTGT TCCTTTTTGC TTCCATGGTT TACGTCCGCC ACCGCTTACT 14 940 

GCTGAACGAT TCTTAACAGC ATGCGTACCT TGACGTAATG AAGCACGTTG TAAATTAATA 15000 

5 

GCTTCGAATA AAACGCTATT ATTTGGCTCA ATACCGAATA CTGCATCGCT TAATTCGATT 15060 

GAACCTGATT TAGTTCCGTC TAATTTTAAA ACATCATAAT TAGCCATTAT GCATTTCCTC 15120 

CTTTCACTTC TTATTATTTA TTACCTTTTT TAATTGAAGT TCTGATTTCT ACTAAACCTT 15180 

10 

TTTTAGGTCC AGGTACGTTA CCTTTTACTA AGATAACTTT GTTTTCTGTG TCAACTTGAA 15240 

CTACTTCTAA GTTTTGAACA GTTACAGTGT TTCCACCCAT ACGTCCTGGC ATTTTTTGGC 15300 

J5 CTTTAAATAC TCTAGAAGCA TCTGAAGCCA TACCTACAGA ACCTGGTGCT CTGTGGAAAT 153 60 

GAGAACCGTG TGACATAGGT CCACGAGATT GTCCGTGGCG TTTAATTGCA CCTTGGAAAC 15420 

CTTTACCTTT TGATACGCCT GTTACGTCAA TAACGTCGCC AGCTACAAAA GTATCTACTG 15480 

20 AGACTTCTTG AaCCTAcTcG TAAGCATCCA CGTCTACATT GCGGAATTCA CGAATGAAGC 15540 

GCTTAGGTGC TGCGTCAGCT TTTTTAGCGT GACCTTCAGC TGGTTTATTA GCATATTTAT 15600 

TAGATTTTGC ATCTTTTTTG TATGCTTTTT TGTCTTCAAA TCCAACTTGG ATTGCGTTGT 15660 

25 ATCCATCAAC TTCTACAGTT TTCTTTTGTA ATACAACATT TTCTTTAGCT TCTACTACTG 15720 

TTACAGGGAT TAATTCACCG TTTTCTCCGA ATACTTGTGT CATCCCAATT TTTCTTCCTA 15780 

AGATTCCTTT GGTCATCGAA AGTCCACCTC CTAAAATTGT CTATTATAAT TTGATTTCGA 15840 

30 

TGTCTACACC AGATGGTAAG TTTAAGCCCA TTAAAGCGTC AACTGTTTTT GGTGTTGGGT 1590 0 

TTACAATATC GATTAAACGT TTGTGTGTAC GTTGTTCGAA TTGTTCACGT GAATCTTTAT 1596 0 

ACTTATGCAC GGCACGGATG ATTGTGTAAA CTGATTTCTC AGTTGGTAAC GGAATTGGTC 16020 

35 

CAGAAACATC TGCACCAGAA CGTTTCGCTG TTTCTACAAT CTTCTCTGCT GATTGATCAA 16080 

TTAC3CGGTG ATCATAAGCT TTTAATCTGA TTCTGATTTT TTGTTTTGCC ATAATTTTCC 16140 

40 CTCCTTATTC GTCTACATTT AGTGATAGAC TTCTCCACGA AAACTATCTT ACACAGCGCC 16200 

ATGGCAAAGC GGCCGGGTGT GTCAGTAACC TTTCGCTTCA TCGCTTTTCT TAAAGTCCAA 16260 

CGTTAGTTAT ATTACACGAA AAACATCGAT AAATCAAGGC TTTTCACATA ATTTTTCTAT 16320 

45 CTGTCTAACA CATACTTTTA TATTTnACTT TATATACTTA GTCAGTTCAA CTATTTTCGA 163 80 

GATATTTTnA ATTTCCn 163 97 
(2) INFORMATION FOR SEQ ID NO: 206: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

55 



EP0 786 519 A2 



5 



TO 



15 



30 



35 



40 



TATCTTCAGT 


TACATATACA 


GGTACGTGTT 


TACGTCCGTC 


GTATACTGCA 


AAAGTATGTC 


13140 


CGATGAAATT 


AGGGAAAATT 


GTAGAACGAC 


GTGACCATGT 


TTTGATTACT 


TGTTTCTTTT 


13200 


CGCTTCCTTC 


TTGAGCTTCA 


ACTTTTTTCA 


TTAAATGCTC 


ATCGACGAAA 


GGTCCTTTTT 


13260 


TAATACTACG 


AGCCATTTGG 


GCGCCTCCCT 


TCTTATTATG 


TGCGTGCAGC 


TTTAAGCCGC 


13320 


ACACCCAAAT 


AAGTTGATTA 


TATTATTTTT 


TCTTACGTCC 


ACGAACGATA 


AGTTTGTCTG 


13380 


ATGATTTriT 


ACCACGACGA 


GTTTTCTTAC 


CAAGCGTAGG 


TTTACCCCAT 


GGTGACATTG 


13440 


GAGATGGTCT 


ACCGATAGGA 


GCACGACCTT 


CACCACCACC 


GTGTGGGTGA 


TCGTTAGGGT 


13500 


TCATTACAGA 


ACCACGAACT 


GTTGGACGGA 


TACCTTTCCA 


TCTTGAACGT 


CCGGCTTTAC 


13560 


CAACGTTAAC 


TAATTCGTGT 


TGTAGGTTAC 


CAACTTGACC 


GATTGTAGCA 


CGG CAAGTAG 


13620 


ATAAGATCAT 


ACGAACTTCA 


CCAGATCTTA 


ATCTGATTAA 


TACGTATTTA 


CCTTCTTTAC 


13680 


CAAGTACTTG 


AGCACTTGCA 


CCAGCTGAAC 


GAGCGATTTG 


TCCACCTTTA 


CCAGGTTTAA 


13740 


GCTCGATGTT 


GTGTACTACT 


GTACCAACTG 


GAATGTTTTG 


TAATGGTAAT 


GCGTTACCAA 


13800 


CTTTGATGTC 


AGCTTCAGCA 


CCACTTTCAA 


CGATTTGACC 


TACTTCTAAT 


CCTTTAGGAG 


13860 


CAATGATATA 


TCGTTTTTCA 


CCGTCTGGAT 


ATACAACTAA 


AGCGATGTTT 


GCTGAGCGGT 


13920 


TTGGATCATA 


TTGAATAGAA 


TCAACTTTTG 


CATTGATACC 


ATCTTTGTTA 


CGTTTGAAAT 


13980 


CGATAACACG 


GTATTGACGT 


TTGTGTCCAC 


CACCATGGTG 


TCTTACAGTC 


AATTTACCTT 


14040 


GGTTGTTACG 


TCCCGCTTTT 


TTCGGTAGCG 


GTTTTAATAA 


TGACTTTTCA 


GGTGTAGTTT 


14100 


TCGTGATTTC 


TGCGAAATCT 


AACGAaGTCA 


TATTACGACG 


ACCATTTGTT 


ATTGGCTTAT 


14160 


ACTTTTTAAT 


AGCCATTGTC 


GCTTACCTCC 


TTAATGGTAA 


TTGTTTTATT 


AGTTAAATAA 


14220 


GTCGATTGAT 


CCTTCTTTAA 


GAGTTACAAT 


CGCTTTTCTT 


CTTTTGTTTG 


TATAGCCTTG 


14280 


GTAACGGCCC 


ATACGTTTTT 


TCTTAGGTTT 


GTAATTCATG 


ATATTAACAC 


TTGCAACTTT 


14340 


TACGTTGAAG 


ATTTCTTCAA 


CTGCCATTTT 


TACTTGTGTT 


TTGTTAACAC 


GAGTATCAAC 


14400 


GTCGAAAGTG 


TATTTGTCTT 


CAGCCATTGC 


TTCAGAAGAT 


TTCTCAGTGA 


TTACGGGGCG 


14460 


CTTAAGAATA 


TCTCTTGCTT 


CCATTATCCG 


AGCACCTCCT 


CAACTTTTTT 


AGCAGCAGCT 


14520 


TCAGTAATTA 


CTAAGCTGTC 


AGCATTAGTG 


ATATCTAAAA 


CATTTAAACC 


TTGAGCAGTT 


14580 


GTCACTTGAA 


CGCCAGGGAT 


GTTGCGTGCT 


GATAATTCAA 


CATTTACATC 


TTCGTTTTCA 


14640 


GTAACTACTA 


ATACTTTTTT 


AGGTTGTTCT 


AATGTAGATA 


ATACATTTTT 


GAATTCTTTA 


14700 


GTTTTTGGAG 


CTTCGAAGTT 


GAATGCGTCA 


ACTACAGTTA 


AGCCATTCTC 


TTGAGCTTTG 


14760 
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TTTTGTAACC TCCTCTTACT TAATTATTGA TTAGCCTTAC TTTGTTCAAT TTCTCTTTCA 1134 0 

CGAGCAACAG TTTTTAGACG TGCAATCGTT TTTCTTACTG TACGAATACG TGCAGTTTCT 114 0 0 

TCTAATTGAC CTGTAGCTAA CTGAAAGCGT AGGTTAAAAA GCTCTTCTTT TGAAGATTTG 114 6 0 

ATTTGTTCTT CGATTTCTGA AGTGGTTAAG TCTCTAATTT CCTTAGCTTT CATTTGTTTC 11520 

ACCACCCAAT TCCTCACGTT TTACAAACTT AGTTTTTACT GGAAGTTTGT GACTTGCTAA 11580 

ACGTAGTGcT TCACGCGCAA CTTCTTCAGA AACGCCAGCA ACTTCGAATA AAATTCTACC 11640 

TGGTTTAACA ACTGCGATCC AGCCTTCAAC CGCACCTTTA CCAGCACCCA TACGTACTTC 117 00 

TAAAGGTTTT TTAGTATATG GTGTATGTGG GAAGATTTTA ATCCAAACTT TCCCGCCACG 11760 

TTTCATGTAA CGTGTCATTG CTATACGAGC AGATTCGATT TGACGAGATG TGATCCAAGA 11820 

CGTTGTTGTA GCTTGTAAAC CAAACTCACC AAATGTTACG TAtTACCGCC TTTAGAACGA 11880 

CCAGTTGTTT TAGGACGATG TTGACGACGA TATTTTACAC GTTTTGGTAG TAACATTATT 11940 

ATTTTCCTCC TCCACTAGTG TTCTTAGTAG GAAGAACTTC TCCACGATAA ATCCATACTT 12000 

TAACGCCTAA TTTACCGTAA GTAGTGTCAG CTTCAGCGTG tGCATAATCG ATGTCAG CAC 12 06 0 

25 GTAACGTATG AAGTGGAACA GTTCCTTCTG AATATTGTTC AGCACGAGCG ATGTCAG CTC 12120 

CGCCTAAACG ACCAGATACT TGaGTTTTGA TACCTTTAGC ACCAAGTTTC ATAGCTCTAG 12180 

TGATTGCTTG TTTTTGTACA CGACGGAATG AAGCACGGTT TTCTAATTGA CGTGCGATGT 1224 0 

TTTCAGCTAC TAAACGAGCG TCAAGATCAA CTTTTTTGAT TTCAATTACG TTGATGTGTA 12300 

CTTTTTTATC AGTTAACGCA TTTAATTTGT TGCGTAATTT TTCGATTTCT GAACCGCCTT 12360 

TACCAATTAC CATACCAGGT TTACCAGTAT GAATTGCAAT GTTGATACGG TTTGCAGCAC 12420 

GTTCAATCTC TACGTGAGAA ACTGATGCTT CTTTTAATTC ATTATCAATA AATTTACGGA 12480 

TTTTTAAATC TTCGTGTAAA AGTGAAGCGA AGTCTTTTTC AGCATACCAT TTAGCTTCCC 12540 

AATCACGGAT AATACCAACA CGAAGTCCGA TTGGATTAAT TTTTTGACCC ACAGTATTCC 12600 

CTCCTTAAAA GTTAATTAAG CTTCTTTAGC TTCTTCTTTA CCGTCACTTA CGACGATTGT 1266 0 

AATGTGGCTT GTACGTTTGT TAATCGCACT TGCACGACCT TGCGCACGTG GACGGAAACG 12720 

45 TTTTAATGTT GGTCCTTCGT TAGCATATGC TTCTTTAACT ACTAATTCAT CTGTGTTCAT 12780 

GTCATAGTTA TGTTCAGCAT TAGCTAAAGC GGACATTAAT ACTTTTTCAA TTACTGGTGA 12 84 0 

TGAAGCTTTG TTTGTTAATT TTAAAATTGC AATAGCTTCA GCAGCATTTT TACCTCTGAT 12 900 

50 TAAGTCAAGA ACTAGTCTTA CTTTACGAGG TGCGATTCTT ATTGTTCTAG CAACCGCTTT 12960 

TGCTTCCATT AGGATGTCCT CCTCTACTTA ATAGATATTA TCTTCTTGTT TTCTTGTCGT 13020 
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TCTTTTTTAT TACCAGCTAG CTTTACGAAC GCCAGGGATT TGGCCTTTGT AAGCTAATTC 954 0 

ACGGAAACAA ATACGGCATA ATTTAAATTT ACGATATACA GAATGTGGAC GGCCACAACG 96 00 

5 

TTCACAACGA GTGTATTCAC GAACTGCATA TTTTTGTTTT TTTTGTTGCT TAGCAACCAT 96 6 0 

TGAAGTTTTA GCCACTTAAT TAGCCTCCTT TAAATAATTA TTTACGGAAT GGCATACCGA 972 0 

AGTTAGCTAA CAATTCACGA GCTTCTTCAT CAGTGTTAGC AGTCGTTACG ATAACAATAT 978 0 

w 

CCATTCCTCT AACTTTACTT ACTTTATCAT AGTCGATTTC TGGGAAAATT AATTGTTCTT 984 0 

TAACACCTAA AGTGTAGTTA CCGCGTCCGT CAAATGCTTT TTTAGAAACA CCTTGGAAGT 9 9 00 

15 CACGTACACG TGGTaATGAT ACTGAAATTA ATTTGTCTAA GAATTCATAC ATTCTTTCAC 9 960 

CGCGAAGTGT TACTTTCGCA CCGATTGGCA TACCTTCACG TAAACGGAAA GTCGCGATTG 1002 0 

aTTTTTTAGC TTTAGTTACT AATGGtTTTT GACCAGTGAT CAATTCTAAT TCTTCAACAG 100 80 

20 CATTGTCTAA TACTTTAGAA TTTTGTACTG CGTCACCTAC ACCCATGTTC ACAAPGATTT 10140 

TATCTATTTT TGGTACTTCC ATTACTGAAC TATAATTGAA TTTTTTCATT AAGTTTTCAG 10200 

TAACTTCAGT GTTaAACTTT TCtTTTaAAC GGTTCaAAGT GGGATCCTCC TTTCaACTTG 10260 

25 TtATTAATTA TTAGAkTTAA TTTCTTCGCC AGATTTTTTA GCGATACGAA CTTTTTTACC 10320 

ATCAACAAAT TTGTAACCTA CACGAGTTGG TTCGTTTGTT TTAGGGTCCA ATAATTGTAC 10380 

ATTAGAAACA TGGATTGCTG CCTCTGTTTC TAAGATTCCA CCTTCAGGAT TTAATTGAGT 10440 

30 

TGGTTTTTGG TGTTTTTTCA TAATGTTAAC ACCTTCCACA ACGACACGGT CTTTTTTAGG 10 500 

TAGAGTAGCA ATTACTTTAC CTTCTTTACC TTTGTCTTTA CCTGCGATAA CTTTAACGTT 10560 

GTCACCTTTT TTGATATGCA TGTGGGCACC TCCTTATTTG TATTGGTTGT TATTAATTAA 10620 

35 

AGTACTTCTG GTGCTAATGA TACGATTTTC ATGAAGTTAC CTTCACGTAA TTCACGAGCA 106 80 

ACAGGTCCGA AGATACGAGT ACCACGTGGG CCTTTGTCAT CACGGATGAT AACACATGCA 10740 

40 TTTTCATCAA ATTTGATGTA TGAACCGTCA TTACGACGAA CACCTGACTT AGTACGTACG 10800 

ATTACAGCTT TGACAACGTC ACCTTTTTTA ACAACGCCAC CTGGTGTTGC ATTTTTAACA 10860 

GTACATACGA TAACATCGCC GATGTTTGCT GTTTTACGAC CAGATCCACC TAATACTTTG 10 920 

45 ATTGTAAGAA CTTCACGAGC ACCAGAGTTG TCTGCTACTT TCAAGCGTGT TTCTTCTTGG 10 980 

ATCATTAGTT AAACCTCCCT TATCTCTAAA CTTGTATTAA ATAATTACTG ACTCTTCAAC 11040 

AATCTCTACT AAACGAAAAC GTTTTGTTGC TGATAAAGGA CGAGTTTCTT GAATTTTAAC 11100 

50 

AATGTCTCCT AATTTAGCTG AATTGTTTTC ATCATGAGTT TTGTATTTTT TAGAGTATTT 1116 0 
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CATGTTGATT GGTGTGTTTG ATCCTAATGA TTTACTTAAG ATATCAGTGA TACCTGCTAA 7 740 

TTCAAGTACG GCACGAACAG GACCACCAGC GATAACTCCT GTACCAGGTG CAGCCGGTTT 7800 

CATAAATACG CTTCCTGAAC CGTAACGGCC AGTAATTGTG TGTGGAGTTG TACCTTCAAC 7860 

ACGTGGAACA ACTACTAAAT CTTTTTTAGC TGCTTCAACA GCTTTTTTGA TTGCTTCTGG 7920 

TACCTCTTGA GCTTTACCAG TACCGAAACC TACACGACCA TTTTTGTCTC CAACTACAAC 7980 

TAATGCAGTG AAACGGAAAC GACGACCACC TTTTACAACT TTTGCTACAC GGTTGATTGT 8040 

AACAACGCGT TCTTCAAATT CTTTCGTCTC TTCTTCtCTA CGAGCCATGT ATTTGTCCCT 8100 

CCTTTAAATT AAAATTCTAA TCCGCTTTCT CTTOCTGCTT CAGCTAATGC TTTAAcACGT 8160 

CCGTGATATA AATATCCTCC ACGGTCAAAT ACGATTTCTT TAATGCCTTT GTCAGCAGCT 8220 

TTTTTAGCAA TTGCTTCACC GACTTTAGTT GCTAATTCAA CTTTAGTTGC TGTAGTAGCA 8280 

ATGTCGCTGT CTTTTGAAGA AGCTTGAGCT AATGTTACGC CTTTATTATC ATCAATAATT 8340 

TGAGCGTAGA TATGCTTGTT TGAACGATAT ACGTTTAAAC GTGGCTTTTC AGCTGTACCT 8400 

GATAAGTTAG TACGAACACG AGCATGTCTT TTTAAACGCA CTTTATTTTT ATCAATTTTA 8460 

CTGATCATTT CAATACTCCT TTCTTTAGAG TTTATCTATT ATTTACCAGT TTTACCTTCT 852 0 

TTACGGCGAA CGTATTCACC TTGGTAACGA ATACCTTTAC CTTTGTAAGG CTCTGGAGGT 8580 

CTTACTGAAC GGATGTTAGA TGCTAATGCT CCAACTTGTT CTTTTGAAAT ACCTTCAACT 8 64 0 

30 TTAACGACTG TGTTTTTCTC AACTGAGAAA GTAATGTTTT CTTCAGCTTT AATTTCTACT B700 

GGGTGAGAAT AACCAACGTT AAGGATTAAG TCTTTACCTT GCATTTGAGC ACGGTAACCT 8760 

ACACCAACAA GTTCAAGTAC TTTTACGTAT CCTTGAGAAA CACCTTGTAC CATATTGTTT 8820 

AATAAAGCAC GAGTTGTACC ATGGTTTGTT CTATCTTCTT TAGAATCAGA TGGTCTTACA 8880 

ACTTCAATTG TGTTTTCTTC TTGTTTGAAT GTCATTCTTT CATTTAAAGT TCTTGATAAT 8 94 0 

TCACCTTTAG GACCTTTAAC AGTTACATGA TTTCCATCAA AAGTTACTGT TACGTCACTA 9000 

GGGATGTCAA TAATTTTCTT ACCAACACGA CTCATGTTAT GGCACCTCCT TATTTTTTAT 9060 

TACCAAACGT ATGCGATAAT TTCTCCACCA ACATTACGTT TTCTTGCTTC TTTGTCAGTG 9120 

ATTACACCTT CAGAAGTTGA TACTAATGCA ATACCTAAAC CATTTAATAC TTTAGGCATT 9180 

TCGCTAGCTT TTGCATAAAC ACGTAAACCT GGTTTTGAAA TACGTTTTAA TCCTGTGATA 9240 

ACACGCTCAT CGTTTTGACC ATATTTTAAG AATAAACGAA GTACACCTTG TTTATCATCT 930 0 

50 TCTACGTATT CAACATTTTT AATGAAACCT TCACTCTTTA AGATTTCAGC AATTTCTTTT 9360 

TTAATATTTG ATGCAGGTAA TTCTAACTTC TCGTGACGCA CCATGTTTGC GTTTCTTACA 9420 
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CCAGGTCTAA TACCTGGAAC ATAGCTACCT TGTTTCTTAA GGTTATCAGC CATTTTTTCC 5 94 0 

GGATTAACTT GTACAAATGC ATAGAAGTAT GTGAATAGTA TAATTAGTAC AATATATACA 6000 

ACCATACCAA CATTACTTGA AGGATTTGCA GCATTCGCAA TGTTTTGTGC CCATTCTTTA 6 06 0 

TCTGGATAGA ACAACGTTAA TGTTCTAGGC AGTAAGAAGA ACGCCATTGC AAAGATTACA 612 0 

GGAATAACAC CGGCTGAGTT CACTTTTAAA GGTAGATAAG TTGCCTGTGA ACCTAATCTT 6180 

TGAGCAGTTT GTTTCTTAGC ATATTGAATC GGAATTTTAC GAACGGCTTC AAGTACATAA 62 4 0 

ATAGCACCTA CAGTTAATAG TATCAGTGAC ACTAAAAGTC CTAATACTTT CAACCATGCT 63 00 

AATGATGTAT CTTCTTGCCC AACGAACGCA tTTGTcCAAA TTGAATTAGA CTGGCTGGCA 63 6 0 

ACGTTGATAA AATACCCGCA AATATGATAA TAGAAATACC ATTACCAACA CCGAACTGAG 64 20 

TGATTTGATC ACCAAGCCAT ATTAAGAAAG CAGTTCCTGC TGTnCAAAAC TAGTGCTATT 64 8 0 

20 AATAAATAAC TCATAATTGA CTGATTGATA ATCAGCGCAC CTTTGAGATA ATTATTAAAT 654 0 

TGGAATGCCA TACCTATAGA TTGGATAAAT GCTAAAGAAA TTGCTAAATA ACGAGTAACG 66 0 0 

TTATTTAACT TTCTTCTACC TACTTCACCT TGTTTTGCCC ATTCTGAGAA TTTAGGGACA 6660 

ATATCCATTT GTAATAATTG CATTACGATT GATGCAGTGA TGTAGGGTAC AATACCCATT 672 0 

GCAAAAATAG AAAATCGTTT CAAGGCTCCG CCACCAAAAG TATTTAATAA CTCAGTGGCA 67 90 

CCTTGAGAAC CTTGGGGATT ATCAAAAGCT GCAGGATTTA CTCCTGGAGC TGGTATATAA 6 84 0 

GTCCCTATTT TAAAAATTAC TAACATTGCT AGTGTGAAGA AAATCTTGTT ACGAACCT CT 6 90 0 

TTTGTTCTAA AGAAGTTCAC AAGGGTTTGA ATCATTAGAT CACCTCGTGT GcTCCACCTT 6 960 

TAGCATCAAT AGCTTCTGCT GCTGAAGCTG AGAATTTATG AGCTTTCACT GTCAATTTCT 7 02 0 

TATCAAGTGA ACCATTACCT AGTATTTTGA TACCAGATTT TTCATTCTTA ACAACACCAG 7080 

ATTCXACTAA TAAAGCTGGA GTTACTTCAG TACCATCTTC AAATTTATTA AGTTGGTCTA 714 0 

40 AGTTAACAAT AGCATATTCT TTACGATTTA TGTTAGTAAA ACCACGTTTT GGTAAACGAC 72 00 

GGAATAATGG TAATTGACCA CCTTCAAATC CTGGTCTTAC ACCACCGCCT GAACGAGCTT 72 6 0 

TTTGACCTTT GTGTCCGCGA CCACTTGTTT TACCGTTACC TGTCGCAACA CCACGTCCAA 73 2 0 

CACGATTGCG TTCTTTACGT GAACCTTCTG CCGGTTTTAA CTCATGTAAT TTCATTTCGG 73 3 0 

CACCTCCTTG ATTATTTTTC TTCTACTGTT ACTAAGTGCT TAACTTTGTT GATTTGCCCA 744 0 

CGAATAGCAG GGTTATCTTC AACAACTACT GAACTGTTAG TCTTTTTAAG ACCTAAAGCT 7 50 0 

TCAACAGTTT TACGTTGTGT TTCAGGACGA CCAATAACAC TACGAGTGAG GGTAATTTGT 756 0 
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CGCGCGTkGT TTTTCGTTTT TTGACCACGA ACTGGTAAAC CACGACGGTG ACGGATACCc 414 0 

ACGGTATGAT GAAATTnCCA TTAAAcGTTT GATATTTAAG TTAGTTTCAC GACGTAAGTC 4200 

5 ACCTTCGACT TTATAACCGT CTACAACTTC ACGGATGCGA CCTAATTCGT CATCAGTTAA 42 60 

ATCTTTCACA CGAGTATCAG CTGATACGTT AGCTTCTTCA AGAATTTTTT GAGCAGTTGA 4320 

CGTACCGATA CCGTATATAT AAGTTAATGA GATAACTACG CGTTTTTCAC GTGGAATATC 43 80 

10 

TACTCCTGCA ATACGTGCCA TATTAATTTA CACCTCTCTT TTATTAACCT TGTCTTTGTT 4440 

TGTGTTTTGG ATTTTCACAA ATTACCATTA CTTTACCTTT ACGTTTAATG ACTTTACATT 4 500 

?5 TTTCGCAAAT AGGTTTTACT GATGGTCTTA CTTTCATTTT TATACCTCCC TATATTATGG 4560 

AGTGACGATT ATTTATAACG ATAAGTAATT CTTCCGCGTG TTAAATCGTA CGGAGACATC 4620 

TCAACAGTTA CTTTGTCGCC AGGTAGAATA CGAATGTAAT TCATTCTGAT TTTACCACTT 4680 

20 ACGTGAGCnA AAATCTCATG ACCATTTTCT AATTCTACTT TAAACATTGC GTTCGGTAAA 4 74 0 

GTATCTAATA CAGTACCTTC TAATTCAATT ACATCTTGTT TAGCCATTGA TTAACTTCCC 4 800 

CCTTTTTGCA ATAGTAAGGT AATCGTCAAT AGACAACTTT ATTGTTACGA ATCTATCAGT 4 860 

25 GATTAATTTT ATAAGTTAAA CAAAAATTAC GGGAATTAAT TATCGTTAAT TGCCACTCTC 4 920 

ATCTATCTAA TATGATTAAA TCATGCCTCA CTTAAAATAG ACCGCTAAAA GTTGATCTAT 4 980 

TACAAATGAT CTAAAATATC AATGACATCT TTGGTAACGT CGCTAATATC TTTTGAACCA 504 0 

30 

TCAATATTTT TCAATACACC TTTTTGATCA TAGAAATCTA AAATAGGCTT AGATTGTTTA 5100 

ATATTAACAC TCAAACGATT AGCTACCGTT TCAGGATTAT CATCTTCTCG TTGATACAAT 5160 

TTACCACCAT CGATATCACA AATACCTTCG ACTTCGGAGG ATTAAATACA AGATGATACG 5220 

35 

TTGTACCACA TGACTCACAG ATTCGACGAC CTGTAAGACG GTTCATTAAT TCTTCTTCCG 5280 

GAAClTCGAT ATTGATGACA GCATCAATGT TTCTGTCAAG CTCAGACATA ATATTATTTA 5340 

40 ATGCCTCAGC TTGCTCGATT GTTCTTGGGA AGCCATCTAA TAAAAAGCCT TTTTTTGCAT 54 00 

CGTCTTCAGA AATTCTTTCC TTAACGATAC CTACAGTCAC TTCATCAGGA ACTAATTCGC 5460 

CACGGTCCAT ATAAGACTTA GCTTCTTTAC CTAATTCAGT TTCTTCTTTT ATAGCTTTTC 5520 

45 TGAACATGTC ACCAGTTGAA ATGTGGGGTA TTGGGAATTT CTTGaCAATT TCACTTGCTT 5580 

GAGTTCCTTT ACCTGCGCCA GGTAAACCCA TCAAAATGAT ATTCATAAGT GCCCTCCTAA 5640 

AATTATCTAC CACCAAAGCC TTTATATTCT TTTTGAGATA CTTGAGCTTC TAAAGATTTC 5700 

50 

ATTGTTTCAA TCGCTACACC AATAACGATA AGTAAACTTG TACCACCAAT CTGAATTGAT 5760 

TGTGGTAATC CCATAAACTT AGTTGCTAAT ATCGGTAGAA TTGAAATAAC GGCTAAGAAG 5820 
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ACGAGTGTAA CCACCTTGAC GTTCTGTGTA 
AAGTGCAGTT TGTGTAGTTT CATCTTCGTT 
5 TGCATTACGA CGAGAAGCTA AATCTCCTTT 

ACTGCGAACT TCTTTTGCAC GAGCTTCTGT 
TGTAGCTAAG TCACGTAACA TAGCTTTACG 

w 

AC CC ATGAGT TAACCTCCTT TATCAATCTT 
ATTTGTATTT AACTTCTTCT AAAGATTTAC 
CTTCAGATTT GTCAGCTAAC TCTTGAACAG 

15 

ATGAACGTAC AGATAAGTCT AATTCTTCAA 
CTTCTTCTTT TTCAATCATG ATTTCAGCGT 

2Q TATTCAAGTG TTCAGTCATT ATTTTTGCTG 

AACCATTAGT CCAAACATCC AATGTTAATT 
TATTTTCAAC AGTATAGTTC ACACGTTCAA 

25 CACCAATTGG TAAATCACTA GTATTATTTT 

TAGCAACTAG ACGAATTTTT AAGTGACCAC 
GGTTTAAAAT TTCAACATCA CTATCATGTG 

30 CACGTACATC AATTTCTAAA GTTTTATCTT 

TAATGTTCAT AATAATTGTA GAAACATCTT 
GTAAAACTCC CTCAATTTCA ATATACTTAA 

35 TACGACGTAA GGAGTTTCCT AGTGTAGTAC 

ACTTACCGAA TTTAGCATCT TCACTAATTT 
CTATCATTTA AATATCCTCC TTATATACGT 

40 

CTGTAACAAT AC CAT CAT AA ATTATACACG 
AGGTACTGGA GTAACGTCTC TGATCGCAGT 
AATAGCTGAT TCACGACCTG GACCAGGTCC 

45 

ATGCTCCATA GCTGATTTAG ATGCAGTTTC 
TTTTTTAGAT CCTTTGAATC CTAATGCACC 
CTCATCAGTG ATAGTTACAA TAGTGTTGTT 
TTCAATATTC TTTTTCACTC TACGTTTACG 



AcGCTCTGCG ATTTCACCAA ATAATTTTTG 234 0 

TAAGATTTCA ACATTACGTA AAGTTTTAGC 24 0 0 

TTTACCTAAA GTGATTAATT TCTCAACAAC 24 6 0 

AG TTTCAAT A CGTTCACTAA TAATAAGTGA 2 52 0 

TTGATCAGAA GTACGACCTA ATTTTCTGTA 2580 

CTTTTCTTAA TCCTAATCCT AAATCTTCTA 2 64 0 

GACCTAAATT ACG CACTTTC ATCATGTCAG 2 70 0 

AATTGATTCC TGCGCGTTTT AAGCAGTTAT 2 760 

TAGACATTTC TAATACTTTT TCTTTTTGAT 2 820 

TTTGCGCTTC ATCAGTAAGA CCAACGAAGA 2 8 80 

CTAATGAAAC TGATTCTTGT GGTGTGATTG 2 940 

TATCAAAATC ACTGCTTTGA CCTACACGTG 3 0 00 

CAGGTGAATA CAATGAATCA ACAGGGATTA 3 060 

GTTCTGCTAA TGCGTAACCT CTACCCTTGT 3120 

CTTTAGATAC TGTTGCAATT TTAAGCTCTG 3180 

TAATGTCGCT TGCTGTTACT TCGCCTTCAT 3 240 

CTTCAGAGTA AATTTTCAAT GCTAATTGTT 3 3 00 

CAACTACATT GTCTACTGCT GAGAATTCAT 3360 

cGGCTGCACC TGGTAATGAA GATAGTAGGA 3420 

CGTAGCCACG TTCTAGTGGT TCAACAACGA 3480 

CAATTGTCTC AATTCTAGGT TTTTCGATTT 3 54 0 

CGACTTAATT TAAAATGTTT GCTCAGTGAC 3600 

ACGACGTTTT GGTGGACGAC AACCGTTATG 3 66 0 

TACTTCTAAA CCTGCAGATT GTAATGCACG 37 20 

TTTAACTGTT ACTTCAACTG TTTTTAAACC 3 78 0 

AGAAGCCATT TGTGCTGCAA ATGGTGTTGA 3 84 0 

AGCTGATGAC CATGATAAAG CATTAC CGAA 3 900 

GAATGTTGAA CGGATGTGTG CTACACCATT 3 96 0 

AGATACTTGT TTACGTGCCA TTTAAAATTT 4 02 0 
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AATCCATCAA CAGACGATGG GCATAGTTTT TGGCTTCATC TAAATTCATT TTAAAGTTTT 540 

TAGGTCCAAA TATCATTTCA CGCTCTACTG TGTCCTCAAA TAATTGAGAT TCGGGAAATT 600 

GAAATACCAT TCCAATTCTT TTTCTTACAG GTCTAATATA TTTATCTTTG GTCTTATGTG 660 

TAATAGTAAT GTCATCAACT GTAACTGTCC CAGTAGTCGG CTTTAACAGC GCATTAATAT 720 

TTTGTATCAA CGTTGATTTA CCACTACCCG TTTGTCCAAC GATGGCGTAA TATTTACCTT 780 

GTTCAAATTC TGTATTAACA TCATGAATAG CTTGATGCTG ATATGGTGTC CCTTTTTGAT 840 

AGGTATAACT TACATTGTCA AACCGTATAG TCATAGTTGA TCCACCAGCC CTTCATAAGT 900 

TAAGAATGAT GTTTGGTGTC CCAGCATTTG ATTTATTTTG ATTGGGAATG GCAAATCTAG 960 

ACCTATTCTT GTTAACTCTT CTGCATTGTC GAAAATTTCA GTCGCTGTGC CTTCTTTATA 1020 

GACAGTCCCT TTATTCATAA CGATAACATG ATCTGCTTCC ATCGCCTCAG ATAAATCATG 1080 

CGTAATAGAA ATGATTGTAA TATTATGTTC TGATTTAACT TTTCTCACTA AATCCAATAA 1140 

ATTTTGACGT GCATCAGGAT CTAACATAGA AGTCGCCTCA TCTAATATAA TGACAGAGGG 1200 

GTTAAGTGCT AATACACTTG CTATAGCCAC ACGCTGCTTC TGTCCCCCCG ATAATGCATT 1260 

AGGTT CATAA TCTGCACGTT CTAACATATC AACTTGTTTA AGTGCTTCGC TGACTCTTCT 1320 

ATGCATTTCG TCATATGGAA CCGCATGATT TTCGAGTCCA AATGCCACAT CGTATTTTAC 1380 

AATTGAACCA ACAAATTGAT TATCCGGATT CTGAAATACA ATTCCTATGT CTTTTCTTAA 1440 

30 CTTTT CAAAA TTATCATCAG TTATAGCTTG ATTATTATAA AAAATTTCTC CAGATTTAAC 1500 

TTTCTCTATG CCAATCATTA ACTTGGcAAT TGTAGATTTT CCAGAACCGT TATGACCAAC 1560 

AATAGATGTC CACTGACCTT TAGGTATATT AAAAGAAACA TCTTTCAATG TGAAGGATGC 1620 

ATCACTTTGA TATTGAAATG AAACATTTTT AAATACAATA ACTGAATTCT TATCCTCCAC 1680 

TTGTCTCTCT CCTTTACGAT TCGTGTATCT ATCATATTTT ACAATATTTA TAAATCGCTG 1740 

TATATGACAT TGACTGGGTT CTCTATATAT TACTAGTATT TTCTGACTCA TTTCTAGTCT 1800 

TTAAAGTGTT GTTTAACAAC TAATGATAAG GACTTTTATT CCTCTCTAAC AATTATGTAT 18 60 

AAACGTTAAT AAAATAAATG ATTTACTAAT ATAGGGGTGG TCGCGTTTGA TTCAACGATA 1920 

ATACTTTCAC TTCATTCAGT TCTAGTGAAA TTGATCAAAC TAGCTTCATC ATATTTTTAG 1980 

ATTCGCACTC AAAAAAGTAA ATATAAAGAA ATCGGACTTA AAAACATTTC TGTTCATAAG 2040 

TCCGATATTT TATTCAATAA AAAAGCGCGC ACCCCATCAT AAGTTTGTTG AGTTCACGCT 2100 

TTAAATCTTT ATTTAGTTGA TGGGGTACTC TGAGCTAGAC AATATTTGTA TGTGGCAAAC 2160 

ATTAtCGTTG CACTCATTTG CTTTATATAA AAGTAGTTAG TGTATTTATA TAAATTCTTA 2220 
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ATAATAACTG AAATTAAAAT TGCTAAATmG TGTTaAgCTA TCGCmACAAT GAAAATwCCG 6 0 

ATTTTGCGTT GTTGAAAATA TCTTTCCAAA CCAAGAATCG ATAATGGCAA TAAATATAAT 120 

5 

AAATTTCCAT AAAATGACCA AGTAAAATTA AAGTATATAA CGACAGTTGA CATGCCGTAT 180 

AAAATCGTAG CGATCATATT TGCTGAGCGT TTAAAGTGTA ATATTTTAAA TAAGTAGAAG 24 0 

GTCACGACAA ATGTTATGAT AGCTCGTATC ATAGCCATAA TAAGTTGGTT TGTCGGCCAA 3 00 

10 

AAATGTATTG TCGTCGGATT AAATATACCA ACCGTTTCTC CTATTTTAAT GAAkAGAAAA 3 60 

TTTAGCCACA TTAAAGGTGA CAGCGAATAA TAATnTGATA GTCCTTTCAT ATAATCGCCA 420 

15 CCTAmTCCAA ACGATGCATC ATrTAAACTA GAAnAACTAC GTAGATGTTC ATACAnATAC 4 80 

ATTTGAAATG GCATCATTTG ACGGAATCCA TCTCCAGCCC CGCTAAAAAC AGTACCATTC 54 0 

ACAATATAAT CATAGATATG AGTAGAAAAT AAAATAAGCG TTAATATTAC ACTAATGAAA 600 

20 GTTATAACAA AGAATTGTTT GACGTTTGAA TTTAGCCACT TTTTTAACAC AACATTATCC 660 

TCAACTTTCA AATTTAAAAT TAAGTTTAAC TGAAACTAAA GTTAATGAGG TTCTTGATAG 720 

GTAAAGACGA AGATGACTGT GGAACAGATA CCTTATCATA GTTACTTAAA CTTTGGATCA 730 

25 TTTTCAGTTT ATCATTAAAC AAATATATTG AATAATAAAa aTGTCATACT GATAAAGATG 84 0 

AATGTCACTT AATAAGTAAC TTAGaTTTAA CAAATGATGA TTTTTAATTG TAGAAAACTT 90 0 

GAAATAATCA CkTATACC 918 

30 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 97 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

40 

TCGCCCnATA ATCAATTTAT TTTTCATGTG CCACTCCTAT ACAAGCTnAC AATGCTTCTT 6 0 

CAGTTAAGGC AATATCTTTT AATTTTGTTT GATATTTTTG TTCAAAGTCA TATTGTAACT 120 

45 GAACAATTTC TGGCAAACCA ATATGCCAAT CCGCCAATTT TTTTTTAyCT TtGAAGAGCT 18 0 

CTTTTGGTGA TGkTTGcGAC ACTATACTAC CTTCTTTCAT AACGATGACT TCATCTGCAT 24 0 

AACGCGCGAC TTCATTCATA TCATGTGAAA TTAGGATAAT TGCCTTATTT TCATCTGTTT 3 00 

50 GTAGTGACTT TAGTAATCTC ATTACTTGTC GTTTACTTTG TGGATCAAGT CCTGCTGTAG 3 60 

^^"AT^AA^ ^A^HATAATA rT, rAr T nA w, °A T^^PAATA"' ^ATa^A A""* ~<~ a ~~t— - 
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CTATATTACT TGGACTTAGA TTCAAAACAT GTTTCATCAT TAAAGCTTGA AGAAGAAAGT 3 360 

CGTTCAGAAG TGACCAATGT ACATATCATT AATTTAATGA TTGATGGCCA AAAGGTTGGC 34 2 0 

5 TTTATCGCAT TGGAGCAGAT TGGTGAACGC ATGAACATTG CTGCTATTGA AGTGGATAAA 34 80 

TCATATCGCT TTAATGGTAT TGGTTCAAGT GCTCTGCGAC AATTGCCAAC TTACTTAAGA 3 54 0 

AAAAACTATG ACAACCTTAA TGTGATTACG ATGATTCTGT TTGGAGAGAA TAATGATTTT 3 6 00 

w 

AAA CC ATT AT GTTTAAATAG TAATTTCGTT GAAATCGAAC AAACTGATGA TTATGTCGTT 3 660 

TTCGAAAAAT ATTTAAATTA CTAACAGTGA TTGCGAAATA TGATATTGTC ATTTATAATT 3 720 

TAGTTTTGTT ACTATATATA AATGAATTCA GACGTATAAA TTTAGATTAT ATCCTTCGAA 3780 

15 

AGGAAGTATT GGGCAATGAA AATTCAAGAT TATACAAAAC AAATGGTTGA TGAAAAATCA 3840 

TTTATTGATA TGGCTTATAC ATTATTGAAT GATAAAGGCG AAACAATGAm mTTATATGAT 3 900 

20 ATyATCGATG AATTTAGAGC GTTAGGTGAT TATGAGTACG AAGAAATTGA AAATCGTGTT 3 96 0 

GTACAATTTT ACACGGATTT AAACACAGAT GGTCGTTTTT TAAATGTTGG AGAAAATTTA 4020 

TGGGGATTAC GTGATTGGTA TTCGGTAGAT GATATTGAAG AGAAAATCGC ACCAACTATT 4 080 

25 CAAAAATTCG ATATTCTGGA TGCAGATGAT GAAGAAGATC AAAACTTAAA ATTATTGGGC 414 0 

GAAGATGAAA TGGATGACGA CGATGATATT CCAGCTCAAA CAGATGATCA AGAAGAACTA 420 0 

AATGATCCAG AAGATGAGCA GGTTGAAGAA GAAATCAATC ATT CGG AT AT AGTCATTGAA 4 260 

in 

GAAGATGAAG ATGAACTAGA CGAAGACGAA GAAGTGTTTG AAGACGAAGA AGACTTCAAC 4 320 

GATTAATTTT TTGTTTGACT TTTAGTTGAA AGATGATAAA ATTTTATTCG GGCTCCTTTA 4 38 0 

AATAGGACAC GTGTATAAAA TTTATACGCT CCCCTTACAG AATTTGTGAG AGGGAGCGTT 444 0 

35 

TTTTtATTTA ATTGAGTAAA TCAAGAAATG ATAACGCAAA AATCAAAGTT GTAAATGATA 4500 

TACAJAGTGA CATAGCAGTA TGGAAACGGT AAGTAAACAG AATTTAATTT TGTCGAtTCG 4560 

ACAAtAAaCA aCTtGAaTGA GCTTGCTTTA ATGTTATGTn nTACGTAATT TTTACAATTG 4620 

40 

ATGAGGAAGC ATTCCCTTTA ATAATTAGGA GGTCAAGACA TGACAAAATT TATTTTTGTA 46 80 

ACAGGTGGCG TAGTTTCATC CATTAGGGGA AGGGT 4715 

45 (2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 918 base pairs 
{B} TYPE: nucleic acid 
(C) STRAND EDNESS : double 
so (0) TOPOLOGY: linear 
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ATGGAACAAT ACGTATTCAA AACACTTAGA 
TTATTTAAAA CAACGCGCAT ATAAATGGTC 
5 ATAGCACCTG nAAAATGCGC CGTTTTCAAC 

ATAATCTTCC ACAACTTTGC GTAATAAAGC 
GATATTTTCA GTTTTAAATT CACGTGCAAC 

w 

AACGACACCA ATAACGGCTG CTAATTTATT 
TAGTACATGA CCAAAATTAG CTGCTGTTAA 
ATAAATATGT CTAACCTTTA AATCGATAGT 

15 

TAACTGTTTA TAATCAGTGA TTTGACTTAG 
ACCTGTACCG ATACCGCCTA CACGACGTTG 

20 ACCGGTACCA ACATTTGCAA AAATATAATC 

AATACCTAGT CCTTGAGATG CAGCATCAAA 
GTTTTCAGCA ATGACACCTG CATTACCTCC 

25 GTTTAACCAT TCCACAACTT GATCAATATT 

GTTATCTTGC TCTTGAACGA TTTTAATTAG 
AAGATTCCCA CCTCATTATT AATGTCTATC 

30 AAACAAGTAA TAATAGTAAT TATTAACAAA 

cGTATCGCAA TTAAtGTTTT ACAAACGTGG 
ATTTTGAAGT TGTATAGGAT TGTTAGTTAG 

35 

GTGTATTTGT AAGTCTCTGA TTAAAATGAT 
AATTTAAAAA ATATAAATAA GTAATTTATT 
AAATAATAGG TAAAATAATA TGGGGATTCA 

40 

TATATGGCTA AAGAACTTTG TTTTGAAGGT 
CGTTCAGCAA TTAATGATTT TGACTTGAAT 

45 AAAGAAGTTA TTGATGATGC AATTAATGAT 

GATAAAAATG AAGTGGTGGG CTTTTTTGTA 
GATACACCTG AAAATGTCGT TTATATTCGT 

SO TTTGGATATG GCACGAAAAT AATGATGTCA 

GATTTTAATC ATCTATATCT AGTAGTAGAT 



CCATAAAATA AAAGGCCATT TATATAGCGT 1560 

TTTTTCTATT TTTCTAAATA TAATGCACCA 1620 

ATAGTACGGT TTGCAACCGC GTAACACAGT 16 80 

GTTATTATGA AATGAAGAAC CGATATAAAC 174 0 

AGTAATGGCC ATTGTCGTAA CAACTTCGCC 18 00 

GCTAGGTGTA AAATCAGCAT CTAAATGATG 186 0 

ATCACCGGGA ATGGGTGGTT CGGTATCTTT 1920 

GTTACGATCA CCGTGTTGTG CCATGTCAGT 1980 

TAAATAACCG AGTCCTTGAA TCATGCCTCC 204 0 

TGATTGGCCG TCGAAATAAT GTAGTGACGT 2100 

TGCTAAGTCA TGGCCTTGCT CTTTTAACAA 2160 

CTCTACAAAA ATTTGTGCAG GAATGTTGAT 2220 

AGTTAAGCAT AATTTTTCAA TTTGCTGTTG 22 80 

TTTAGTTAAT TCAGTTTTAA AAGTACGTTG 2 34 0 

TGTACCGCCA GCGTCAATGC CAACTTTCAT 24 00 

CTTAAATAAT AGTATAGTAA AATGACTAAA 24 6 0 

TTTGATGCCa TTGCATTTCA ACATTGTAAG 2 52 0 

ACGTTAAGTt ATATATATTA TTTTCTAGGA 2 5 80 

TGACGCAATA TTAAAAGTAG TTCGTACGCA 2640 

AAGTAATGAG GAATAGTACA TTAATTTTGA 2700 

TAACTTAGAG CAAATAATGG TATCGTAGTG 27 60 

TGCTTCATAT ATAAAAAGAT AGGGGTTAAA 2 82 0 

ATCACTTTAA AAG CATTTG A TGAACAATAT 2380 

GAAAGACAAC AAATATATTC ATCTTTACCT 2 94 0 

GCTGATAGGA TTGCTAACGT AGCAwTAAmC 3 000 

TTACATCGTT ACTATCAGCA TGAAGGTTAT 30 6 0 

TCATTATCGA TTAATGAAAA ATATCAAGGT 3120 

TTGCCGCAAT ATGTTCAAGG TGTATTTCCT 318 0 

GCGGAAAATT, ACAATGCTTG OA AO^.f^A^ tm ^ 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4715 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: i 

10 

GAAn CAGnTA GACAAATTAT GGaAAmCGGT 
TTTGACCTAC TCGTCGATGA GGATGATAAC 
AATGGTTCaA CGAGCATGTT ATTACTTGCT 

15 

TATAGTTATC ATTCAAAAGG TGATAACACA 
AAAGAAGGTA GTTTATACCC GTTATCTTAT 
GTTAAATCAA GGTTTGGCTG TATTTGGCAT 

20 

GAACGCGCAT TTTTAGCTGA ACTTGAACAC 
TGTTGTGTTG CCAGTAGTTG ACTGAATATG 

2$ CAACAGTAAT TAAATTTATG ATATGGCAAT 

TTTATTTCAC TATGTTATAA GAGTTGCCAT 
GTTTTGAAAT TTTTAGTAAC TAGATATGTT 

30 AAAATTTAAT GCTATTGAAT TTTTAAAATG 

ATGTCTAATT GACTAACTTG TTGGAGTCAT 

AGTGAGGGTC AAGCATGTCT TATAAAGCAT 

35 

AATGTATTTA TTTCGCCTCT AAAAATAAAA 

CGnAlGTAGG CGTTTGGACA GAAGAAAGTG 

TTC CATTTGA TAAAGTTGTA AAAATGGACG 

40 

AATTGTTTGA TGAACAAGAC CATATTATTA 
TACTAAACGT TGTAGCTGTT ACACAAGAAG 
AAGAATTTGT CCAAGATGTA GCGAAATATG 

45 

GTAAGCAGTT TATTCTCATT AGTGAAAATG 
CTGTATGGAG TATTAAAAAC AGAGCGTTAA 
50 ATTTAATTAC GATTGAAGGT TCTGTTTTCG 

ATAAAGCCGT TGCGATAGAT TTAAAAACTG 



IEQ ID NO: 203 : 

GTGAATCaAG GATTCtTTGG TGTAGCTGGT 60 

GTTTATGCGA TTGATTTAAA CTTTAGACAA 120 

AACGAGTTGA ATTCAGGATA TCAAAAGTTT 180 

CATTTCTTCA ATACGATTTT GAAATATGTC 240 

TATGATGGTG ATTGGTACGG TGAAGATAAA 300 

GGTGATTCAA AAGAAACAGT ACTGGAGAAT 360 

TATTAGAGTT CGGAACATAA GGCGCTACAA 420 

CGTTTGTAAC AAGCTTTTTT CGATTCTAGT 480 

ACTTTGTAAT ACTAATATTA AATGGCGACT 54 0 

TTTGTTGATA AAGGTATACT AAAGGTTATC 600 

TCGTGTTATA GACCGAATTT GTGTATACGT 660 

AAAAACATGA CATTAAATTG AATTCATAAT 720 

TTACTATTTT ATGTATGACA TATTTTAAAA 780 

ATCCATTCTT TAGAGATATA TTAATAAATG 840 

AACTAGTACG C CT AAATT AT AAAAGTGAAG 900 

TGGCCGTATC ATTTTTAACA AGTCGTGATA 960 

TTGATCGTTT TGCTACTTAT GAATTAGATG 1020 

TGAATCAAAC AATGGAAGAw GAAGGGCATC 1080 

TGATGACGGA ATTAGATAAA ATTAGAATCA 1140 

ATGAAGTATA CGGCTTAACT AAAAAAGGTA 1200 

ATAGCGACGA AAAAAAGCCG CATATTATGC 1260 

AAGTTCGAGA TGAAGATTTT GAAGAGTGTG 1320 

GAGAATGGCT AGATGAACTT AGAGATGATC 13 80 

GCGTGGTTGG TACAATTGTT TCAGCGCAAA 1440 
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AAATGGTATG GGAAATCACA TGCTTGTTAT 
ATTGCCTTTG TAGATGCTGA TGTAACTTTC 
5 AATCAGTATC AATTACAAGG TGAAAAAGGA 

AAGCGTTTCT ACGAAGGGTT TTCAGCGATA 
GTATTTTCTA CCTTAGACGA CGGTCGGACT 

W 

ACAAATAAAG AAGATTATTA TGCAACTGGA 
GAAGGATTTG CTTTAGGAAG TGCATATACT 
GGGTTTCCAT TTGTTGCATT TCGCATGTAT 

15 

TGGACAAAGC ATTTGTCAAC TGGGGCAGGT 
GTGTTGTGGT TGTTTGGTTC TATAGCGAGT 

20 CGCCAAATGT CTGTAAGAAA AATGGTAGCA 
TATCTGCATC GAAGGGTCGG CCAATTTTCT 
TTTATGTTTT TTACTAAAAT TTTCATCCAA 

25 GTTGAATGGA AAGGTCGTCA ATATTCTATA 
TTTCAATATA GGAGGACTAG TATGACAATG 
ATCATGAAGA AACATTCAAA AAGCTTTTCT 

30 AGAAAAGCGG TTTGGGCAAT TTATGCTGTG 
TATGGCGATA TTCAATTTTT AAATCAAATA 
CCATATGAAC ATCATCACTT T CAAAGTG AT 

35 

GCACAACATA AAAATATCGC CTTTCAATCT 
GATCAACATT TTACAATGTT TGAAACGGAC 
GCTGGTACAg TAGGTGAAGT ATTGACGCCG 

40 

TACGATGTCG CAAGAAGACT TGGTGAATCG 
GGTGAAGATT TTGACAATGA ACGGATATAT 

45 GTTGATATTG CTGAAGTGTA CCAAAATGGT 
TATTATGCAG CTATCGCAGA AAAAGATTTT 
AGTATTGAAG CACAACCAAT CATAGAATTA 

50 GAaGTGAGaC AGGCTAACTA TACATTACAT 
GCAAAGTTGT TTCA 



CAAGGTGTGA CGCATGCATG TACGAATCGC 540 

TTAAGGAAAG ATGCTGTTGA AACGTTGATT 6 00 

TTGTTAAGCG TACAGCCTTA TCATATAACA 6 60 

TTTAATTTAA TGACAGTCGT TGGTATGAAT 72 0 

AACCAGCATG CATTTGGACC GGTGACATTA 780 

GGTCATAAAA GTGCAAACCG TCATATTATT 84 0 

TCACAATCAT TGCCCGTAAC AGTTTATGAA 900 

CAAGAAGGAT TTCAGTCATT ACAAGAAGGA 96 0 

GGCACAAAGC CTAAGATCAT GACAGCAATT 1020 

ATTTTAGGGC TATGTCTTAG TTTAAAATAT 10 80 

CTTTACTTGA GCTATACTAC ACAATTTATT 114 0 

AATTTATTAA TGGTATGTCA TCCATTGTTA 12 00 

TCTTGGAAAC AAACGCATCG TTATGGTGTA 12 6 0 

TCTAAAGAAC AATAAATCAA GGTAATGGCA 13 20 

ATGGATATGA ATTTTAAATA TTGTCATAAA 13 8 0 

TACGCTTTTG ACTTGTTACC AGAAGATCAA 14 4 0 

TGTCGTAAAA TTGATGACAG TATAGATGTT 1500 

AAAGAAGATA TACAATCTAT TGAAAAATAC 1560 

CGTAGAATCA TGATGGCGCT TCAgCATGTT 1620 

TTTTATAATC TCATTGATAC TGTATATAAA 16 80 

GCTGAATTAT TCGGATATTG TTATGGTGTT 174 0 

ATTTTAAGTG ATCATGAAAC ACATCAGACA 180 0 

TTGCAATTGA TTAATATATT AAGAGATGTC I96 0 

TTTAGTAAGC AACGATTAAA GCAATATGAA 1920 

GTTAATAATC ATTATATTGA CTTATGGGAA 1980 

CAAGATGTTA TGGATCAAAT CAAAGTATTT 2 04 0 

GCAGCACGTA TATATATTGA AATACTGGAC 2100 

GAACGTGTTT TTGTGGaTAA GAGGAAAAAG 216 0 
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ATAACGGCGA TAAATGTCGC ATATGTTGGC AT CATTGG AT TCATTGGTAT GGTGATACCG 5460 

CAACTCATTA GAAAATGGCA GTGGAAACAA TCATTAGGAA GACAATTGGC TTTAAATATT 5520 

5 GTAACTGGAG GACAAATAAT GGTTATGGCA GATTTTATTG GTAGCCATAT ATTGTCACCA 5 580 

GTACAAATAC CGGCAAGTAT TATCATTGCA TTAATTGGTA TACCAGTGTT AtTTTACaTG 5640 

CkAAwAtCtC aGTCgAAAcG GTTACaCTAG CACACGACaT TTGCTAAAAT AAAAATAACT 5700 

10 ATAAACATAA AGAGGGCATA AGCGATGGAT TTGAATCAAA TTAAAGCAGT TGTATTTGAT 576 0 

TTAGAAGGTA CGTTGTTGGA CAGAGTTAAA TCTCGAGAGA AATTTATCGA AGAGCAATAT 5820 

GAACGATTTC ATGACTACTT AATTCATGTT CAACTGGCAG ATTTTAAAAA AgCATTTATT 58 8 0 

15 

GAGCTAGATG ACGATGAAGA TAATGATAAA CCTGATTTAT ATAAAGAAAT CATTAAACGT 5940 

TTCCATGTAG ATAGGTTAAC TTGGAAAGAC TTATTTAATG ATTTTGAAAT GCATTTTTAT 6000 

CGTTATGTAT TTC CTTATTA CGATACTTTG TATACACTAG AAAAg CTATC GCAAAAAGGC 6060 

20 

TTTCAAATTG GTGTTATCGC AAATGGTAAA TCTAAGATTA AACAATTTCG ATTACATTCA 6120 

CTTGGTTTGA TGCATGTTAT TAATTATTTA TCAACATCAG AAACAGTTGG TTTTCGTAAA 6180 

25 CCACATCCTA AAATTTTTGA AGATATGATT GATCAACTAG GGGTATTACC TG AG CAAATT 6240 

ATGTATGTTG GCGATGATGC GTTAAATGAT GTAGCTCCAG CACGAGCTAT GGGCATGGTT 6300 

AGTGTATGGT ATA 6313 

30 (2) INFORMATION FOR SEQ ID NO : 2 02: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
35 (D) TOPOLOGY: linear 



" (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

40 

CCGTAAACAC ATCAACAAAA GAAGGCTATA TTACAAAAGA AGACTTGGAC TTATGCTGCA 60 

CGTCGCTCTA ATTCAGCTGG AATGCAAGTC ACCGGACGAC TGGCTTACAT TGAACCTTAT 120 

GGGGCAACAA GTCGCACAAA ATAAACGCGC GAGAAGCaAG AATAGGAAGT GATATCTATG 180 

45 

AAATGGTTAT CACGAATATT AACAGTAATA GTGACCATGT CtATGGcGTG TGGTGCATTG 24 0 

ATATTTAATC GTAGACATCA GCTAAAGGCG AAAACGCTGA ACTTCAATCA TAAAGCATTA 300 

50 ACAATTATTA TTCCGGCTAG AAACGAAGAA AAAAGAATAG GTCATTTACT ACATTCGATA 360 

ATACAACAGC AAGTTCCAGT AGATGTCATT GTTATGAATG ACGGATCGAC AGATGAAACA 420 
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AACG CAGTTG GATgCTACCG CACAACTGCA TAAATCCCTC TaATCgcTAA AGCGAAAAGT 3 66 0 

GGGATTAAAA AGGAGATGTG ATAGTGTGAA GAAATCGTTA ATTGCTTTTA TTTTGATTTT 3 72 0 

5 TATGCTTGTC CTGAGTGGCT GTGGTATGAA AGATAATGAT AAACAAGGTA GCAATGATAA 3 78 0 

TGGCTCGTCT AAATCGCCGT ACCATAGAAT TGTTTCGTTA ATGCCTAGTA ATACTGAAAT 3 84 0 

TTTATATGAA TTAGGATTAG GTAAATACAT AGTTGGTGTT TCAACGGTTG ATGATTATCC 3 900 

10 

AAAAGATGTG AAAAAGGGTA AGAAACAATT TGATGCTTTG AATCTAAATA AAGAGGAACT 3 96 0 

TTTAAAGGCA AAGCCAGATC TAATTCTTGC GCATGAGTCG CAAAAGGCAA CTGCTAATAA 4 02 0 

AGTATTGTCA TCATTAGAGA AACAAGGCAT CAAAGTAGTG TATGTTAAAG ATGCACAATC 4 0 80 

15 

AATTGATGAA ACTTACAACA CATTTAAGCA AATTGGGAAA TT AACG CATC ATGATAAGCA 414 0 

GGCTGAACAA CTTGTTGAGG AAACTAAAGA TAATATCGAT AAAGTCATAG ATTCAATTCC 4 20 0 

TGCTCATCAT AAAAAATCAA AAGTATTTAT TGAGGTTTCA TCAAAGCCTG AAATATATAC 4 260 

20 

AGCAGGGAAG CATACATTTT TTAATGATAT GTTAGAAAAA TTAGAAGCCC AAAATGTGTA 4 32 0 

TAGTGACATT AATGGTTGGA ACCCTGTAAC GAAGGAAAGT ATTATTAAAA AGAACCCAGA 4 3 80 

25 TATATTAATT TCGACGGAAG CTAAGACAAG ATCAGATTAT ATGGATATCA TCAAAAAAAG 4 44 0 

AGGTGGATTC AATAAAATTA ATGCTGTCAA GAATACACGT ATTGAAGTTG TAAATGGTGA 4 50 0 

TGAAGTATCA AGACCAGGTC CACGTATTGA TGAAGGATTA AAAGAATTAA GAGATGCAAT 4 56 0 

30 TTATAGAAAA TAAACCATTC TAATTATGCC CCTTATTGCT ACATGTAAAA AATACATGTT 4 62 0 

TGAGATAAGG GGTTTTTaAA ATATATTTAG TGAATGATAG CAACGCGAGT ATGTGATTGC 4 68 0 

TATAATGAAT GTAATTATCG ATGAAcaaAA GAGAATGCTA TGACATTTAA TAAAGTATTA 4 74 0 

35 TTGAGCTGGa TAGTCmTATT GATTATAACA ACTAGCATAT ATCTATTTTG GCAGTTGGGC 4 800 

GATATCAATG ATGTATTTAA CCAGTCTATT TTAATCAATG TTAGATTACC GAGATTATTA 4 86 0 

GAAGCATTGT TGACAGGTAT GATATTAACT GTTGCAGGCC TTATATTTCA AACAGTTTTA 4 92 0 

40 

AATAATGCAT TGGCAGATAG CTTTACATTA GGATTGGCAA GCGGCGCTAC ATTTGGTTCA 4 98 0 

GGATTAGCAT TATTTTTAGG TTTAACAACG TTATGGATTC CTGTATTTTC AATAACATTT 5 04 0 

AGTTTGATAA CATTAATAAC TGTATTAGTC ATTACGTCGG TATTGAGCCA AGGCTATCCA 5100 

45 

GTTAGAATCT TAATATTAAG TGGTTTAATG ATTGGTGCGT TATTCAATTC ACTTCTATAT 516 0 

TTTTTGATTT TATTAAAACC TCGCAAATTA AATACAATTG CCAATTATCT GTTTGGTGGT 522 0 

so TTTGGTGATG CAGAATACTC AAATGTATCT ATAATAGCAA TCACATTTAT CATTGCATTG 5 280 

TTTGGTATAT TTATCATTCT TAATCAACTA AAGTTATTGC AATTAGGAGA ACTAAAAAGT S^4 0 
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ATTATGGACG AaGTTGGCGT TGACGCTGCA CGTTATTTCT TAACTATGCG TAgTCCTGAT 1860 

AGTCACTTTG ATTTTGATAT GG AATTAGCG AAAGAGCAAT CTCAAGACAA TCCAGTTTAC 1920 

TATGCTCAAT ATGCACATGC GCGTATTTGT TCAATTTTAA AACAAGCGAA AGAGCAAGGT 1980 

ATTGAAGTGA CTGCTGCGAA TGATTTTACA ACGATTACTA ATGAAAAAGC GATTGAATTG 204 0 

TTGAAAAAAG TAGCTGATTT CGAACCTACA ATTGAAAGTG CTGCTGAGCA TAGATCGGCA 2100 

CATAGAATTA CTAATTATAT TCAAGATTTA GCTTCTCATT TCCATAAATT CTATAATGCT 2160 

GAAAAAGTGT TAACAGATGA TATTGAAAAA ACAAAAGCAC ATGTTGCTAT GATTGAAGCG 2220 

GTCAGAATTA CATTGAAAAA TGCATTGGCA ATGGTCGGTG TAAGCGCACC TGAATCAATG 22 80 

TAAGAACATT TATATACACT CCAACGTAGA GTTTCTCGAA AGATACTTTG TGTTGGAGTG 2340 

TTTTTTTTAG GTATGTGACA TATTGGGGAA TGCTTAGTAT GTGAATAAGG TTAAGAGGAA 2400 

CACAGTTGGA TGCTCTGCAC AACTGCATAA GAGAGCCTGA GACATAAATC AATGTTCTAT 2460 

GCTCTACAAA GTTATAATGG CAGTAGTTGA CTGAACGAAA ATTCGCTTGT AACAAGCTTT 2520 

TTTCAATTCT AGTCAACCTT GCCGGCGGGG CCCCAACAAA GAGAAATTGG ATTCCCAATT 2580 

TCTACAGACA ATGCAAGTTG GGGTGGGACG ACGAAATAAA TTTTACGATA ATATCATTTC 264 0 

TGTCCCACTC CCTCTAAAAT GGAGGGTGTA AATGTTAGGA ACTGATGAAT TATATAAAGT 2700 

TTTATATGAA CATCTCGGAC CACAATTTTG GTGGCCTGCT GATAATGACA TTGAAATGAT 2760 

30 GTTAGGTGCA ATTTTAGTTC AAAATACTAG ATGGCGAAAT GCAGAAATTG CATTGAATCA 2820 

GATTAAAGAA CATACGCATT TTAATCCAAA TCATATATTA GAACTACCTA TTGAAACGTT 2880 

ACAATCATTG ATACATTCAA GTGGCTTTTA TAAAAGTAAA TCACTGACGA TTAAAACATT 2940 

ATTAACATGG TTAGCACGAC ATCATTTCAA TTATCAAGAG ATTAATGAGC GATATAAAGG 3000 

TGGATTAAGA AAAGAATTAT TATCTTTGAA AGGTATTGGA AGTGAAACAG CAGATGTCTT 3060 

ACTTGTTTAT ATATTCGGAC GTATTGAATT TATTCCAGAT AGCTATACAA GAAAAATATA 3120 

TGATAAATTA GGATATGAAA ACACTAAAAA TTATGATCAA TTAAAAAAAG TAGTCaCATT 3180 

ACCAAATCAT TTTACAAATC AAGATGCTAA TGAATTTCAT GCTCTGTTAG ATGTATTTGG 3240 

TAAACATTAC TTTAGAGACA AAGATATAAA GAATTATGAT TTTTTAGAAC CTTACTTTAA 33 00 

AAAGTAAACG CTGTGAAGTT AGATAGATGA GTTTATATGA AATATAAAAA ATAATTTACT 33 60 

ATTTTCTTTT AGTATGTGGA CTTATATAAT AAATAGAAGC ATATAAAGAA AAAAACAGTT 3420 

50 GTTTGTTTGT GCAGCAACTG CATAAGAGCC CCTAATCGCT AAAGCTCAAG GGGAGTAAAG 34 80 

GAATACAGTT GTTTGTGCAG CAACTGCATA AAAGCCTCTA ATCACTAAAG GTGAAGAGGA 3540 
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GGTTTTCnTG GAAAGATAGT GAAAATCTCG 
TTTaTAAAAT GGCTTACATA TATGAAGCGT 
5 ACCTATTTAG CTTTAAGAAG GCATAACAAG 

TTATGCGAAG ATTAAGGTGA GTAGTAAATT 
AGTGTTGAAA CAGCACAACG AAAAAGAAAA 

w 

ACAAAGGCAA TCTAACTTTA TTCGGTATGT 
TACAATAAAA GTGGATGATG ATAGCGTTAA 
TTTGCATTTT GTTGAAGGAC AAACGACAAC 

15 

TCCACTAGAA GTTAAAACAT TACGCATTTT 
AAAGATTCAT TATGAATTAT ATCAAGATAA 

20 TAACTATAAG GAGATAGGCG AATGAATATT 

GAAATTGCAG CAAGTATTAA CAAAGCAGGA 
GAAGTTCCTA AAGATACAAA AAATGGAGAT 

25 AAGATTGCAA AGCGTAATCC TCGTGAAATT 

GAAAAAGCAC ATGTAAAACA AATTGACATT 
GATAATCAGT ATTTAACAGC AATTATTCCT 

30 CATGTAAATG AATCAAAAGG TCAAAATGTA 

GGAGATTTAC ATATTGGTCA TGCTAGAAAT 
TTAACTGCAG CTGGCTATAA TGTAACACGT 

35 

ATTACTAACT TAGCGCGTTC GATTGAAACA 
TATTCAATGC CAGAAGATGG CTATAATGGA 
GCAGAGAAAC ACCCTGAAAT TAAAGATTAT 

40 

AAATTAGGCG TAGAATACGA AATGGCTAAA 
CATTTTGATA ATTGGTTTAG TGAAaCATCT 
TTAGCAAAAA TGAAAGAATT AGGTTATACG 
ACAACTGATT TTAAAGACGA CAAAGACAGA 
TATTTCTTAC CAGATATTGC GTACCACTTC 
50 ATCGATTTAT TTGGTGCTGA TCATCATGGT 

ACGTTTGGTG TAGATAGTAA TCGTTTAGAA 



TGTTTTTTGG TTTTgAGGTG TTGTTTGTAT 6 0 

TGATTAAGTA TGGAATTGTT AATTAATTGA 12 0 

ATGACCTTAT TTTATGCTAT AATATTTCTA 18 0 

GGATAAAAAA GTAAGTATTC AAACAAAGCA 24 0 

ATTTGAATTT ACTACTGAAG GAACTTGGCA 3 00 

AGAACAAATT GAGGATGCAA CAGTTAATGT 36 0 

GTTGATTCGT AAAGGCGACA TTAATATGAA 42 0 

AACTTTTTAC GATATATCGG CTGGACGAAT 480 

ACATTTCGTA AGTGGAGACG GTGGCAAGCT 54 0 

TGAAAAAATG GGTTCTTATC AATATGAAAT 600 

ATTGATCAAG TGAAACAAAC ATTAGTAGAA 66 0 

TTAGCAGATG AGATTCCTGA TATTAAAATT 72 0 

TATGCTACTA ATATTGCGAT GGTACTGACT 780 

GCTCAAGCGA TTGTTGATAA CTTAGATACT 84 0 

GCTGGTCCAG GATTCATTAA TTTTTACTTA 900 

GAAGCAATTG AAAAAGGTGA TCAATTTGGA 96 0 

TTGCTTGAGT ATGTTTCAGC TAACCCTACA 102 0 

GCAGCAGTTG GTGATGCTTT AgcTAAtATT 1080 

GAATATTATA TTAATGATGC TGGTAATCAA 114 0 

CGTTTCTTTG AAGCTTTAGG TGACAATAGT 1200 

AAAGATATTA TTGAAATAGG TAAAGATTTA 12 60 

TCTGAAGAAG CACGTTTGAA AGAATTTAGA 13 2 0 

TTGAAAAATG ATTTAGCAGA GTTCAATACG 13 80 

TTATATGAAA AAGGAGAAAT TCTTGAAGTT 14 40 

TATGAAGCTG ATGGCGCTAC ATGGTTACGT 150 0 

GTATTAATTA AAAATGACGG TACATATACG 1560 

GATAAAGTAA AACGTGGTAA TGACATTTTA 16 2 0 

TATATTAATC GTTTGAAAGC ATCTCTTGAA 16 8 0 

ATTCAAATCA TGCAAATGGT TCGTTTAATG 1 740 
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AATCCTACAA ATTGGTTGGA TCAATTGGTG ACACCATACG AAGAAGAAGC ACAACAAGCG 324 0 

CAACTTATTC AACTACTAAC AGACTTATCT AAAGTATTTA TCACAGCTGC TTATGATGCT 330 0 

5 TTAAATAAGG CGTATGATTT GTTTAGTATG ATGGATAGCG TCGATAAACA TTTAGCTGTT 3 3 60 

ATAGAAGATG AACGACGTTT AATGGGGCGT GTTTTAGAAG GTGGCTTTAT TGATATACCT 342 0 

TATTTAACTG GTCACGAATT TGGCGCGCGT TTGCCTAATG TAACAGCGAA AATTAAAGAA 34 8 0 

10 GCAAATGAAA TGATGGTCGA TGCCTTAGAA GATGCTAAAC TTCAGTATAA AAAATATAAA 354 0 

TCATTAATTG ATAAAGTGAA GAGTGATTAC TTTTCAAGAG AAGCTGATGA TTTGAAAGCT 3 600 

GATATGCAAC AATTGGCGCC ACGAGTAAAG TACCTTGCGC GTATTGTGAA AGATGTTATG 3660 

15 

TCAGAATTCA ATCGAAAAAA GCGTAGCAAA AATATTTTGG ATTTTTCTGA TTATGAACAT 3720 

TTTGCATTAC AAATTTTAAC TAATGAGGAT GGTTCGCCTT CAGAAATTGC CGAATCATAC 3780 

CGTCAACACT TCCAAGAAAT ATTGGTCGAT GAGTATCAAG ATACGAACCG AGTTCAAGAG 3 84 0 

20 

AAAATACTAT CTTGCATCAA AACGGGTGAT GAACATAATG GTAATTTATT TATGGTTGGA 3900 

GATGTTAAGC AATCCATTTA TAAATTTAGA CAAGCTGATC CAAGTTTATT TATTGAAAAG 3 960 

25 TATCAACGCT TTACTATAGA TGGAGATGGC ACTGGACGTC GAATTGATTT GTCGCAAAAC 4 020 

TTCCGTTCTC GAAAAGAAGT ACTGTCAACG ACTAACTATA TATTCAAACA TATGATGGAT 4 0 80 

GAACAAGTCG GTGAAGTAAA ATATGATGAA GCGGCACAGT TGTATTATGG TGCACCATAT 4140 

30 GATGAATCGG ACCATCCaGT AAACTTAAAA GTCCTTGTTG AAGCGGATCA AGAACATAGT 420 0 

GATTTAACTG GTAGTGAACA AGAAGCGCAT TTTATAGTAG AACAAGTTAA AGATATCTTA 4260 

GAACATCAAA AAGTTTATGA TATGAAAACA GGAAGCTATA GAAGTGCGAC ATACAAGGAT 43 20 

35 ATCGTTATTC TAGAACGCAG CTTTGGACAA GCTCGCAATT TACAACAAGC CTTTAAAAAT 4380 

GAAC^TATTC CATTCCATGT GAATAGTCGT GAAGGTTACT TTGAACAAAC AGAAGTCCGC 4440 

TTAGTATTAT CATTTTTAAG AGCGATAGAT AATCCATTAC AAGATATTTA TTTAGTTGGG 4500 

40 

TTAATGCGCT CCGTTATATA TCAGTTCAAA GAAGACGAAT TAGCTCAAAT TAGAATATTG 4560 

AGTCCAAATG ATGACTACTT CTATCAATCG ATTG 4594 
(2) INFORMATION FOR SEQ ID NO: 201: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 
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GTTTGACAAT GAAACTGTAA AATTAGGTGA AACGTTGTCT AAAGATTTAT ATGGTAAGGA 14 4 0 

AATCAATGCC AGTGTATCCC GTTTTGAAGG TTATCAACAA TGCCCATTTA AACACTATGC 1500 

5 GTCACATGGT CTGAAACTAA ATGAGCGAAC GAAGTATGAA CTTCAAAACT TTGATTTAGG 156 0 

TGATATTTTC CATTCTGTTT TAAAATATAT ATCTGAACGT ATTAATGGCG ATTTTAAACA 162 0 

ATTAGACCTG AAAAAAATAA GACAATTAAC GAATGAAGCA TTGGAAGAAA TTTTACCTAA 1680 

10 

AGTTCAGTTT AATTTATTAA ATTCTTCAGC TTACTATCGT TATTTATCAA GACGCATTGG 174 0 

CGCTATTGTA GAAACAACAC TAAGCGCATT AAAATATCAA GGCACGTATT CAAAGTTTAT 18 00 

GCCAAAACAT TTTGAGACAA GTTTTAGAAG GAAACCAAGA ACAAATGACG AATTAATTGC 1860 

15 

ACAAACATTA ACGACAACTC AAGGTATTCC AATTAATATT AGAGGGCAAA TTGACCGTAT 1920 

CGATACGTAT ACAAAGAATG ATACAAGTTT TGTTAATATC ATTGACTATA AATCCTCTGA 19 80 

2Q AGGTAGTGCG ACACTTGATT TAACGAAAGT ATATTATGGT ATGCAAATGC AAATGATGAC 204 0 

ATACATGGAT ATCGTTTTAC AAAATAAACA ACGCCTTGGA TTAACAGATA TTGTGAAcCA 2100 

GGTGGaTTAT TATACTTCCA TGTACATGAA CCTAGAATTA AATTTAAATC ATGGTCTGAT 2160 

25 ATTGATGAAG ATAAACTAGA ACAAGATTTA ATTAAAAAGT TTAAGTTGAG TGGTTTAGTT 2220 

AATGCAGACC AAACTGTTAT TGATGCATTG GATATTCGTT TAGAACCTAA ATTCACTTCA 2 280 

GATATTGTAC CAGTTGGTTT GAATAAAGAT GGCTCTTTGA GTAAACGAGG CAGCCAAGTG 2 34 0 

30 GCAGATGAAG CAACGATTTA T AAATT CATC CAACATAACA AAGAGAATTT TATAGAAACA 24 00 

GCTTCAAATA TTATGGATGG ACATACTGAA GTTGCACCAT TAAAGTACAA ACAAAAATTG 24 60 

CCATGTGCTT TTTGTAGTTA TCAATCGGTA TGTCATGTAG ATGGCATGAT TGATAGTAAG 252 0 

35 

CGATATCGAA CTGTAGATGA AACAATAAAT CCAATTGAAG CAATTCAAAA TATTAACATT 2580 

AATGATGAAT TTGGGGGTGA GCAATAGATG ACAATTCCAG AGAAACCACA AGGCGTGATT 2 640 

TGGACTGACG CGCAATGGCA AAGTATTTAC GCAACTGGAC AAGATGTACT TGTTGCAGCC 2 700 

40 

GCGGCAGGTT CAGGTAAAAC AGCTGTACTA GTTGAGCGTA TTATCCAAAA GATTTTACGT 2 76 0 

GATGGCATTG ATGTCGATCG ACTTTTAGTC GTAACGTTTA CAAACTTAAG CGCACGTGAA 2 32 0 

ATGAAGCATC GTGTAGACCA ACGTATTCAA GAGGCATCGA TTGCTGATCC TGCAAATGCA 2 880 

45 

CACTTGAAAA ACCAACGCAT CAAAATTCAT CAAGCACAAA TATCTACACT CCATAGTTTT 2 94 0 

TGCTTGAAAT TAATTCAACA GCATTATGAT GTATTAAATA TTGACCCGAA CTTTAGAACA 3 000 

50 AG CAGTGAAG CTGAAAATAT TTTATTATTA GAACAAACGA TAGATGAGGT CATAGAACAA 3 060 

CATTACGATA TCCTTGATCC TGCTTTTATT GAATTAACAG AGCAATTGTC TTCAGATAGA 3120 
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TTTTAGCATC ATGAGCACCC GTAAAATATG ATCGCAATC 5 5 59 

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 
AAAT CAATCG AGTGGCATGT CAAGGTCATA TCAATATTTT AGAATCTGCG ACTATGAGAG 60 

AGGAAATAAA TGAAATTGCG CGACGTATCA TCGTTGATAT TCGTGATAAG CAATTACGAT 120 

ATCAAGATAT TGCTATTTTA TATCGTGATG AATCTTATGC TTATTTATTT GATTCCATAT 180 

TACCGCTTTA TAATATT CCT TATAATATTG ATACAAAGCG TTCGATGACA CATCATCCGG 240 

TCATGGAAAT GATTCGTTCA TTGATTGAAG TTATTCAATC TAATTGGCAA GTGAATCCAA 300 

TGCTACGCTT ATTGAAGACT GATGTGTTAA CGGCATCATA TCTAAAAAGT GCATACTTAG 3 60 

25 TTGATTTACT TGAAAATTTT GTACTTGAAC GTGGTATATA CGGTAAACGT TGGTTAGATG 4 20 

ATGAGCTATT TAATGTCGAA CATTTTAGCA AAATGGGGCG TAAAGCGCAT AAACTGACCG 4 80 

AAGATGAACG TAACACATTT GAACAAGTCG TTAAGTTAAA GAAAGATGTC ATTGATAAAA 54 0 

30 TTTTACATTT TGAAAAGCAA ATGTCACAAG CGGAAACTGT AAAAGATTTT GCAACTGCTT 600 

TTTATGAAAG TATGGAATAT TTCGAACTGC CAAATCAATT GATGACAGAG CGAGATGAAC 660 

TTGATTTAAA TGGTAATCAT GAAAAGGCGG AGGAAATTGA TCAAATATGG AATGGCTTAA 720 

TTCAAATCCT TGATGACTTA GTTCTAGTAT TTGGAGATGA ACCAATGTCG ATGGAACGTT 780 

TCTTAGAAGT ATTTGATATT GGTTTAGAAC AATTAGAATT TGTTATGATT CCGCAAACAT 84 0 

TGGACCAAGT AAGTATTGGT ACGATGGATT TGGCTAAAGT CGATAATAAG CAACATGTTT 900 

ACTTAGTAGG TATGAATGAT GGAACGATGC CACAACCAGT AmTGCGTCAA GCTTGATTAC 960 

AGATGAAGAA AAGAAATACT TTGAACAGCA GGCTAATGTC GAGTTAAGTC CAACATCAGA 1020 

TATTTTACAG ATGGATGAAG CATTTGTTTG TTATGTTGCT ATGACTAGAG CTAAGGGAGA 1080 

TGTTACATTT TCTTACAGTC TAATGGGATC AAGTGGTGAT GATAAGGAGA TCAGC CCATT 1140 

TTTAAATCAA ATTCAATCAT TGTTCAACCA ATTGGAAATT ACTAACATTC CTCAATACCA 1200 

50 TGAAGTTAAC CCATTGTCAC TAATGCAACA TGCTAAGCAA ACCAAAATTA CATTATTTGA 1260 

AGCATTGCGT GCTTGGTTAT ATGATGAAAT TGTGGCTGAT AGTTGGTTAG ATGCTTATCA 1320 
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CGATCACCAA ACTGCATGTC GAACAATGTA ACATTTGGAT TCGATATTTA AAATTGCTTG 37 8 0 

TGATGATAAA CTTTCTCATT TAGAAAACGC TTCCACGTAC ATTCAAAAAA ATAACTTTGT 384 0 

5 TAACCATATT GTAACATTAT TTCATATATT TTGGGGCATG AGAATGATTC TCACGCCCAG 3 90 0 

TAATTTATTT ATGCAATTGT TCATGTAGGT TCTTTGCGAC GTTTTCAGGA ATACCTATAT 3 96 0 

TTTTAAAATC TTCAAGTGTA GCTTCCTTCA TTTTCTTGAT TGAACCGAAT GAACGCAATA 4 02 0 

w 

ATAATGTTTT ACGTTTGTTA CCGATACCAT CT AT AT CATC AAGTATTGAT TTCAAGCCTG 4 0 80 

TCTTTTGACG TGTTTGTCTA TGAAATGTGA TTGCGAATCT GTGAACCTCA TCTTGGATAC 414 0 

GGTGCAACAA ATAAAATGCC TGGCTATTTT TCTTCAGTGG TACAATTTCT GCACTAGCGC 4 200 

15 

CATATAATAA TTCAGATGTT TGGTGTTTAT CATTTTTCTG CAAACCTGCA ACAGGGATAT 42 6 0 

CAAGACCTAA TTCGTTTTGT AGCACATCAA TAACCCCGTT CATATGTCCT TTACCACCAT 4 320 

CTACTATTAT TAAATCAGGT AATGGTAATC CTTCGTTTAA AACGCGAGAA TATCGTCGTC 4 3 80 

20 

TTACTACTTC TCTCATTGAT TTGTAATCAT CTGGACCTTT AACCGTTTTG ATTTTATACT 4 440 

TTCTATAATT TTTCTTATCT GGTTTACCGT CGACAAATGT AACCATTGCT GACACTGGAT 4 5 00 

25 CCACACCTTG AATATTAGAA TTATCGAATG CTTCAATTCT AATTGGTGTT TGAATTCCCA 4 5 60 

TTTGTGTTCC AAGTTCTTCA ATAGCTTTAA TCGTTCTGGA CTCATCACGT GATATTAATT 4 62 0 

CAAATTTATT ATTTAAGGAT ACTTTAGCGT TATGTGCAGC TAGGTCAACC ATATCTTTTT 4 6 80 

30 TGGGACCTCG CGCGGGTTGA ACGATTTTAG TGTCCACAAC AGATTGAATC ATTTCTTTAT 4 74 0 

CCAAATTACG TGGTACATGA ACTTCCTTAG GTAAAATATG TTGGTTTAAG CTATAAAATT 4 8 00 

GTCCAATAAA TGTATAAAAT TCTTCTTcTT CTGTTTGCTG TAATGGAATC ATCGTTGTAT 4 860 

35 CTCGCTTTAT CATATTACCT TGTCGTATAA AGAAAACTTG GATACACATC CATCCTTTAT 4 920 

CAACACTATA ACCAAAGACA TCACGAATCG TTTTATCTGA TGACATAATT TTTTGTTTGT 4 980 

TTGf CAGATT TTGAATATGT TGAATTAAAT CTCTATATTC TTTAGCCCGT TCAAAATCAA 504 0 

40 

GTGATTCACT TGCAGTTAAC ATTCGCTCTT CTAAACTTTT TAAAATTGTT TTGTCTTCCC 5100 

CATTCAGAAA ATCAGTAATT TCCTTCGTCA TTTGTGCGTA TTTACTCAAA TCAACGTCAT 5160 

ATACACATGG TCCTAAACAT TGTCCAATAT GGTAATAAAG ACATAATTTA TCTGGCATCT 5220 

45 

TATCACATTT GCGATATGGA TATATTCTGT CTAATAACTT TTTAGTTTCT TGAGCAGAAT 52 80 

ATGCATTCGG ATACGGTCCG AAATATTTGC CAGTACCTTG TTTTACAGTT CTCGTCACTA 5340 

50 GTAGTCTAGG ATATTTCTCC TTCGTAATTT TAATAAATGG ATAACTTTTA TCATCCTTTA 54 00 

ATAATATATT ATATCTTGGT TGATATTGTT TAATCAGATT CAATTCCAGT AAAAGTGATT 54 60 
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TAAATCTCAA TGATACCACC TAGTTTTACA TCTAACTCAT GTGGATCTTT ATGTGACAAA 1980 

TCAAGATATA CCATGTTTTC GCCATTTATA CCTAATTTTT GGTTAATACA TACATCGAAA 204 0 

5 ATTTCACGCG TTGCGATATC ACGAGGTACT AAGTTACCAT AATCAGGATA TTTCTCTTCT 210 0 

AAGAAGTACC AAGGCTTACC ATCTTTATAT GTCCAAATTC GTCCACCTTC ACCACGTGCT 2160 

GATTCACTCA TTAGTCGCAG TTTATCATCA CCAGGGATTG CAGTAGGATG AATTTGAATG 222 0 

10 AACTCACCAT TAGCATAAAT AGCGCCTTGT TGGTAAACAA TGGAAGCCGC TGATCCTGTA 22 80 

TTAATCATTG AGTTTGTTGT TTTACCGAAA ATAATACCAG GGCCACCCGT TGCCATAATA 2340 

ACTGCATCTG AACCAAATGT TTCAATCTCA GCAGTTGTCA TATTTTGTGC An CG AT AC CT 24 00 

CTTGCACTAT CATCGTCACC TTTAACTATG CCAAGGAATT CCCATCCTTC ATACTTCGTA 24 60 

ACTAATCCAT CTACTTCATA TGCACGAACT TGTTCATCCA ATGCATATAA TAATTGTTGT 252 0 

CCAGTTGTTG CCCCTGCATA TGCTGTTCTG TGATGTAATG TACCACCGAA ACGTCTAAAA 2580 

TCTAATAGAC CTTCATTTGT TCTATTGAAC ATTACGCCCA TACGGTCTAA TAAATGAATA 2640 

ATTTTAGGTG CTGCCTCTGT CATCGCTTTA ACAGGTGGTT GGTTTGCAAG GAAATCGCCA 27 00 

CCATACACTG TATCATCAAA GTGAATCCAA GGAGAATCGC CTTCCCCTTT AGTATTGACC 2760 

GCACCATTAA TGCCACCTTG GGCACAAACA GAGTGCGAAC GCTTTACTGG TACAACTGAG 2 820 

AACAAATCTA CATGTGCACC TTTTTCTGCC GCTTTAATTG TTGACATTAA GCCCGCTAGG 2 8 80 

30 CCACCTCCGA CAACAATAAG ATGTTTCTCT GCCATAAAAA TTTCACTCCC CTAAATTTTC 2 940 

AATCTATATT TGTTAAATGC GATGTATTAC ATAAAGGCAA TAATTGCAGT AACACCAATA 3 000 

TACGAAATAA CTAAAAATAC GATTAATGAA ACCCATGTAA ATACTCGTTG TGATTTTGGA 3 060 

35 GATTGAAGTC CACCCCAAGT AACTAAGAAT GACCATAAGC CATTTGCAAA GTGGAACACA 3120 

ACAGTAATAA TACAAATAAT ATAAAATATT GCCCATCCAG GATGTTGCAA TGTTTCGTGC 3180 

ATTAAATCGT AATTCACTTC TTTGC CGTAA AATGCTTTTT GTAAACGTGT TTGCCATAAA 3 24 0 

TGGATACCAA TAAAGATAAA TGTTAAGATA CCACTCACTC TTTGGAAGAA GAACATCCAG 3 300 

TTTCTAAAAA TCGAGTAATG TCCAACATTT TCTTTTGCTG TAAATGCAAT GTGTATACCA 33 60 

AACAAACCGT GATATAACAA CGGAATGTAT ATAAATAAAA ATTCTACAAT AATTAGAAAT 3420 

GGTAATGATT CCATAAAGTT AGATGCCTTA TTAAACGCTT CAGCACCTTG TGTTGCTTGG 34 80 

TGATTCACTA ATAAATGAAC GACCAAAAAT GCACCTATTG GGATAATACC TAATAACGAG 3540 

TGAATACGTC TTAGATAAAA TTCATTTTTT GATTGAGCCA AAAGGAGTCC CCCCTGTGAA 3600 

CGAATATTTA ATTTATTGAG CTATTTATAT TAAACGTACG CTTAACCCCC TAAAGTGATA 3 660 
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TTTAATTTTT AGTTTATCAT AACTAAGCAT TGGATTTTAG TATTATGCAC TGTGTTTACC 130 

ATTTTTGTCA TTATAATATT TATTTTAAAT CAGCCCACTA TCATATTGTC ATGTAATCTG 24 0 

5 CTTATTAAAA AAATCCCTTC CAAGTTATTG TGTATCTCCA TTCAATTTAA TTTTGAAAGG 3 00 

AACATAACwT TTTAACTCAA AAGGGATTAA TTTnTAnTCT ACTTCATGGT CTGAACCAAA 3 50 

GAATGATTTA AACATGTGGA ATGTTGTTTC TCTGTTCATT GCTGCAATGG ATGTTGTTAA 4 20 

10 

TGGAATACCT TTAGGGCAAG CATTAACACA GTTTTGTGAA TT AC CACACT GCTGTAAGCC 4 90 

ACCAGTACCC ATTAATGCAT TTAAACGTTC ATCTTTAGTC ATAGATCCTG TTGGGTGCAA 54 0 

ATTAAACAAA CGAACTTGCG AGATTGCTTG TGCACCAACG AaTTTATTAT TTTCAGTAAC 6 00 

15 

ATTAGGACAA ACCTCTAAAC ATACACCACA TGTCATACAT TTAGATAATT CATAAGCTGT 66 0 

TTGACGTTTT TTCTCTGGCA TACGTGGTCC CGGACCTAAA TCATACGTTC CATCAATTGG 720 

GATCCATGCT TTCATACGTT TTAAGTTATC GAACATTCTA GAACGATCAA CTTGTAAGTC 7 80 

20 

ACGGATAACT GGGAAAGTAT TCATTGGCTC TAAACGAATA GGTTGTTCTA ATTGATCAAC 84 0 

AAT CGCAGAA CAAGATTGTC TTGCACGACC ATTGATAACC ATAGAACATG CTCCACATAC 90 0 

25 TTCTTCTAAG CAGTTCATAT CCCAGACAAC AGGTGTTGTT TTTTCACCTT TAATATTAAC 96 0 

TGGGTTACGT CTAATTTCCA TTAAACAAGC AATGACGTTT AAATTTTCAC GATATGGAAT 102 0 

TTCAAATGTT TCTTCATAAG GCTTAGAATC ACTTGTATCT TGTCGTTTAA TAATTAATTT 10 8 0 

30 TACTGTTTTT TGTTTCGGTT TAGATTGTGT TTCATGTTGT GGAGTGTTTT TCACTGATTG 114 0 

TTCAGTCATT ATTTTTTACC CCCTTTAGAC TTACTTGTGT AATCACGTTT ACGAGGTGGT 120 0 

ATTAAACTCA CATCGACGTC ATCATAAGTA AACTGCGGTT TTTCAAATGC GCCTTGGAAT 126 0 

GAGGCCATTG TCGTTTTTAA CCACTCTTCA TCATTACGCT CTGGGAATTC TGGTTTATAA 132 0 

TGGGCACCGC GTGATTCGTT ACGGTTATAT GCACCAATCG TAATAACACG TGCAAGTACT 13 8 0 

AACATGTTCC ATAGTTGACG GGTAAAGAAT ACCGCTTGGT TACTCCAAGT TTGAGTATCT 14 4 0 

40 

TCCATATCAA TATCTTCATA ACGTTTCATC AATTCAACAA TCTTTTTATC TGTTTCTAAC 15 0 0 

AGTTTTTCAT TTTCACGAAC AACAGTTACA TTTGCTGTCA TAATTTCACC AAGTTCACGG 156 0 

TGTAATTTAT ATGCATTTTC TGTACCGCGC AT AG CT AAT A ATTTATCAAA ACGTTCTTGC 16 2 0 

45 

TCTTCAGCTT TACGCTTTTC AAAAATACTT TCGTCCATAT CAGTATATGA TCGATCAATA 16 8 0 

TTTGAAATAT AATCAATCGC GTTTGGACCT GCTACTGTAC CACCATAAAT CGCTGATAAC 174 0 

50 AATGAATTGG CACCTAAGCG GTTACCACCA TGTTGAGAGA AGTCACATTC TCCAGCTGCA 180 0 

AATAACCCTT TAATATTTGT CATTTGATCA TAATCTACAT AT AG AC CAC C CATTGAATAG 18 60 
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TATTGAGAAA AAGAAGGGTG ATAATATTAT GGGATTCAAA AACAATTTAA CATCAAATTT 1080 

AACAAATAAA ATCGGTAATT CAGTCTTTAA AATAGAAAAT GTTGACGGAA AAGGTGCAAT 114 0 

5 GCCAACGACG ATTCAAGAAT TGAGAGAAAG ACGACAACGT GCTGAAGCAA TTGTAAAGAG 1200 

AAAGTCTTTA ATGTCATCAA CAATGAGCGT TGTTCCAATT CCGGGTTTAG ATTTTGGTGT 1260 

TGATTTAAAA TTAATGAAAG ATATTATCGA AGATGTTAAT AAAATTTATG GTTTAGATCA 1320 

w 

TAAGCAAGTT AATAGCCTTG GGGATGATGT GAAAGAAAGA ATTATGTCTG CAGCAGCAAT 13 80 

TCAAGGTAGT CAATTTATTG GTAAAAGAAT TTCAAATGCA TTTTTAAAAA TTGTAATTAG 144 0 

AGATGTAGCT AAACGTACTG CTGCAAAACa AACAAAATGG TTTCCTGTTG TAGGACAAGC 1500 

15 

TGTGTCTGCA TCTATTAGTT ACTATTTTAT GAATAAAATT GGAAAAGATC ACATTCAAAA 1560 

ATGCGAAAAT GTTATTAAAA ATGTCATGTA GGTGCTATAA TAGTTTTGCA ATTTGCAAAT 162 0 

2Q TTTACTGAAA CCGGTTTTAA ACGAATTGAA TTTAAAGcAT GGTTTTGGTA AAGTTAATGT 1680 

ATAAAACTAA GTTAGyATTG TAATAATATk GAAGATTCTA ACTATACGAA GGAGAAATGT 1740 

AATTATGGAA CAAAATTCAT ATGTAATCAT CGACGAGAmT GGTATTCACG CTAGACCAGC 1800 

25 AACAATGTTA GTACAAACAG CTTCAAAATT CGATTCTGAT ATTCAATTAG AATATAACGG 1860 

TAAGAAAGTA AACTTAAAAT CAATCATGGG TGTTATGAGC CTTGGTGTTG GTAAAGATGC 1920 

TGAAATTACA ATTTATGCTG ACGGTAGTGA TGAATCTGAC GCCATTCAAG CAATCAGTGA 1980 

30 CGTCTTATCA AAAGAAGGAT TGACTAAATA ATCATGTCTA AATTAATTAA AGGTATTGCC 204 0 

GCATCTGATG GTGTCG CAAT TGCTAAAGCT TATTTATTAG TTGAGCCAGA CTTAACATTC 2100 

GACAAAAATG AAAAAGTCAC TGATGTTGAA GGAGAAGTTG CAAAGTTCAA TAGCGCTATC 2160 

35 

GAAGCTTCTA AAGTTGAGTT AACTAAAATT AGAAATAATG CAGAGGTTCA ACTAGGTGCT 2220 

GATAAAGCTG CTATCTTTGA TGCAcaTTGG GGGGTGGTAG ATGACCCTGA ATTAATTCAA 2280 

CCAATCCAAG ATAAGATTAA AAATGAAA 23 08 

40 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5559 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION; SEQ ID NO: 199: 

AAGTAATAAA TCGTCTCATT TGGCAACTGA CGCATAATTT CTTTAGCTAC TGTCAAACCT 60 
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ACCATAATAT GAATGGCTTC AGGATCAAAA TAAAGACCAA CTTCACTGCC TACTTCAGCT 120 

TTTTTAGTCG TTTGTATTAC CCATTCATAA CCTTTATTGT CTATACAACA TATTTCATAG 180 

TGGACCCCTC TAAATAACAT AGAATCAACA GTTGCTTTAA ATAATCCTTC TTCAGCTTTG 24 0 

ATTAATGATA TATCTTCTGG TCGAATAACG ACTTCTACTT TTTTATTTTC AGGAATACCC 3 00 

ATATCGACAC ATTCGAAATC TTGCCCATAA ATATTCACGA CATAATCTCT AACCATGCGC 360 

CCTTCAACAA TATTAGATTC TCCAATAAAA TCAGCTACAA ATCGATTCAC TGGTTCGTCA 4 20 

TaTATATCTG TTGGTGTGCC AAATTGTTGA AT 4 52 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

TAGGTTGGGT TCTAACATAC GATAAAGCTC AACAAATCAA CACAGCTTTC TTTGTAAAAT 60 

TGTTTAATAC TGCATTAGCA GAACGTGATT ATTATTTTAA TATAGATGGA ACAAATGCTT 120 

TTAGATTATT TAATGCTGAA GGTGATGGTG TTGGGGGATT AACAATCGAC AATTACGATG 180 

GTCATTTGTT GATTCAATGG TACTCAAAAG GTATTTATAA ATTTAAATAT GCCATTCTTG 2 40 

AAGCGGTTAG AAAAGTATTT GATTATAAAT CTATTTACGA AAAAGTAAGA TTTAAAGACA 3 00 

GCGAATATAG TGGTGGTTTT GTTGAAGGAG ATGCACCTGa GTTTCCAATT GTTATCGAAG 3 60 

AAAACTTCAC ATTTTATAAT GTAGACCTTG AAGATGGTTT GATGACAGGT AT CTTTTT AG 4 20 

ATCAAAAAGA AGTGCGCAAG AaATTAAGGG ATCAATATGC CAAAGAACGC CATGTTTTAA 4 80 

ACTTATTTAG TTATACAGGT GCTTTTTCTG CAATAGCAGC AAGTGAGGCA TCTTCAACAA 54 0 

CAAGTGTAGA TTTGGCTAAT CGTTCTCGTA GTTTAACTGA AGAAAATTTT GGATTAAATG 60 0 

CTATTGATCC TAAATCCCAA TATATTTATG TCATGGACAC TTTTGATTTC TATAAATATG 660 

CTGCACGACA TGGACATAGT TATGACACGA TCGTGATTGA TCCACCTAGC TTTGCGCGTA 72 0 

ACAAAAAACG TACATTTTCA GTGCAAAAAG ATTATGACAA ATTAATTAAT GGCGCCTTAA 780 

ATATCTTATC ATCTGAAGGA ACATTATTGT TATGTACAAA CGCAAGTGTA TATCCATTAA 84 0 

AGCAATTTAA AAATACTATT AAAAAGACGC TTGAAGAGAG TGGCGTTGAT TATGAATTAA 900 

CTGAAGTTAT GGGATTACCA AAAGATTTTA AAACGCATCC ACATTATAAG CCATCTAAAT 960 
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TAGTTGTTTT TGCGCAGGTG GTTCTGATTC AATACTTTCA ACAAATGTAA TTGGACCTTC 13 630 
TAACAGTCTT ATAATATCCC CTGCTGAGAT TTCTT 13715 
(2) INFORMATION FOR SEQ ID NO: 196: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196; 

AAATC CATAA TGTCATGATA ATCTGCATAT GCTTCATATA ATTCAATCAT TGTGAATTCA 60 

GGGTTATGTC TAGTTGATAC ACCTTCATTA CGGAATACTC TACCAATTTC ATATACTTTT 120 

TCAAGTCCAC CGACAATTAm ACGTTTTAAA TGCAACyCAA TAGCAATACG CATGT AT Aa C 180 

GTTGCATCTA ATGCATTATG ATGTGTTACA AATGGTCTAG CAGCTGCTCC ACCAGCAATT 240 

TGGTGCATCA TAGGTGTTTC TACTTCCAAG AAACCTTTAT TATTTAAATA ATTACGCATT 300 

25 TCTTGAATGA TTTTACTACG ATTAATAAAT GTACGAGTGC TATCTTCGTT CGTAATTAAA 360 

TCTAAATATC TTTGACGATA tCTCTGTTCA ATATCCTGTA AACCGTGGAA TTTATCCGGT 420 

AATGGTCGCA ATGATTTAGT TAGTAGCGTG AATTTCTTCG CTTTAACCGA TAATTCGCCA 4 80 

30 GTATTTGTTT TGAACATTAC ACCTTCAACA CCAACGATAT CGCCTAAATC AGCATTTTTC 540 

CATAAATCAA ATTCGTCATC GCCAACTTGA TCTTTACGAA CGTAAATTTG AATTTGTCCA 600 

GCTAAGTCCT GAACGTGTGC AAATC CTGCT TTACCTTTAC CACGCTTAGT CATTAATCGT 660 

CCAGCTATAG CGACATGACT ATCCG CTTCT TTTTCTACCA ATTCTTCTTT AGAATACTGG 720 

TCCGACTCTT CTTTCAAATC ACTAGATAAA CCTGAACGGT CAAATTTAGA ACCAAACGGG 780 

TCTATACCAA GAT CAT AT AA TTCTTGTAAT TtTTGACGTC GAACCAACAT TTGGTCATTC 840 

ATTTCTTCTG ACATAACTtT CTCTCCTTTA ACT 873 
(2) INFORMATION FOR SEQ ID NO: 197: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
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GTCCCTATTC GAGAAGGTGA AGATGAACAA 
CAACATTTAG ACGAATTATC ATATGAAAGA 

5 AACCTAGTAA GTTCAGCAAC TGCTTTATTA 

ATACGTGTAG GTTCTGGAGG CATCATGTTA 
CAATTTGGCA CGATGGCAAC ATTATTTCCA 

10 CCTGGAACAG ATATGATGAC CGcAAGTGCA 

AAATTTCCAG AAGAGGTTTC ATTATTACAA 
TATGTTCGTG CTTATCCAGC AGTAGGTAAA 

75 

ACAGATTCTG CACATTTAGC TGCTCGCAAA 
GCACCTCAAC AAATGAAAGA AGCTATCGAA 
GTATTAGACG AACCTTATGT TATTGTATGT 

20 

GAAGCACAAT ATTTAGCTTC ATCTATGGCA 
ATGCAGCCCG TTCAACCGCC AACACATGAA 

25 GCGATGGCTA TGGAAAGACA GAAAATATCA 

AAAATTCAAG ATTTTATGGA AACTTATGGT 
ATTTATGATA AAGATATGCA ATTAGACTCT 

30 ATAAATGAAA AAAACACTTT ATAATGTGAT 

ATATTAATAA AAATATACAG TAACAGCATT 
ATTTATTTTA GTAATGATTC AAATTCACTT 

35 TTCAcTTTAC ACTGTGTAAG TGTTTATTTG 

ATCCSAAATT taaagaaatt tatttctctt 

TCGATCCTGT ATATTCATTT ATCTTATCTT 

40 

TTTTAAATTT TTCTACGCGT CTTTAGCTTT 

TATATTTGAT TAGTTTAATT GAACTTATTT 

GTTTTTAGCA TTTTCAACAA CTTCTTTTGC 

45 

TTTTTCTAAT TCTTTGTTAT CAGTAACGTT 

TTGATCGAAC TtACTTTCGT CTGCCATAGT 

5Q CCACTAAGGA GGTTCGCTmm mCAyymyAAT 

ATACGATTAA ATATAAAACA TGTATCCGTC 
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ACAGCAATTA ATAATATGGT TAATCTCGCA 118 80 

TATTGGATTG CTGAACACCA TAACGCTCCC 11940 

ATTCAACATA CGTTAGAACA TACGAAACAC 12 000 

CCTAATCATG CTCCATTAAT CGTTGCGGAA 12 060 

AATCGTGTCG ATTTAGGATT AGGACGTGCA 12120 

TTAAGACGAG ATCAACATGA TGGTGTTTAT 12180 

CAATATTTCG GCCCTGCTCA CCAACAAGCA 1224 0 

AATGTGCCTT TATACATTCT TGGTTCTTCA 12300 

GGGCTTCCAT ATGTGTTCGC TGGACATTTT 123 60 

ATTTACAAAA CGTTATTTGA ACCTTCTGAT 12420 

TTAAATACAA TCGTTGCTGA AAATGATGAC 12480 

CAAGTAATGG TTAGTATCAC TCGTGGCAGA 12540 

CTACAAAATA TATTAACGCC GAGAGAATAC 12 600 

TTAATAGGTT CAGAAAATAC TGTTCAACAA 12660 

GAAGTCAACG AAATTATGGC AATAAGTTAT 12720 

TATCGTCGGT TCAAGAATGT TATAAATCAG 1278 0 

AAATAAACTA AGTGAAAGTA TGTATCCATA 12 840 

TTGAATGAAA G ATGT CTTTA TTGTTCAATC 12 900 

AAAATyCTAA tGCAAATATG AAAGCGCCCC 12 960 

ATGGGGCGCT TTCAAAATAT TGAAAAGCAT 13 020 

TATCTTCATT TTCTTTTTTC TCTTCGTTAT 13 080 

TTACATTTTT AACTTGTTCA TTATCGCTAT 1314 0 

ATCCATAAAA CTCATATTAA TCGCTCCTCT 13 200 

TTTAAGTTTA TCAATTGCAT CAGTTATTTT 13 260 

TTTaCCAGTC GCTTTATCTT GCTGACCTTC 13 320 

ACCTACTGTT TCTTTAACAT TTCCTTTAAA 13 38 0 

GAAACCTCCT TGGATGTATA TATTTATATA 13440 

ATGAAGTTTT TATGTTATAG TATAGTATTT 13 500 

TAAATCTTCA CTTGTATCTA CATATTCCGC 13 560 
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TTTCTTTATT ACACATAACG TCTGGATTTG GAGTACGACC TTTTTTGTAT TCATCTAAGA 100 80 

AATACGTAAA GACTTTATCC CAATATTCTT TTTCAAAATT AACAGCGTAA TACGGAATGC 10140 

5 CAATTTGATT ACACACTTCA ATAACATCGT TGTAATCTTC AGTTGCAGTA CATACGCCAT 10200 

TTTCGTCAGT GTCATCCCAG TTTTTCATAA ATATGCCAAT GACATCATAA CCTTGTTCTT 1026 0 

TTAAGACGTG GGCTGTTACA GAACTATCTA CACCGCCTGA CATACCAACG ACAACACGTA 10320 

10 TATCTTTATT TGACAATTAT GACTCCTCCT TAAATTTAAA ATATATTTTA TGAATTTCAG 103 80 

CTACAATTGC ATTAATTTCA TTTTCAGTAG TCAATTCGTT AAAACTAAAT CGAATCGAAT 10440 

GATTTGATCG CTCCTCATCT TCGAACATTG CATCTAAAAC ATGCGACGGT TGTGTAGAGC 10500 

CTGCTGTACA TGCAGATCCA GACGACACAT AGATTTGTGC CATATCCAAC AATGTTAACA 10560 

TCGTTTCAAC TTCAACAAAC GGAAAATATA GATTTACAAT ATGGCCTGTA GCATCCGTCA 10620 

TTGAACCATT TAATTCAAAT GGAATCGCTC TTTCTTGTAA TTTAACTAAA AATTGTTCTT 106 80 

TTAAATTCAT TAAATGAATA TTGTTATCGT CTCGATTCTT TTCTGCTAAT TGTAATGCTT 10740 

TAGCCATCCC AACAATTTGC GCAAGATTTT CAGTGCCTGC ACGGCGTTTC AATTCTTGTT 10300 

CACCGCCAAG TTGAGGATAA TCTAGTGTAA CATGGTCTTT AACTAGTAAT GCACCGACAC 10860 

CTTTTGGTCC GCCAAACTTA TGAGCAGTAA TACTCATTGC GTCGATCTCA AATTCGTCAA 1092 0 

ACTTAACATC AAGATGTCCA ATTGCTTGAA CCGCATCAAC ATGGAAATAT GCATTTGTCT 10 980 

CAGCAATAAT ATCTTGAATA TCATAAATTT GTtGCACTGT GCCAaCTTCA TTATTTACAA 11040 

ACATraTAGa TACTAAAATC GTCTTATCTG tAATTGTTTC TTCAAGTTGA TCTAAATCAA 11100 

TAGCACCTGT ATCATCAACA TCTAGATATG TTACATCAAA ACCTTCTCGC TCTAATTGTT 11160 

CAAAAACATG TAACACAGAA TGATGTTCAA TCTTCGATGT GATAATGTGA TTACCCAATT 11220 

GTTC&TTTGC TTTTACTATG CCTTTAATTG CCGTATTATT CGATTCTGTT GCGCCACTCG 11280 

TAAATATAAT TTCATGTGTA TCTGCACCAA GTAATTGTGC AATTTGACGT CTTGACTCAT 11340 

CTAAATATTT ACGCGCATCT CTTCCCTTAG CATGTATTGA TGATGGATTA CCATAATGCG 11400 

AATTGTAAAT CGTCATCATC GCATCTACTA CTTCAGGTTT TACTGGTGTG GTCGCAGCAT 11460 

AATCTGCATA AATTTCCATG TTTGGACACT CCTCACAATT TTATCAATGT TCCAATAATA 11520 

GCACCTTACA TACTATTTTT CTACTTTTCT GTTTAACTTT ATTTATAATG TTTTTAATTA 11580 

TATTTT AC CA TTTTCTACAC ATGCTTTTCG ATAGGCTTTT TTAAGTTTAT CGCTTTATTC 11640 

TTGTCTTTTT TATAAATTTT AGTATTTGCA GATATTTTTT TATTTGTAAA ATGTAACGTA 11700 

CTATTATTTT GGTTATGAGC AATTTAATAT TTATCTGGTT ATTCGATTGG TATACTTCTT 11760 
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AAGCCCAAAT TGTATCTTGC ATCAACATGA TTTTTATCAA TCGTTAATAC ATGTTTAAGT 828 0 

TGAGTTATGG CTTCATTAAA CATTTCTAAT TGACATAATA CAAGACCATA TTGAAATTGA 8 34 0 

ACTTCTGCAT CTTTGTCTTT ATCTAGTTCC GCAGCAGTCA TTAAATACGG CAATGCCAAG 84 0 0 

CTTAAATGAT TCTAACTGAT TAAACGCCAT ACCGATCATA TAATTACAAT CAACTTGTTC 84 6 0 

AATCTCTGTT TGTAATGCTT GTTGATATAA TTTAATAGCT TCTTGATAAC GTTGCTGATT 8 52 0 

ATAATATACA TTTGCTAGAT TAAAAAATAC GACGCCATTC TTCGGATCTA TTGTnAAAGC 8 58 0 

TTTTTGGAAA AAACGCTCTG CCTTTyCAAy CyCATTCgCA TCAGCAAGTA CGATmCCaGC 8 64 0 

ATTAATATAA TTTTCAATAA TTGTAGGATT TTCTTCGATA TTTCCGAACA ATGCTTGTAA 8 700 

CGCTTCTTCT ATTTTTCCAT TTTGTATGTA TTGATAAATT GTTTGTTGAT CTATCATTTA 8760 

CGAACCTCAT TTCTCATCAA TTATAACATC TTGATAAATT GTATGTCTCG AATCACTTAA 8 820 

CAACGAATAA AATATAATCT AATATCATCT TCATTCATGA AAAAGCGGGA ATGGAATAGA 8 88 0 

AATGCTTAAG AACCATTAAC GGTTTATTAT GTAATGGTTC TTCCACATTA GCCACCACTA 8 94 0 

TTATGTACTT AAAAATAAGA ATACATAATT AGATTCATGC ATAGGGAGTG GGACAGAAAT 9000 

25 GATATTTTAA CAAAATTAAA TTCGTTATCC CCAACTGGCA TTGCCTGTAG AATTTCTTTA 9060 

CGAAATTCTC TATGTTGTGG TCCCGCCAAT ATAACATTGT AGAGCCTAGG ACATTGTGAT 912 0 

GTCCCAGACT CTATCCTCAT GAATTATTCT CATCAAAAAC TGTCTTTCGT CATTTTCAAC 9180 

GTTGAAACTT CAAATAAGTA ATTTATTGTT GCCATTGTTT ATACAACATA ATTTAATTGA 924 0 

CCTTCATTTT TGAACACATC GTCAATTGTT GCACCACCAA GACACACATC ACCTTGATAA 9 3 00 

AAAACAACTG CTTGTCCAGG TGTGATTGCT CTTACTGGCT CAGCAAAAGT AACACGTAGg 93 60 

CAtGGtCGTT TTCACGTTTC ACAAAAACTT TCGTATCTTT TTGGCGATAT CTAAATTTAG 9420 

CTGtACATTC AAAACCTTGA TCTAAGTCAT TATCTTCTGG ATTTACAAAT GAATAGTCTG 94 80 

AAGCAATTAA GTAATCACTG TATAATGCAT CGTGATGGAA TCCTTGTTCT ACATATAAAA 954 0 

CATTATCTTT TAGGTTTTTA CCGACAACAA ACCAAGGATC GCCATCTCCA CCTATACCTA 96 00 

ATCCATGTCT TTGTCCTATT GTGTAATACA TCAAACCACT ATGTTT AC CC ATTTTCTTAC 96 6 0 

CATCAAGTGT TAT CAT AT CA CCCGGTTGTG CAGGTAAATA TTGTGATAAA AATGTTTTAA 972 0 

AGTTTTTTTC GCCGATAAAA CAAATGCCTG TAGAATCTTT TTTCTTAGCA GTAACAAGTC 97 80 

CTTGTTCTTC AGCAATTCGA CGCACTTCAC TCTTTTCGAT GTCGCCAATT GGGaACATCA 984 0 

50 CTTTTGAAAG TTGTTGTTGA GATAATTGAT TCAAGAAGTA TGTTTGATCT TTATTATTAT 9900 

CTACACCACG TAACATTTCA ACATGACCAT CTTCATGACG ATGTATGCGT GCGTAATGTC ^ q ^ n 
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CTACCTTCTC AACAACTGTT GGTATTTGGT TTGCCTGACA GTTAATAAAA CTTCTATCAT 64 80 

GAAAACGCTG TGTAATATCA ATTTTCTGAC CCAACTTCAT TCGATGTGCT AATTCTATAA 6 54 0 

TGCTTGAACC ATCTTGTTGA CGATATACTT CAGTCAGATT TACTCGTGGT ATAGCTTTCG 66 00 

ATTCAATTAA ATCTTTAAAT ACTTGACCAG GACCTACAGA AGGCAATTGG TCCTCATCAC 6660 

CTACAAATAT CAATTGTGCA TCTAAAGGAA CTGCACTTAA AAATTGGTGG AACAACCAAG 67 20 

TATCTACCAT AGACATCTCA TCAATGATTA TGAGTCGTGC GTTTATTTCA TTTTCTAATA 6 780 

TATCCTCTGG CTTTGTGTCT TGATTCCAAC CTATTAAACG ATGAATCGTC ATTGCTTCTA 6 840 

ATCCAGTTGA CTCTTGTAGT CTCTTAGACG CTCTTCCTGT TGGCGCTGCT AATACAACTG 6900 

GATAATCATC ATTGACATAA TCATCATAAT CTAATGATAA GCCATGAATC TCAGCATATA 6960 

ATTCAACAAT ACCTTTAATT ACTGTCGTTT TTCCTGTTCC CGGTCCACCG GTTAATAGCA 7020 

TCACCTTAGA ATTGATAGCC GTTTGCAAAG CTTCTTTTTG TGAAGCTGCA TAGTTCACTT 7 080 

GATTCGCATC TTCTATTTCA CCAATATGCA TTTGTAAATC TGACTGTTCA ATTTCTGTAA 7140 

GTTTATTTGT ATGCGTCTTT ATTCTGAATA AGTTTTGAAC ACTTTTGATT TCaGAATAAT 7200 

ACAAACTTGG AATTGCAACT TGTTCaTTGT CAATAATTAG TCGTTTTTCC T CATTTAAGT 7260 

ATTGCAACAT TTCGTCTAAT TTTTCAGGTT CGATGACCTC TTCATCTTGa TAATTTAATA 7320 

CATCAACCGT TAAATCTATA ACAACATTGA TAGGCAAATA TGTATGTCCC TGTTTAATAC 73 8 0 

30 ATTCTTCTTC TAACGTATAG AGCAACGCAG CTTTTAATCG TTCATTATCG TTATAAGCGA 7440 

TACCAATATT TCTAGCAAGT TGATCTGCTT TATTAAAACC AATACCTTTA ATATCATAAA 7 500 

TCAATTGATA TGGATTTCGA TCTAAAATAG TCAGTGTATC GCCGAGATAA AACTGATAAA 7560 

35 TTGCCATTGA AAGTTTAGGA CCAAACCCTA AATCATGTAA ACGAATCATT ATTTTTTCAG 7620 

ATTCTTGATT TGCTGAAATT TGTTCTGCAA TTTGTTTCTG TTTCTTTTTA GATAATCCCG 76 bo 

AAACTTTTTC TAGCACTGAA TGGTCATCTA ATATATCATT TATCGCATTG TCACCTAATG 7740 

TATTAACAAT ATTTTGAGCT GTCTTTTTAC CTACACCTTT AAACAAATCA CTAGATAAAT 7800 

AACTTATAAT TGCTTCTTTC GTTTGTGGCA TTTCTTTTTC AAAAGTCTCT GCTTTTAATT 7860 

GTTTACCATA ACGTGGATGA TCAACAACTT GCCCTTTAAA TGTGTAGACA TCGCCTTCAA 7920 

CAATATTCGG AAGAAACCCT ACAACAGTTG GCATTGTATC AAAGTCTTCA TTTGTTTCAA 7 980 

TAGTATCTAC TTTAAGCACT GTATAAAAAT TATCACTGTT TTGAAACAAT ATCGCTTCAA 8 04 0 

CAGTACCTTT GATCATTGAA TAATCAAATA GTGTAGGGTC TGACATGTTA CTCCTCCTCT 8100 

TTCATTTTAG TGAATGTTTT CAGCGCATGC TGACTTAATA AGTGTTTAGG GTCGATAGTC 8160 
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TCGATACTTT ATCATCAACA TTACTTGCAA GAATGATAAT TGTATCTTGT AGTTTAGATT 2 8 80 

TAAAATCGTC CATTGTCGAG CGAATTGCTT TCGCATTTGG TACATCCACT TCAGTAACCA 2 940 

ATACTTTATA GCCATTGATT TCTTCAACTT GATCTTCAAT ATTACCCATT TTAAGTGATG 3 000 

TGATTTCTTT GTCACGTTGC TCTAATTGTT TTAATAATGC TTTTTCTTCA TCTTGTAATT 3 060 

GTGTTAACTT ATCGACTACT TGATCATCAG ATTTCACTTT CAGCTGTGAT TTCATCGTAT 312 0 

TAAATTTCTC TTGAATATCT TCTAAATATA AGAAAGCTGC TTTACCTGTT AATGCTTCAA 3180 

TACGACGCAC ACCAGCTCCT GTACCTGACT CACTTACTAT TTTGAATAAG CCAATTTCAG 3240 

AAGTATTGCG GACATGAATA CCACCACATA ATTCAATTGA AAATGGTGCC ATATTTACTA 3300 

CACGCACAAC ATCACCATAT TTTTCACCGA ATAATGCCAT TGCGCCCATT TCTTTAGcTG 3360 

AAGCAATATC CATTTCTTGA ATGTTAACGT CAATACCTTT CCAAATTTCT TCATTTACTA 3420 

AGCGTTCAAC TTGATCAATT TCATCATTAG TCATTGGACC AAAATGAGAG AAATCAAAAC 34 80 

GTAAACG AT C TGCTTCTACT AGTGAACCAG CTTGGTTAAC ATGATCACCC AGTACTGATT 354 0 

TCAACGCTGC ATGTAATAAA TGTGTTGCAC TATGGTTCTT TTGAATGTCA CGTCGATCAT 3600 

TTTGGTTCAC TTCAGCAGAC ACTGTAGCGC CAACATTTAC TTGGCCAAAT TGTACTACTC 3660 

CTTTATGCAA GTTTTGACCA TTTGGTGCTT TGGTTACTTC ACTAACAGCA ATTTCAAAAT 3720 

TGTCATTATA AACAATACCT GTATCCGCAA CTTGTCCACC ACTGATTGCA TAAAATGGTG 3780 

TTTCCGTTAA CATGAAGTAT ACTGTTTCAC CCGCTTCAAC TTGTGAAACT TCTTCACCAT 384 0 

TGTATATCAA GTGTGTTAGT GTTGTTTGAG ctGTCGCAGT ATCATAACCA ACAAAAGTAC 3 900 

TTGCAGATGT AATATTTTTC AATACTTCAC TTTGAACTTG CATTGATTGA GAATTTTGAC 3960 

35 GTGCTTGACG TGCACGATCA CGTTGTTGTT GCATTTCTGA CTCGAATGTT GTCATATCAA 4020 

CTTJCAATCC TGCTTGCACT GCTATTTCTT CAGTTAATTC AATTGGGAAC CCATACGTAT 4080 

CATACAATTT AAATGCATCT TTCCCATTAA TTTCATTTGT TGTCGCTTTA GCTTTTTTAA 4140 

TTAATTCATT TAAAATCGCT AAACCATCTT CTAATGTTTC ATGGAATCGT TCTTCTTCAG 4200 

ACTTTATAAC ACGCTTAATG AAATCTGCTT TTTCCTTAAC ATTTGGATAA TATGGTTCCA 4260 

TAATGTCTGC AACAATATCA ACAAGTTTGT ACATAAATGG CTCATTGATT CCTAACGTTT 4320 

GACTAAAACG AACGGCACGA CGTAACAATC GACGTAATAC ATACCCTCTA CCTTCATTGG 43 80 

CAGGTAATGC ACCATCAGAA ATTGCAAATG CAATCGTACG AATGTGGTCA GCAATTACTT 444 0 

TAAATGCCAC ATCTTGTTCG TTGTTTACTA AATATTGTTT ACCTGATACT TTTTCGATTT 4 500 

CATTCATTAT AGGCATAAAT AAATCTGTTT CATAGTTAGT ACGTACATTT TGAGAAACTG 4560 
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CATTACGCTC TATCGTTGTG ACATGAATGT CATCAGATAT AGAAGCGAAT TGCATAGAAC 10 80 

TATAGCCGAT TGCTGTACCA ATTTCTAAAA TATTTTTAAC ATTATTCATA CGAATTAATT 114 0 

GCTTAATTAA ATCTAATGTT AAACGATCTA CAATTGGCAC TTCATTTAC C TCGGCAAATT 12 00 

CACGCAAAAC TTCGATTGAA CTATTTTGAT GTTGATGTAA ATCTATTAAA TATTTTTTAT 12 60 

TTAGGTCATC CATGTTTTAA ACTTCCTTTA TGTAAAATAA GTCAATATGA TTATGACAAT 13 20 

AAAATAAATC AGCCTTCACA ATTGATTATA ATTTTGCCAA CCAATTAAAT GACTGATTTC 13 8 0 

GTGTTAGACG CAAAGCTATT TTATTTATAG AAGCGAATCA TT CAT AT AAA ATTTAACTTT 144 0 

AGATATTTTA C CAT ATTTTC AATAAAATTA TAAGCGTTAA TTATTTATAC ATTGCTTGAC 150 0 

TTAAAAAATA CTCTTGCCTC CCCATCTTTA AGGTTAGCAA GAGTAAAATC TTTTTAATTA 156 0 

TTCTTCCATT TCAGTATTTA CAACTTCTTC AATCATGTCC CATTCTTCAT CAGTTTCGAT 16 20 

TGGTACTAAC TTACCACCGT CACCTGACTC ATCTGGTTCA TTG AT CATTG GTACAAGCTC 16 8 0 

AATCATATCG TCTTCATCTG ATTGAGCACC TTCTTCAGCT AAGATAACAT ACTCTTTTTT 174 0 

GAATTCAGGA TGATAAAATT CTAAAACTTT TCGGTATAAA ACTTCATTTC CCTCTTCATC 18 00 

25 GAATAAAGTT AATAATTCTT CTTCGTTATT AATTTCTAGT TGTGAATCAT GATTATGTTC 18 60 

AGT CATAGTA AAATCTCCTT TTAATGTAGT GAATCTAAAT AGCCTTGTAA AATAAATACC 192 0 

GCTGCCATTT TATCAATCAC TTGTTTTCTT TTTTGTCTTG AAACATCTGC TTCTAATAAT 19 8 0 

30 GATCGTTCAG CAGCCATTGT GCTTAATCTT TCATCCCACA TCACAATCTC AATAGAAGGA 2 04 0 

TAAGCTTCTA ATAATTTTTC TTTATATGTT AACGAAGCTT CGCCTCGAAA TCCTATTGAA 2100 

TTATTCATGT TTTTAGGTAG TCCTATTACG ACTGTACCCA CATTATGTTT TTTAATAATG 2160 

TCTACTAATT GGTCAATACC TAATTCATTA TTTTCTTCAT TGATTCGGAG TGTGTCTAAT 2220 

CCTTGTGCCG TCCAACCCAT TATATCACTA ATTGCAATTC CTACCGTTCT ACTACCGACA 22 80 

TCGAGTCCTA AAATTTTATG TTGTAACATA AATTATTTAT TTTGCTCTTT TAAATAGTAA 234 0 

GAAACAAGCT CTTCCATAAT AACATCTCTA TCAATATGAC GAATTTGATT TCTTGCTTCA 24 0 0 

TTTTGGCGTG GAATATACGC AGGGTCACCT GATAATAAAT AACCTACAAT TTGGTTTACG 24 60 

GCATTATATC CTCGTTCATC TAATGTTCGA TAAACATTAT TTAAAACATC TCTTACATCT 2 520 

TGCGTTGGAA GTTCTTCATA GTCGAATTTC ATTGTTTTAT CAAAGTTTTC CATTTGCGAC 2 5 80 

ACTCCTTTAA TTACAAATAT AACTCACTAT CATCATACAA TATTATGGCT TTAAATTATA 264 0 

GATTTTTAAT GTAATCTTTA ATAAAGCTTA ATGATTTTGA GATATTTTCA GGTTGTGTAC 27 00 

CGCCACCTTG AGCCATATCT GGACGACCGC CACCTTTACC ACCAACGATT GGTGC CATTT 276 0 
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GTCTTATATC GATAACATGT TCTTCAATTA TTTAAATGCA GAGCATCCTA TAGGCTTGGT 300 

GCTAGTAATA TTAACAGTAC TTGTGATTAT TGGCTTTGTA CTGAACATGT TTATAAAACA 360 

CTTTAAGAAA GAGAGATTAA TCTAATGTTG ATGAATAGCG TGATTGCTTT AACTTTTTTA 4 20 

ACAGCATCTA GCAATAATGG CGGACTTAAT ATTGATGTGC AACAAGAAGA GGAAAAGCGA 4 80 

ATCAATAATG ATTTAAATCA ATATGATACA ACGCTATTTA ATAAAGACAG CAAAGCGGTT 54 0 

AATGATGCGA TTGCTAAGCA GAAAAAAGAA CGACAACAAC AAATAAAAAA TGATATGTTT 60 0 

CAAAATCAAG CGAGTCACTC GACTCGCTTG AATGA S3 5 



(2) INFORMATION FOR SEQ ID NO: 195: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

25 CTGAAATGGG TATTATTTGT CTTCTTCATC ATAAAGTAAT AAAGATTGTT CATCATTGCG 6 0 

ACGTTGCCAA TTTTCATTTG GCGATCTTGG AACGTATAAT AATTGC CTAG TAGCATACGC 120 

TTTGATTGAA ACATACAAGT CATTCCTTGA ACTTGCACCT CAATTTCCAC ATTTGAATTT 180 

30 TCTTTTATAT TAATAATTTC ATCCAAATTC AGCTCACGTG CTAAGACAGC TCTTGATGCG 24 0 

CCTCTTTTAC CCCAGTAATT ACATTGAAAA TGATTAGTTA CTAACGTCTC TGCATTCCAA 30 0 

TGAAGTGGTA TTGGATTTTC TTGCGCCTTC ACATACATTA CTACTGCTGG ATCCCCGAAA 3 60 

35 ATAATTCTGT CAACTCGTAT TTCATGTAAA AAATTAATAT AATCTTCTAC AGCATCTAAA 420 

TGATAATTAT GAAATAATCC ATTCACTGCC GCATATACTT TTTTATCGTT TTTGTGAGCT 48 0 

AATGCGACAG CCTCTGTCAT TTGTTGTCTA TTGAATTCCC CTGGAAGTCT TAAACCAAAC 54 0 

40 

TTTTGCTCGC CAATTACAAA AGCATCTGCA CCTAAATCAA TAAGTGTTTC CATATGGCTT 6 00 

AATGACTTGG GTGTGACAAG TAATTCTGTC ATAGTCATTC TCCTTTAATT GAAATCGCTA 6 60 

ATCCATCGTC TATATTTAAA AAATTCGTTG TATATCCTGG TTGCTTTATT AACCACTCAT 72 0 

45 

TATAATCTTG AACCTTTTTA ACCATTTGTC TTACATTTCT CGATCTAACA ATC CCAATAT 780 

CCGATACAAA ACCGTGATAT AAAACATTAT CTGTAATTAC GAGACCTTGG TGCTTTAAAA 840 

50 GTGGTGTATA TATTTCAAAA AATTTCTTTG ATTGCGCTTT TGCTGCATCA ATAAATATCA 900 

TATCATAAAC TTTGTCATTT ACATTTTCAA ATTGCTCTAA AGCATTACCT TCAATAATTC 960 
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TCGAGTTAAA GAATGGTTAA AACGAACAGC AAGAAAATAT GGTTTGGAAG CTGACGATGT 54 0 

CGTATTAATT TCAGCTGAAA AAGGCTGGGG CATAGACGAC TT ATT AT CAT CAATTGCGAA 6 00 

TATTCGAGAA AATGAAGATG TGTATATTGT AGGGACAACG AATGTTGGGA AATCTACATT 6 60 

GATTAATAAA CTGATTGAAG CTAGTGTTGG TGAAAAAGAT GTAGTAACAA CTTCAAGATT 72 0 

CCCTGGAACA ACTTTAGATA TGATAGATAT TCCTTTAGAT GAAACATCAT TTATGTATGA 7 80 

TACACCAGGT ATTATTCAAG ATCACCAAAT GACGCATTTA GTTAGTGAAA AAGAATTGAA 84 0 

AATTATTATG CCTAAGAAAG AAATAAAACA ACGCGTATAT CAATTAAATG AGGCGCAGAC 9 00 

ATT ATTCTT C GGCGGTCTAG CGCGCATAGA TTATGTATCA GGTGGTAAAC GTCCGTTAGT 960 

TTGTTTCTTT TCTAATGACT TGAATATACA TCGTACTAAm ACGGAGAAGG CTAATGATTT 102 0 

ATGGCGTAAT CAACTTGGCG ATTTATTAAC GCCACCTGGA AATCCACAAA ATTTTGATCT 1080 

TAATGAGGTA AAGGCTGTTA GACTTGAAAC AGGCAAAGAG AAACGCGATG TTATGATCTC 114 0 

TGGTCTAGGC TTTATAACTA TAGGACCAGG GGCTAAAGTA ATCGTTCGTG TTCCTAAAAA 12 00 

TGTTGAJcGTT GTATTAAGAA ATTCTATTTT ATAAGGTGaT TAAAAAAATG AAATTTGcAG 1260 

25 TTATAGGAAA TCCTATTTCA CATTCCTTGT CGCCCGTTAT GCATAGAGCA AATTTTAATT 1320 

CTTTAGGATT AGATGATACT TATGAAGCTT TAAATATTnC CAATTGAAGA TTTTCATTTA 13 80 

ATTAAAGAAA TTATTTCGAA AAAAGAATTa GAAGGCTTTA ATATCACAAT TCCTCATAAA 144 0 

30 GAACGTATCA TACCGTATTT AGATTATGTT GATGAACAAG CGATTAATGC AGGTG CAGTT 15 00 

AACACTGTTT tGATAAAAGA TGGCAAGTGG ATAGGGTATA ATACAGATGG TATTGGTTAT 15 60 

GTTAAAGGAT TGCACAGCGT TTAnCCAGAT TTAGAAAATG CATACATTTT AATTTTGGGC 162 0 

35 GCAGGT 162 6 

(2) Information for seq id no : 194 : 
(i) sequence characteristics : 

40 (A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AGGGTTAATT GTCGGTTTAA TTGCAATGAA TAAGTTCCAT GTATTAGCTG GCTATAGAGC 60 

GAAATTCATC TTAATGGTGA TTTTAACTAT GATGGTCTTC GTACTTATTA ATACGTATTT 12 0 

ACTAAGACAG GTAAAATCTA TCGGTATGTT CTTAATGATT GCTGCATTGG GTCTATACTT 180 
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TTATtAGTAT GATTATACAT AAGTaAGAAG TTCTAAAn 14 078 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TGAAAACTAA AGTGTTTCTA ATGCGTGACT AAAATTAGTA ATAATTAAGT TCTCATGATA 60 

ATAGGTATTT TTGAAAAATG GAGGAGTCTA TAAATGGGTA AAAAAATGGG TCTAGGTTTA 120 

TCTATTGCAT TGGTTGTTAT TGGTATTGCC GTTGTATGTT TAATGATTTT TTCTAGTCAA 180 

AAAACGACTT ATTTTGGTTA TATGAATAGT AATACAAATG CAGAAAAAGT TGTCAGTGAA 24 0 

AAAGATGGAT TAGTCAAACA TAATATCAAA GTAGAACCAT CTAATGATTT CAAGCCGAAA 300 

AAAGGAGACT TTGTAAAATT AGTTTCTAAA GATGATGGGA AGACATTTTA TAAACAAGAG 360 

25 ATTGTTAAAC ATGATGACGT CCCACACGGT TTAATGATGA AAATTCACGA CATG CAT ATG 420 

AATTAATAAA AAAGCATCTA TAACGTAATT TTGAAGAAGT AGAGTTATCT TCTTATGCGT 4 80 

TTTAGA 4 86 

30 (2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1626 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

GAGGTCTATA TACAATTATG GTTGTTCCAG TTAAACGAAC TGATGGCTTT ATTACTAAGT 60 

TTAATAGATT AATTGAAAGA CGATTATTAC GTCATTTCAG TAAAAAAGGT TATATCACAT 120 

GGGAGGAAAA TTGATTGTCT GACATTTTAA AATGTATCGG TTGTGGTGCG CCACTTCAAT 180 

CTGAAGATAA AAATAAACCT GGTTTTGTAC CAGAGCATAA TATGTTTCGT GATGACGTGA 240 

TTTGCAGACG TTGTTTCCGC TTGAAAAATT ATAACGAATT CAAGATGTAG GATTAGAAAG 300 

50 TGAAGACTTT TTAAAATTAT TATCAGGACT TGCGGATAAA AAGGGTATTG TCGTCAATGT 360 

CGTGGATGTA TTTGACTTTG AAGGATCATT TATTAATGCA GTTAAACGTA TTGTCGGAAA 420 
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TTTCACAGTA ACATTTATTG GATTATGGGT 
GATTC CGGGT CTCATTTTTA TAGGGGCTTT 
5 TTATGAATCA AATGCGTTTA AAAATATTAG 

TGGCGGCGTA TTTTCATTAC TAAGTACGAT 
TCAATTCGAA AGGTTTGGTT CTTTAACATT 

10 

AGAAACTGGA AAAGCACTCA TTATTGTTTA 
TTTGAATGGA TTATTAATCG GTGCTGCTAT 
AGGTTATATT TTGAATTTCG CTTTAGGAGA 

15 

ACGTGCGTGG ACTGCGATTG GTGGTCATTT 
AGTTATTGCG AAAGAACAGC ATGGCTTTGA 
AATATTCTTT TTATCAGCCG TTGTTTTACA 

on 

TGGCAGTGAT ACGTTGAAAA TATTTATTTT 
TTTAATGGGG GCAGGTTTAA AACAAGTGAA 

25 GAAAAAAGTA GACGAATAAT AATTAAAGCT 

ATTTTTATAA TTTGTCTTTA AAAGAGTGGA 
TTtCACGTTA AACAAATAGT GACAATTAGA 

30 GTATGTAATA ATCATTTTAT TGAAATTCAT 

AATTAAACCA CGCCATAATC ATATTGGATG 

GGCTGAGACA TAAATCAATG TCCCACACTC 

35 

ATTAAATAGA TACCTTCAGG TGTTACTTTA 

ATTTDATCGT TGTAAGCAAT ATAACTGTTT 

TCATTAGAAA TATTGTGACG TTCCTTAACA 

jo 

TCATCGACGT TGCTATTATT GTAGTCTCCT 
TGATTAAAGA AATAACCTGT ATCTCCACGA 
TCCCATGCAG AACGGTCTTT TAAGTAACCT 
TCAATATTTG CTTTTTTAGC ATCTGATTGT 
CGACCGTCAT ACGTAATTAA CTTAATTTTT 
5Q GCTAATAACA TTTTTGCTTG TTCGATATTT 
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CATGGCAGCA ATTTTTAATA ACACTAACGC 123 00 

AACAGTACCA TTATCGGGTT TGTTCTTCTT 12 3 60 

CATTTTTGAA GTTATTATCA TGTTCTTTAT 12420 

GGTATTATAT AGATTTGTCG TTTTTAGTGA 12480 

TTTCGATGCA TTTTTAGTAG GATTAGTTGA 12 540 

TTTCGTCAAT AAATTGAAAA CAAATAAGAT 126 00 

TGGTGCAGGG TTCGCAGTTT TTGAATCAGC 126 60 

AAATGTCCCA TTATTAGATA TTGTCTTCAC 12 720 

AGTTTGGTCA kCGATTGTTG GTGCTGCAAT 12780 

ATTCAAAGAT ATTTTTGATA AACGCTTTTT 12 8 40 

TGGCATTTGG GATACATCTT TAACTGTACT 12 900 

AATCGTTATT GTGTGGATAC TTGTATTCaT 12 960 

TTTACTGCAG AAAGAATTTA AAGAACAACA 13 020 

TATGTTGCTC ATATGTTTGT GACATAAGCT 13 0 80 

ATAGGAATAC TTTTTGGAGT TAAAAAAGTG 1314 0 

TTTATATAAA ATGAACATGA TTCACTGAAA 13 2 00 

CAAACAGAAA TTAATACAAT CATATAAGCA 13 2 60 

ACTTCGGCGT GGTTTTTATA GTTGAAGCAG 13 3 20 

CCTTATCGTT CAATCGTTGT TCGATAATCG 13 3 80 

TAATTTTTAA CCTTAGAGTT AGCAGCGACT 134 40 

GGTACATCTC GACTTGATAA TTTAATAATA 13 500 

TCTACAGTAT GATTCAATTG ATTAATTAAA 13 5 60 

TTATTAATAG CACCATCTTT TTTATATGCT 13 620 

GGAATTGTTC CGAAACTATA CATCGTTGCA 13 6 80 

TCTATGTCAT CAACACTTTT AATGTCGATT 13 74 0 

AATACTTGCG CAATTTTCGA TAGCTCTGGA 13 8 00 

AAAGGGTGTT CTTTTGTATA ACCATCTTTA 13 86 0 

TGTTTGGTTA ACTTAGGTTC TTTAATATAT 13 920 

^n^TTCCCAT AATPTTGATA AATATHAT^T "!?9 80 
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CGACTCCCAA CGAATCCACC GAAAATGCCA ACATCTAAAA TCGGTTGCAC ATCATGTTCA 10500 

ACACATTCTT CATGCAATAA ATTAGAGAGT TCATTATTGA TGCCATAACC GATGCTAATT 10560 

5 GTATCGCCAT AAGTTAAAAA CTGAGCAGCA CGTCGGAGAA TCAATTTGCG ACTATTAAAA 10620 

GGTAATGCGG GTTCAGGTAT TCCATCAATT CGTTCTTCTC CAGACAAGGC TGGTAAATAA 106 80 

TGACTTTGAA TTACTTGGCG GTGATTCTTT TCATCTTCTG TGACGTATAC ATAATCGACA 1074 0 

w 

AGATTTCCTG GGATAACAAC TTCATTCGGT TTTAGTTGAT AGTCGTCAAC TAAAGCTTTA 10800 

ACTTGTACAA TAACTTTCCC ATGATTGGCT TTCGCGTTTA ATGCGACATG ATAACACTCG 10860 

CTCAAGTACG CTTCTTGAGT TAAATAAATG TTACCTTGTT GATCTGCGTA TGTTCCTCTC 10920 

15 

AGTAGTGCCA CATCAACGCT AGGGAATGTG TAATGTAAGT ATGTTTCATC GTTGATGGTT 10980 

AC T AATGAAA CTAAATCATC CGTTGTTCGT GTATTTACTT TACCGCCACC GTATCTAGGA 11040 

2Q TCAACAGCTG TGTTTAATCC GATTTTAGTA ATAACTCCAG GTAATAATTG ATTACTCTGA 11100 

CGATAATGAG TTGCAATGAT ACCTTGTGGT AAAAAATAAG CTTCAATGTC ATTATTTTTC 11160 

ATTG cTTGTG CCGTTTTGGA AGAAGCCGTT AAAATACTCA TAATGACACG TTTAATCATG 11220 

25 CGACGTTCTA T AAAAT CAT C TAAATCCGGT GCGGCACCTA AACTATGAAT ATCATTCGCT 1128 0 

AATATAAACG TTAAATCATT GGGCGTATGA TATGTGTCAT GTTGCGCTAA CACAGCACGT 1134 0 

AGAACTTCGG CGGGTAAGTT GGCTACAGCT AATG CTGGT A AACCAATCAC ATCACCATCT 114 00 

30 TTAATGATAT GTTGTAAGTC GTGCCATGTG ATTTGTTTCA AGCAAGTCAC CTCCATCACA 114 60 

TTTGATAAAA TATAGCGTTT TTACACTTTG TGTAAACCCT TaCAAGAAAT ATAACATAAC 11520 

GACGTTTAAA ATCAATTAGA AATATCTTTT TATTCTGATA ATAGACACAG TATAGACACA 115 80 

35 

TTTTGATGGT CGATAACAAT TGTAATATCA AGGGTTTGTA ATGAATTGAA TATCATTAAA 11640 

ATACTTATAT AAAAATATTG TTCGGAATAT AAAAAGTTAA ATAGGTTTTG ATTTTTAAAT 11700 

ATGAAATACA AAGTGCCCAA TCGAACAAAG TATTTATATT AAAATATGGA AAATCCATCA 11760 

40 

ATATTAAATT AAAATAGTTT TATTATGAAA AGTGAAAGTA GGTAAGTCTA TGGAAGGTCT 11820 

TAAT CATCGA AGAAATACAG AAAAAGAAGA GACAACACAA ACGCAATCaG TTGCACCTAA 11880 

TACAGGTGAA GAGGGGATGT CATCAGCAAG TACACAATCA ACTAAGACGT CCGACATACA 11940 

45 

TAATGAATCT ATCGATAAAC AAATGGAAGC TAAAGCGCAT GAAACAGCGC AAAATACAGA 12000 

TTTAAAAAAC GAAGCAAGAA GTTTATTTGA TAATGCAACC AAATCAATCG GTAGACTAGC 12060 

50 GGGCAATGAT GAAAGCTTAA ATCTTAATTT AAAAGATATG CTTTCTGAAG TATTTAAGCC 12120 

G CA TACT AAA AACGAAGCAG ATGAAATATT TATAGCGGGT ACTGCTAAAA CTACGCCAGC 12180 
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AGACATTTTC GCCACCAGTT ATGATTAATT CTTTTTTGCG GTCAATAATA AATATATCGC 870 0 

CATCGTTGTC CATCTTCGCT AAGTCACCAG TTAATAAATA TCGACCATGA AATGCTTTGG 876 0 

CAGTCTCTGC TGGTTTATTC CAATATCCTG GCGTGACATT TTTAGCCTTA ATTGCAAGTT 882 0 

CGCCAATCTC ACCAGTAGGT ACTTCCTCAC CGTTATCATC AAGGATACGT GCATCAACGA 8 8 80 

ACATGACTGC TTTACCAATA CTCATTGGCT TACGTTTTGA ATTTTCCGGT GTATTAACAA 894 0 

GTACAAGAGG TGCTTCAGTT AAACCATAGC CGTTAATAAT GTTTATGCCA TATTGTTTAA 900 0 

AAGCTGCTTG GATACTTGGT AATGGTTGTG AACCACCTTG GATGATATAA TCCATAGCTC 906 0 

TAAAATTTTC AGGATTAAAA TTACTAGCAC GTAGCGTACT ATAATACATT GTCGG AATCA 9120 

TGATAATAAA TGTAGGGTGA TATTGTGCAA TCATGTCATT CAATTCTTCG CCGTTAAAGT 918 0 

AACGTTGAAG AATAAGTGTG CCACCTGACA TTAATACTGG TAATACAGTA TCGTTAAACC 924 0 

pn CTAAAACATG GAACATTGGT GTTGATACAA TCGTAATATA GTTTGAATTG AACTTATACG 93 00 

TCAGCTCTAA GTTTGCACCG TTATGAACAA ATGATTCATA TGAGAACATC ACACCTTTAG 9360 

GTGATCCGGT TGTACCACTT GTATAAATTA ATGCTGCAAG ATCTTGTGGT TCAACAGGTG 942 0 

25 TTGCTTGAAA AGGTTGGTGA TAATCTGGAT TTACGATTTC ATCATATTGC GCCACATCAA 94 80 

TATCCATATG CAATAAGTTT TGGTCAATAT CGGTGAGTGA ACTTAAATGT TTTT C AG CAT 954 0 

AG AAGAG CAG TTTTAATTGT GCATCTTCCA CAATGGCTGC AATTTCTTTT GGGTTAAGCC 9600 

GCCAATTCAA TGGTAAAAAA ACCGCACCTG TTTTAAAACA AGCAAACAAT AAATCTAATA 9660 

TTGCAATATC ATTTGGCGCA AAAATACCGA TAACATCGCC TTTTTTAACA CCTTGAGATG 972 0 

TTAAATAATG TGCCATATTA TCAGCGCGTG CATTGAGTTG TTGGTATGTC CAAGATGTTT 97 80 

GTTTTGCGTG ATCAATAACG GCAGGCTTGT CATCATCGAA GTCTGAACGC GTTTTTATCC 984 0 

AATCGAAATT CATTAGTATA CCCCCTTTAG CTTCACTTTC ATACTTTATG AATTGATTGT 9 900 

TTAAGTTGTC CCCATTTTTC TTTGTAAATG CTGGTATCAA TTAATTTTAA ATGATCAGCA 996 0 

ATAATTGGTT TAAAAGCCAT TTGATTCAAA ATATCTTTAT GCAAATCAAG ACCTGGTGCA 10020 

ATTTCAATTA GTTTCAAGCC TTGATTGGTG AGTTCGAATA CTGCACGATC AGTAACAAAA 1008 0 

^ TAGATTTCTT GCTCGAGTGA TTGTGAATAT TGTGCATTAA AGTCGATATG GCTCACATCT 1014 0 

GATACAAATT TCTGGTTTTG TCCTTCAGTT TCAATGTTTA ATCGTTGATT ATGGCATGAG 102 00 

ACATGACTGC CAGCTACAAA AGTACCTGAA AAGATAATTT TATTTACAGA TTGCGTAATG 10260 

50 TCTATAAAGC CACCACATCC ATTTAGTCGG TCATTGAAGT AAGACACGTT GACATTGCCG 10320 

t^-t-^-^^. r^Tr^r^AAA ^^TAAGATAG n^AA^,A"A r*ACCATTGTT ATAAATAAAA , 07B') 
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GTGATTCCTC CAATTTAGTT GAGGATAAGA TAACCATTAA GATAATTGGA ATAACGTTGC 6 90 0 

TATTTTATAA AATTAATTAA GTATCTTTGA CAGTCATCTT AGCCTCTTAT TTAAGGAAAA 6 960 

5 AGCTTTATGC TTAAAATAAG TCTTTTTTAG TGAAATTAAT GCATCTCATA TAATTATTTG 7020 

CTATTTATAC GAAAGCAGAA TCTCCAGTCA AAGCGCGTCC AATTACTAAG GCATTAATTT 7080 

CATGTGTACC TTCGTACGTG TAAATCGCTT CTGCATCAGA GAAGAAACGT GCAATATCAT 714 0 

10 

AATCGTCAGC T AGTATG CCA TTACCACCTG TAATACCGCG GCCCATAGCT ACTGTCTCAC 720 0 

GCAAACGTAA GGCATTCATC ATCTTCGCCG TTGAAGTTGC AACCTCGTCA TATTCACCAT 7260 

GTGCTTGCAT ATTAGCTAAT TGAGCACATG TTGCCATTGC TTGAGCTAAA TTACCTTGCA 7320 

15 

TCATTGCTAG CTTTTCTTGT ATTAACTGAT ATTTACTAAT TGGTTTGCCG AATTGcTTAC 73 80 

GCTCAGTGAC ATAATCTAAT GTGGCACGTA AAGCGCCAGC CATACCACCT GTAGCCATAT 7440 

AAGCAACGCC TGCTCTCGTT GAATAAAGAA TTTTGGCAAT ATCTTTAAAG CTTGTTATGT 7500 

20 

TTTGTAAGCG ATCCGCTTCA TCTACTTTGA CATTAGTTAA TTTAATTAGG GCGTTAGGAA 7560 

CAATGCGAAG TGCGATTTTA TTATCAATGA CTTCAATATC GACGCCATCT TGTTCTGGTC 7620 

25 TGACTACAAA GCAATGGGGT TTGCCAGTTT CTTTATTTAC TGCGAATACT GGAATGACAT 76 80 

CAGATACATG TGCACCACCA ATCCATTTCT TTTCACCATT GATAACCCAA GTATCGCCTT 7740 

GGCGTTCAGC GACTGTTTCA AGACCTCCCG CAACGTCCGA ACCGTGTTCT GGTTCAGTTA 7800 

30 AAGCAAAGCA TGTACGCAGT TCATGTGACT GTAATTTAGG T ACATATTT C GCAATTTGTT 7 8 60 

CTTTGCTACC TCCGAAATAG AAAGTGTTAT GCCCTAAACC TTGGTGAACA CCGAGTAGGG 7 920 

TAGCTAAGGA AATATCAAAT CGCGCGAGTA GGTAAGACAT GAAAAACTGA AATAGTTGAC 7980 

35 TAGGCATTTT GGCGTTTGGA CGATCCTTGT AAAGTAATGG ATTGTTAAAA TAATTTAATT 804 0 

CTCfCAGATC TTTAAAATAG TCCTCGGGTA CAGTAGCGTC TATCCAATGT TGATTAATAT 8100 

TTTCACGGTA CTTACTTTCT AGCAATGAAT CTACTTGTTG TAAAAATTCG ACTTCACCGT 8160 

40 

CTGTTAAACC TTTAGCAATA CTAAGTACAT CTTCAGGAAA TAATGTTTTT AAGACCGTTT 822 0 

CTTTTTCAAA TGTCATATAA ATTCCTCCTA AAAATAATAT GAATACTAAT GTGAAATGCA 82 80 

TTTAATTCAA AAACAACACG CTTTATTTGT AAACGCTTAC ACTAAATGTC AAAAATTTTT 8340 

45 

ATCACCTTTA AAGTGTTTGC GAGACTTTGT CATTCATCAT TTGTCGAATC GCAAGTTTAT 8400 

CTGGTTTCTG CGTACTGTTT AACGGCATAT GTGTCACTGG TACATACATT CTTGGGACTT 8460 

50 TATAACCTGC TAAACGACTt CGCATATGTT GATTTAAAAT TTCAGCGTAA TGAGGTTCAT 8520 

CTTCGCGAAG TATAATGGCT GCAGCAATTG ATTCACCATA TTTTGGATGA TCATAGCCAA 85 80 
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TGTTTGTCAT CAAACTTATG CGATGTGCGT AATATGCGAT CAGCCATTTC TGCAAGGCCA 5100 

CCGCCACTCG GTAATAAGCC AACACCTGCT TCAACAAGAC CGATATATGT TTCACTTGCA 5160 

GCGACAACAA TAGGTGAGTA AAGTACAAGC TCACAGCCAC CGCCTAAGGC ACGACCTTGA 52 2 0 

ACAGCTGTGA CTACTGGTTT CAAACTATAC TTCAAACGAT TAAAGCTATA ATGTAATTTA 5 28 0 

TCAATTGATT GTGCAACGAC ATCATCTACA AGACCGTCTT CATGCGCCTT TTTCATTAAG 5 34 0 

AAAAGGTTAG CACCCACACT GAAATTGTTA CCATCTGCAT AAATAAC CAT ACTTGTGTAA 54 00 

TGGTCATTTT CCAGTAAATC AATCGCATCA ACTAACGCAT CGTTGAATTC ATCGGTAATG 54 6 0 

ACATTATTTT TACTTTGTAA TTTCAGTAAC AGTTGATCAT CATGAGTTAC GGAAAGTTTG 552 0 

G CATCAC CTT TATCCCAAAG TTCATCTTTT ACGAAGTGAG AAATAGGTGT TGCATATTCA 5580 

ATGGTCTCAT CTTGTTTATA AAAGCCACCA TCTAAATCAC TAATCCATTG TGGTAAGTCT 564 0 

CCAAGTTCGT CTTCCATACG TGTTTTAACA CGTTCGTATC CCATTGCATC CCATAATTGG 57 00 

AATGGACCAA GTTTCCAGTT GAACCCCCAG ACAAGCGCAC GGTCTATGTC TCGGAAATCA 5760 

TCGGTAGCTT TAGGTACATT GATAGCAGAG TAATAGAAAT TATTACGTAA TGTCTCCCAT 5 820 

AAAAATAGTC CCGCTTCGTC TTGCGCATTG AATATGGTAT CAAGGTTATG CACTAAGTCT 5 880 

TTATTAAATT CATTTAAAAT TGGTAATTGT GGTTGCGATA CAGGTACATA ATCTTGTTTT 5 94 0 

TCAACATCGT AAACAAGTCG AGCTTTAGTT TCTTTATCCT TTTTGTAAAA TCCTTGTTTC 60 00 

30 GTTTTACGTC CGAGTGCGCC ATTGTCAAAC AACGTATTTA CAATTTTGAC ATCATGAAAA 6 060 

TAAGGTGTTT CTTCAGGTAC TTGTTGCATG CCTTTAATTA CAGACACTGC AATATCTAAA 6120 

CCGACTAGGT CAGATAGCGC ATATGTACCT GTTTTAGGAC GACCAATCGC TTGCCCAGTT 6180 

AAAGCATCCA CATCTACAAT GCTTATCTTG TGTTGCTCGG CGCGATACAT AATATCATTC 624 0 

ATTGTTTGCG TGCCGACTCT ATTTGCGACA AAGCCAGGCA CATCATTGAC GACAATGACA 63 00 

CCTTTACCTA ACACATTTTG CGCGAAATTT TTTACATCTA ATATAATAGA TTCCTTCGTG 63 60 

TGTGACGTAG GTATTAACTC CACTAATTTC ATAATACGTG GTGGGTTAAA GAAATGTAGA 64 2 0 

CCAAAGAATC GTTCTTGATC CTTCTCGTTA AATGCTTGAG CAATCGCATT AATTGGAATA 64 80 

CCTGATG TAT TTGTAGCGAA TAAAGCATCT TCTTTAGCAT GTTGTAGAAC TTGTTGCCAA 6 54 0 

ACAGCATGCT TAATTTCAAT ATCTTCTTTG ACTGCTTCGA TATATAAATC AG CAT CAT CA 660 0 

TTTACCAAGT CATCATCAAA ATTACCATAT GTTAAATGAC TCGCTAGATT TAAGTCGAAT 666 0 

AGTAGCGGCC GTTTCTTATC TGTAATTTTA TCGTAAGATT TTTTCGCAAT GAGATTTGGA 6 72 0 

TCOTTTTTGT rrACTACAAT ATCTAATAGT TTTACTTTAA GTCCAGCATT CACAAAAAGT S78 0 
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TATTTCACAC AGCTTCATTA ATAAAACGAA ATTGCTTCAA CCCGCTTCAA CTTCAACTGG 3 3 00 

CTTCAACTTC AGCCTACTTC ATTCAATAAC AAAACGAATC CGCTTCATCC AAAATCAACC 3 360 

ATTCTAACGC ACATATTCAA ATATAGCAGC TGCACCCATG CCGACACCAA TACACATCGT 34 20 

AACCATGCCG TAACGGCTAT CGGGACGTCT ACCCATTTCA TTAAGTAAAC GCGCGGTTAA 34 30 

CATTGCGCCT GTAGCACCTA ATGGATGACC TAAAGCAATA GCGCCACCAT TCACATTCGT 3 540 

ACGTGATATA TCTAGACCTA CTTCTTTAAT AGATGCAATC GTTTGAGAAG CAAATGCTTC 36 00 

GTTCAATTCG ATCAAATCAA TGTCTTCAAC AGATAGATTG CTGAGTGACA ATACTTCAGG 3 660 

AATCGCATAT GCAGGCCCAA TACCCATAAT TTTCGGGTCA ACGCCTACTG CCTTAAAACC 3720 

AACGAATCGT GCAATAGGTG TCACGCCGAG TTCTTTCACT TTATCTCCAG ACATTAAAAC 3780 

TACAAATCCT GCACCATCAG AAAGTGGGGC AGATGTTCCT GCAGTCATAG TGCCGTCAGC 3840 

TTTAAATACT GTACGTAATT TGGCTAATGC CTCCATCGTG GTGTCAGGGC GTATAAATTC 3 900 

ATCTTGGTCA AAGATATTTG TGTGTACTTT TGGTCCTGCG TTTGTATATT CAACTGAGTT 3960 

TACTTGTATT GGAATAATTT CATCTTTGAA CCGACCATCA CGTTGTGCGT CATAGGCACG 4 020 

TTGATGACTT CTGACAGCAT AAGCATCTTG ATCTTCGCGT GATACGTCAA ATTGGGATGC 4080 

TACATTTTCA GCAGTTAAAC CCATAGGATA TGACGCACCT ATATCATCAT ATTGTAAGGT 4140 

TGGATTGTTT GTGGGCTCGT TGCCACCCAT TGGTACGGCA CTCATCAATT CAACGCCACC 4200 

30 AGCTACAAGT ATATCTCCTT GACCAGCCAT AATTTGATTG GCTGCAATCG CGATGGTTTG 4260 

TAATCCTGAT GAGCAGTAGC GATTCACTGT TTGACCCGGT ACCGTGTCAG ATAATCCCGC 4320 

ACGCAATGCA ATCGTTCGTG CAATGTTTTG GCCTTGTAAT CCTTCTGGAA AAGCCGTACC 43 8 0 

AACAATGACA TCTTCAATCA TATTCTTATT GAATTTTCCG TCAATACGTT TCAATACGCC 4440 

TTGTAATACT TTGGCTGCGA CATCATCAGG TCTTTCGTGG AATAATGCGC CTTGCTTTGC 4500 

TTTCGCTGCG GCTGAACGCC CATAAGCTAC AATGTATGCT TCTTGCATGG TTATCATCCT 4560 

CTCTTAATGA CTATCTTTTA ATTACGTAAT GGCTTACCAG TTTTTAACAT ATGTGCAATT 4620 

CTTTCATATG ATTTTTTAGA TTTTAGTAAG TCAATAAAGC CAATTTTCTC CAACGATTGA 4680 

ATGTAACGTT GATTGATAAA TGTATTTCTT GGTAAATCAC CACCCGCTAA AATTGTGGCG 474 0 

ATATTTAAGG CAATATGATA ATCATGGTCG CTAATAAAAT GACCCCGTCT TTGCGCATCT 4800 

AATTGTCCTT GGATCAATGC TTTGAAGTCT TCACCTAAAG CGATATATTG ATGTCTAGGA 4860 

TTCGGAATAT AGTTTGTTTC TGCTTCATAT TTCGCACGTT TGAGCGCAAC TTCGACACGT 4920 

TGTGCTGTAT TGAAAATAAT CGTATCTGTA TCACGTAAAT AACCATAACG ACGTGCCTCA 4980 
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GTTGTATCAT ATTATGGTGA TAAGGATTAT GGGGAGCACG CGAAAGAGTT ACGAGCAAAA 150 0 

CTGGACTTAA TCCTTGGAGA TACAGACAAT CCACATAAAA TTACAAATGA ACG T ATT AAA 156 0 

AAAGAAATGA TTGATGACTT AAATTCAATT ATTGATGATT TCTTTATGGA AACTAAACAA 16 2 0 

AATAGACCGA AATCTATAAC GAAATATAAT CCTACAACAC ATAACTATAA AACAAATAGT 1680 

GATAATAAAC CTAATTTTGA TAAATTAGTT GAAGAAACGA AAAAAGCAGT TAAAGAAGCA 174 0 

GATGATTCTT GGAAAAAGAA AACTGTCAAA AAATACGGAG AAACTGAAAC AAAATCGCCA 18 00 

GTAGTAAAAG AAGAGAAGAA AGTTGAAGAA CCTCAAGCAC CTAAAGTTGA TAACCAACAA 1860 

GAGGTTAAAA CTACGGCTGG TAAAGCTGAA GAAACAACAC AACCAGTTGC ACAACCATTA 192 0 

GTTAAAATTC CACAGGGCAC AATTACAGGT GAAATTGTAA AAGGTCCGGA ATATCCAACG 1980 

ATGGAAAATA AAACGGTACA AGGTGAAATC GTTCAAGGTC CCGATTTTCT AACAATGGAA 204 0 

CAAAGCGGCC CATCATTAAG CAATAATTAT ACAAACCCAC CGTTAACGAA CCCTATTTTA 210 0 

GAAGGTCTTG AAGGTAGCTC ATCTAAACTT GAAATAAAAC CACAAGGTAC TGAaTCAACG 216 0 

TTAAAAGGTA CTCAAGGAGA ATCAAGTGAT ATTGAAGTTA AACCTCAAGC AACTGAAACA 222 0 

ACAGAAGCTT CTCAATATGG TCCGAGACCG CAATTTAACA AAACACCTAA ATATGTTAAA 228 0 

TATAGAGATG CTGGTACAGG TATCCGTGAA TACAACGATG GAACATTTGG ATATGAAGCG 2 34 0 

AGACCAAGAT TCAATAAGCC ATCAGAAACA AATGCATATA ACGTAACAAC ACATGCAAAT 2400 

30 GGTCAAGTAT CATACGGAGC TCGTCCGACA TACAAGAAGC CAAGCGAAAC GAATGCATAC 24 6 0 

AATGTAACAA CACATGCAAA CGGCCAAGTA TCATACGGAG CTCGTCCGAC ACAAAACAAG 2 52 0 

CCAAGCAAAA CAAACGCATA TAACGTAACA ACACATGGAA ACGGCCAAGT ATCATATGGC 2 5 80 

GCTCGCCCAA CACAAAACAA GCCAAGCAAA ACAAATGCAT ACAACGTAAC AACACATGCA 2 64 0 

AACGGTCAAG TGTCATACGG AGCTCGCCCG ACATACAAGA AGCCAAGTAA AACAAATGCA 2700 

TACAATGTAA CAACACATGC AGATGGTACT GCGACATATG GGCCTAGAGT AACAAAATAA 2 76 0 

GTTTGTAACT CTATCCAAAG ACATACAGTC AATACAAAAC ATTACGTATC TTTACAACAG 2 82 0 

TAATCATGCA TTCTATGATG CTTCTAACTG AATTAAAGCA TCGAACAATC GGAAGCATAT 2 88 0 

TTCTAAATTA TTTATTCATT ATAGTCTTAA ACATAACATG ACCTAATATA TTACTAACCT 2 94 0 

ATTAAAATAA ACCACGCACA TCTAAGTGAT ATACGACAAT CACAGCAATA ATAATTGCTT 3 000 

TAGAAAGTCG TGCCGAACTG GAACTTACAA GTCTAGTTCG AACACACACT GATGTGAGTG 3 06 0 

GTTTTCTTTA TTTTAAACAT GAACAATCAG ATAAGTTACT AGCATTAGCA AATATTATTA 3120 

AATCAAAGGG CTTCGATTCA TAAAATTTAA AACAATGATT AAAATTAGAr GTGTAAATGT -» - q a 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14078 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGGACTATTA ACGGCGaAGA AGATTTAACG AAATACTTAC AAACCAATGT TGATGGTATT 6 0 

ATCACAGATG ACCCAGCATT AGCTGATCAG ATTAAAGAAG AAAAGAAAGA CGAAACATAC 12 0 

15 

TTCGATCGTT CTATAAGAAT TTTGTTTGAA TAATATAAAC AAAGACCTCT AAAGTTATCA 180 

AGATGATACC TTCAGAGGTC TTTTTAATGT TGCCATCTAT GGGATAGGCA ATCGTTTCAT 24 0 

2Q TCGTTTATAT TCATATGACA AGTATTTGTA TGGCAATTTG GCGTCACAAA CACTTACATG 3 00 

ATTTATTGGT GAATTATTAA TTGTTTTGTG AATGCAAAGG GTTAGAAATT GAATTGTAAA 360 

TACTTTCTAA TCTTTGTTTC GCTTTAGTCA TTTGATCCAA ATTTTTAGTG CGTATAGCGG 420 

25 ATTTTGCAAT ATAGTGCGCA cTAAAATATC GCGTTTTTGA AACGCATCTA AATTTAGGTA 4 80 

CGATAATTTA TTTAAGTCAG TGTTTGCTAT TAATTCATGT AATTGATCTA CAAGCGCTTG 54 0 

ATGTTGATAC GTATGTGATG TAGTTTCAGA TTTGCTTGCT AATTTAATAC CAGTCGTATC 6 00 

30 AAGGAGCGCC GCTTTAATAC CAGCAACTAA ATATGTTTTG ATTTTCATTT GTGTTGTCAT 660 

GCTTTGTTAC TCCTTTGATG TACATTAATC AAAAAAATTA TACACTATTG TATATTGCAA 720 

AG CT AATTAA CTATAACAAA AAGATAGTTA ATGCTTTGTT TATTCTAGTT AATATATAGT 780 

35 

TAATGTCTTT TAATATTTTG TTTCTTTAAT GTAGATTGGG CAATTACATT TTGGAGGAAT 840 

TAAAAAATTA TGAAAAAGCA AATAATTTCG CTAGGCGCAT TAGCAGTTGC ATCTAGCTTA 900 

TTTACATGGG ATAACAAAGC AGATGCGATA GTAACAAAGG ATTATAGTGG GAAATCACAA 960 

40 

GTTAATGCTG GGAGTAAAAA TGGGACATTA ATAGATAGCA GATATTTAAA TTCAGCTCTA 1020 

TATTATTTGG AAGACTATAT AATTTATGCT ATAGGATTAA CTAATAAATA TGAATATGGA 10 80 

45 GATAATATTT ATAAAGAAGC TAAAGATAGG TTGTTGGAAA AGGTATTAAG GGAAGATCAA 1140 

TATCTTTTGG AGAGAAAGAA ATCTCAATAT GAAGATTATA AACAATGGTA TGCAAATTAT 1200 

AAAAAAGAAA ATCCTCGTAC AGATTTAAAA ATGGCTAATT TTCATAAATA TAATTTAGAA 1260 

50 GAACTTTCGA TGAAAGAATA CAATGAACTA CAGGATGCAT TAAAGAGAGC ACTGGATGAT 1320 

TTTCACAGAG AAGTTAAAGA TATTAAGGAT AAGAATTCAG ACTTGAAAAC TTTTAATGCA 1380 
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CACCT r ^Cd'V*V 


1 rCACTCAAT 


TGAATAGAAA 


GATGCTTACA 


AATTGAACTT 


AATGCTGTTG 


4020 




AATACCTTGA 


TTAATATTTT 


TTAATCCACT 


GTGTTCATGG 


TAAAAGAGGA 


4080 




ATATTTATTT 


TCCATAGTTT 


AGCCTACTTT 


CTAAAAATTG 


GTTCATTAAA 


4140 


inlnlnl A^v, 


CACrrrrAAT 


TGTTAATACC 


AAAAATATGT 


TTTTAAATAG 


AGAAAATGGT 


4200 


-rv-rt 1 MAn 1 vjAA 


Al luAl l i CT 


ATAGAGTGGG 


ACGAGAAAAT 


ATAGTTATAG 


CTGTCTATAA 


4260 




AAGTTTTTAT 


TTATACTGAT 


ATCTTGAATT 


TAATTAATAG 


AAACCTATAA 


4320 


A 21 Zk AZir , 2kr , T2l 


&P/*'PJV'T*T*PH TV 

AGCCATTTAA 


ATGACTTACT 


Gi-i-i-i-ri'GAA 


TTAGGCCAAC 


AATATTAACG 


4380 


Inl MV— Lll lw 


A I CGCTTTGA 


TGATTAAAGG 


TGAGAATGCT 


AATACAATTG 


TTGTAACAAT 


4440 


nni IVjLAACJA 


ACACCTAGGA 


AAATAAAGTA 


ATTTGTTTGA 


CCTAGTGGTT 


CTATTAACTT 


4500 


AAl_ 1 AAAsj I A 


f~*f JV 'IT^ TV « 1 w TV* 

C CATTG ATTG 


CTTGTGCAGA 


AGCGTTAGTT 


AAGTACCAAA 


TACTCATCAT 


4560 




AATGCTTTAG 


GTGCTAACTT 


AACAGCAGCA 


CTATTACCCG 


TTGGTGATAA 


4620 


f /"* ta tv /t> tv 
bUAl AGCTCA 


C CGATAACAC 


AAATAATGTA 


CGATAAAATA 


ACCCAGTTAA 


CTGAAAAGTT 


4680 


TV"" A T'O TV TV /^»«T< 

I\»A 1 bAACCT 


GATGCATAAC 


CTACAATACC 


AATTAGTATG 


TATGACGCAC 


CTGCTAAGAA 


4740 


LQj I AUCAATT 


GCAAATTTTA 


CTGGCAGGCT 


AGGTTGTTTA 


GTTCCAAGCT 


TTTGCCATAA 


4800 


T\ TA/-"TV"»TV iv TVT>TV 

AALj I u AAATA 


ATTGGAGCTA 


GTAATAAAAT 


AAATAATGGG 


TTAATTGATT 


GGAAGATCGC 


4860 


I 1 UAv-v-AAAG 


TTTGTTTTCC 


AACCAAATAA 


GTTTAATTTC 


ATATCTGAAT 


GTTCAATTCC 


4920 


Ai AIAIvjI I 1 


TV TV T> TV TV T*P IV 

AATACATTAG 


ACCCTTGTTC 


TTGAATAGCC 


CAGAACACCA 


TTCCAAGAAT 


4980 


TV TV TV T* X >. T , /" , /~* H 

AAA I AA 1 GG A 


ATAAATGCTT 


TAACACGAGA 


ACGTTCAGTA 


TCAGTGACAT 


CTTTACTTCT 


5040 


AATAATTAAA 


GTGAAGTAAA 


TGAnTGGTAA 


TGCAATACCT 


AATACTAAAA 


CAGTATTACT 


5100 


AACTAAGTTA 


AATGATAATG 


AGTTAGTTAA 


TGCACCAATA 


ACGATAATTA 


ATACAATTGC 


5160 


taajEacaaca cttccgataa 


TAAGACCATA 


CTTTTTCTTT 


TCAGCTGGTG 


TCAATGGGTT 


5220 


AGTAGGTTTC 


ATACCAACGC 


TAC CTAAGTT 


TTTGCGGTTG 


AAAAGTACAT 


ACCATACTAA 


5280 


ACCTAATGCC 


ATACCAACTG 


CTGCAATCAA 


GAATCCGCCG 


TGGAAGTTTT 


TAACATTAAC 


5340 


AAAGTGTTGC 


AAAATAATAG 


/-» <"p/-» IV rri TV TV T* TV IV 

G TG AT AAT AA 


TGCACCCATA 


TTAACTGACA 


TATAGAAAAT 


5400 


AACAAAACCT 


GCATCCATAC 


GTCTATCATT 


TTCAGGATAT 


AAACGGCCAA 


CGATATTTGA 


5460 


AATGTTTGGC 


TTCATTAAAC 


CTGAACCAAT 


AATGATGAAG 


AACATTGATG 


TGAATAAGCC 


5520 


GATTAATGCA 


AATGGTAAGC 


TTAAACAAAT 


ATGTCCGATA 


ATAATAAAGA 


CTGCACCTAA 


5580 


TAAAGTAGCG 


CCTCTAGTGC 


CTGTAATTCT 


GTCAGCAATC 


CATCCGCCTG 


GTATTGATGT 


5640 
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TTCCTGAACA TGATGGATTA ATTACAGAAG TATTGAGAGA ACCAGGCTTC TTCAGACATC 2220 

TTAAAGTGAT GCCGTATGCA CAAGAAGTTG TGAAAAAATT AACTGAACAT TATGATGTAT 2 2 80 

ATATTGCTAC AGCAGCAATG GATGTACCAA CATCATTTAG TGATAAATAT GAATGGTTAC 234 0 

TAGAGTTCTT TCCATTTTTA GATCCTCAGC ATTTTGTTTT TTGTGGTAGA AAAAACATCG 24 00 

TTAAAGCTGA TTATTTAATA GATGACAATC CTAGACAGCT TGAAATTTTT ACTGGTACAC 2460 

CGATTATGTT TACAGCAGTG CATAATATTA ATGATGATCG ATTTGAACGC GTAAATAGCT 2 520 

GGAAAGATGT AGAACAGTAT TTTTTAGATA ATATTGAGAA ATAAAATATA TCACTTGAAA 2 580 

AATTTCATGT AGAAAAGATG ATGGATAGGC TATAAAGTAA TTGTGACTGA GATGAACTTT 264 0 

TATGTCTTAG ACACTACAAC ACTATATTGG CAGTAGTTGA CTGCGGGGCC CCAACATAGA 27 00 

GAAATTGGAT TCCCAATTTC TACAGACAAT GCAAGTTGGG GTGGsCCCCA ACATAAAGAA 2760 

20 ATACTTTTTC TTTAGAAATT AGTATTTCTT ATGCATGAGT GTAACTCATG CATTCATATT 2820 

TTTAAGTACA CATTAGCTGT GACTAATGAT AAAGAATCGC TACATAATCA ATCATTAGTC 2880 

GTTCTTTATC ATTTCCGTCC CGCTCTCAAT AAATGTTAGT CTATCTTATT ATTATAAATC 2 94 0 

25 GGATGAATGT GTTAATCTAT GGCAGATTAC ACGTCATCCG ATTTTTTATA GAATTTGAAA 3000 

AAGACGCATA AACCACTATG ATTTAAAATA CAACATCAAT CATTTTAGTG gCATGCGCCA 3 060 

AAATTATATG TCTGTTTTTG AAACAGGGTA ATAGCTTAAA GCTAATAAAA ACGAATATAA 312 0 

GGTGCGTTGA ATCTTATGAT TACACTCCAA ACCTAATATA ATATCGGGTT AAGATCATTC 3180 

CGGATG CTTA CAAATCATTG ACAGTAAGTA ACTGAATGGC ATTTGGTATA ACCTCAATAT 3 24 0 

CAATAGGTGT TTCTAATGAA ATTTCGCCAT CAATATCAAC TTTCATTGCT GGATCTGTTG 33 00 

TAAGTGAAAT CTTTTTACCA GGTATATGCT CAATACCTTG AGTAATTTCA TTCCaATTCA 3360 

TGCTATCACG CTTTTTAAAA ATATCATTTA AAATACTGAA ACTTTGTTCA TTAAAAATGA 3420 

AAGTGTTCAG TTCACCATCT TGAGGAGACA AATCAGTCaA TGGTATACGA CTACCACCAA 3 4 80 

TGAATGGACC ATTTGCTGTT AGTATCATGG TCGTTTCGCC AGAATATGTC TTATCATCTA 354 0 

TTGATAATTG ATAATTAAAT TGTGTTGGAT TTAGCAGTGT TTTGACAGTT GATCCAATAT 3600 

45 AACTCAATTT ACCAAATATA TCTTTTGAAC CATCTTGTAC GTTTTCAGCG TTTTGAACAA 3660 

TGAGACCTAA GCCAACAAAG TTGAGTGCAT ATTGATTATT TATTTTAATT ACATCGTATG 3720 

TACCAACTTG TGCAGAAATC ATTTGTTCAC TAGCTTGTTT ATGATTAGGT GCTATATTTA 3 780 

GCGTTTTTGT AAAATCATTA AAAGTACCGC CTGGTAAAAT GCCAATAGGG AGTTGAAGGT 3 840 

CATGTGTCAT AACACCGTTT ATAAGTTCGT TAACCGTGCC ATCACCGCCA AGAATAAATA 3 900 
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GTTAAT CAAA TAACGACCAA CGCCACATAA GATGCGTAAC ACCAAATTAT ATCTTATGTG 4 20 

GCGTTGTTAT ATTTAAATCT ATAATTATGT TCAATTTAAA CATGCAATAA TGATTAAAAA 48 0 

ATATGACATG TTAAACACAA TGTAAGCTAT TATGATGTGA AAATAGTAGC ATTGCATTTT 54 0 

AGAAACATAG AGCGATATAA TGAATATAAG TTTTTTGAAA TTTCAGTTAA TTCTAAGGAG 60 0 

GTTGTTTTTA TTATGAAAGA ACAACTTAAT CAACTATCAG CATATCAGCC TGGTTTATCT 66 0 

CCAAGGgCAT TGAAAGAAAA GTATGGCATT GAAGGAGATT TATATAAACT TGCATCAAAT 72 0 

GAAAATTTGT ATGGACCATC GCCTAAAGTT AAAGAAGCGA TATCAGCACA CTTAGATGAG 78 0 

TTATATTATT ATCCTGAAAC AGGATCACCG ACATTAAAAG CGGCGATTAG TAAACATTTA 84 0 

AATGTAGATC AATCACGCAT TTTATTTGGT GCGGGATTAG ATGAAGTTAT ATTAATGATT 90 0 

TCTAGAGCTG TATTAACGCC AGGGGATACT ATTGTTACAA GTGAAGCGAC ATTCGGTCAA 96 0 

2Q TATTATCACA ATGCGATTGT TGAATCAGCT AATGTGATAC AAGTACCTTT AAAAGATGGT 102 0 

GGCTTCGATT TAGAAGGTAT TTTAAAAGAA GTTAATGAAG ATACGTCATT GGTATGGTTA 108 0 

TGTAATCCAA ATAATCCTAC AGGTACATAT TTTAATCATG AGAGCTTAGA TTCGTTTTTA 114 0 

25 TCTCAAGTAC CTCCACATGT ACCAGTAATT ATAGATGAAG CTTATTTTGA ATTTGTGACA 120 0 

GCAGAGGACT ACCCGGATAC ACTTGCTTTG CAACAAAAAT ATGACAATGC TTTCTTATTA 126 0 

CGTACATTTT CAAAGGCGTA TGGATTAGCG GGTTTACGTG TAGGATATGT GGT AG CAAGT 132 0 

GAACATGCGA TTGAAAAATG GAACATCATT AGACCACCAT TTAATGTGAC ACGTATATCT 1380 

GAATACGCAG CAGTTGCAGC ACTTGAAGAT CAACAATATT TAAAAGAGGT AACACATAAA 1440 

AATAGTGTTG AACGCGAAAG ATTTTATCAA TTACCTCAAA GTGAGTATTT CTTGCCAAGT 15 00 

CAAACGAATT TTATATTTGT AAAAACmAAG CGGGTAAATG AACTTTATGA AGCACTTTTA 156 0 

AATGTAGGGT GTATTACGCG ACCATTTCCA ACTGGTGTTA GAATTACAAT TGGTTTTAAA 16 2 0 

GAACAAAATG ATAAAATGTT AGAAGTTTTA TCAAACTTTA AATACGAATA GTAAGTGGGG 168 0 

AGTGGGACAG AAATGATATT TTCGCAAAAT TTATTTCGtC GTCCCACCCC AACTTGcATT 174 0 

GTCTGTAGAA ATTGGGAATC CAATTTCtCT TTGTTGGGGC CCCGCCGGCA AGGTTGACTA 18 0 0 

j 5 GAATTGAAAA AAGCTTGTTA CAAGCGCATT TTCGTTCAGT CAACTACTGC CAATATAACT 186 0 

TTGTAGAGCA TTGAACATTG ATTTATGTCT CAAGCTCAAT GCAGTGTGAA TGATGAGGTG 1920 

AGAGTATTCA GTGTAAAAAG CAACAATAGA TGATATTGTT TTGTATCAAT TGCTTTTTTG 1980 

50 CTATACTGAA TCAATACTGA TATTTTCAGG AGAAGATTAA AATGACCCGT AAATCAATCG 2 04 0 
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GACAAATGCG 


CTTTCTATTA 


TATCAGTAAT 


TGGTATACTC 


AGCCTCATAA 


TTTTCCTTAG 


3840 


TGTGTATTTG 


GCAAATAAGT 


TTTTATAAAT 


CATCGTGGTA 


TCGTCTCATA 


TTATTTATAT 


3900 


TATCCAAAAT 


AGCATAAAAA 


AATACCAACA 


AGATTTAGAA 


CCTTGTTGGT 


AATCAAAGCG 


3960 


a TTC ATTT AT 


AATGAGTCGT 


TTTATGTTGT 


AAGATTAAAC 


AGTTTGTACG 


TTAACTGCTT 


4020 


GGTCTCCACG 


TTGACCTTCA 


GTGATTTCGA 


AAGTAACTTT 


TTGACCTTCT 


TCTAAAGTTT 


4080 


TGTAGCCATC 


GCTAG CGATA 


CCTGAGAAAT 


GTACGAATAC 


GTCTCCGCCA 


TTTTCTTGTT 


4140 


CGATGAAACC 


AAAACCTTTT 


TCTGCrTTAA 


ACCATTTwAC 


TGTACCGTTA 


TTCATATwGA 


4200 


AwACCTCCGT 


gTGCTTTTGC 


ACTTAATATT 


TGTAACAAAT 


TCATAACTAA 


AAAAGAGGAT 


4260 


ATTCTAAACA 


AATACACTAC 


AATTTAATTC 


ACGAGCTTTT 


ATTACGTAAG 


ACCAACTATA 


4320 


CGCTCATATT 


GGCATAATGT 


ACAGTGTTTT 


TTGAAAATAA 


ATTAAAAAAG 


ATTTTTAAAA 


4380 


ACCTTAGAAA 


CGTTGATTTA 


AAGGGGTTTA 


TAAAAATwAw 


AAAATTGTAG 


TCTTTTATGG 


4440 


TGTTTGCTAG 


TTTTCAAAGT 


GACATATCGT 


TTAAACATGA 


TGATTTTATA 


AGCAATCCAT 


4500 


AAAAAACAAG 


CAGCGATAAA 


CGCTACTTGT 


TGATATTAAA 


ATCTGACTTG 


AAAGGTCATA 


4560 


GCAATGTTCT 


ATACCGATGG 


AATGTGCTTA 


CTTGCCTTTT 


TCTTCACGAC 


GTTTTAAATA 


4620 


AT AAG AG CCA 


CCTAATAAAC 


CAGCTGGAAT 


GCCTATCATT 


GGTGTTGTGA 


ATGAGCTTAA 


4680 


TACAATAACA 


AGTATTGTTA 


AAGCAATGAC 


GTTATACCAA 


GTTACAGTCA 


AATTTTTCAA 


4740 


ATCCTCATAT 


GATTGTTTTA 


CTAATTCTCT 


AAATTTCATG 


ATTCAATCTC 


TCCTTTTTTA 


4800 


T AAAT CTTTA 


GATTGTCAAA 


TTAAGCTGGA 


CA 






4832 



(2) INFORMATION FOR SEQ ID NO: 190: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5727 base pairs • 
T (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 
45 CAAAGCTGTT CAAAAGGCTT ATAATTTAAA TTTAGATAAC ATACGTACAA TGGAACCTAA 60 

GTTGAGATAT CAAGCGATCA ATAAAGGTAA TATTAATTTA ATAGATGCAT ATTCAACTGA 120 

CGCTGAATTA AAACAATATG ATATGGTTGT GTTAAAAGAT GATAAGCACG TATTTCCACC 180 

50 

AT AT CAAGGA GCACCATTAT TTAAAGAAAG CTTTTTAAAG AAACATCCAG AAATTAAGAA 240 

ACCGTTAAAC AAACTAGAAA ACAAAATATC TGATGAAGAT ATGCAAATGA TGAACTATAA 300 

55 
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TTCTAGTCAA CCTTGCTGGG GTGGGACGAC GAAATAAATT TTGCGAAAAT ATCATTTCTG 2040 

TCCCACTCCC TAATTTGAGC TGGATATACT TTCATTTGAA CCCTTTATTG CTAGTTTATG 2100 

5 AAAGTATCAT GAAAGCTTTA TGAACATCGC TTGAGTTGCC TTTACAGTAG AAAATTTAAG 2160 

TTTTACACTT TGTGTGAATG ATACGTTTTG TATTGAATTA ATTATAGAAA GGTACGTTGA 2220 

AGATGTTTTC AATTGGAAGT GCAATTCTTC ATTTTGTCAT TGGTGGTATC GCTGTTGCAT 22 80 

10 

TAGCTTCAAT TATTGCTGAT AAGGTAGGTG GTAAGTTAGG AGGTATTATA GCTACTATGC 234 0 

CGGCAGTCTT TCTTGCGGCT ATTATCGCAT TAGCTTTAGA TCATCGTGGT ACGCAATTAG 24 00 

TGGAGATGTC GATGAATCTT AGTACTGGAG CAATTGTCGG TATTCTGTCT TGTATATTAA 24 60 

15 

CTGTATTTTT GACATCTCTC TACATTAAGC ATAAAGGTTA TCGGAAAGGC GCAATATTCA 252 0 

CAGTTGTTTG TTGGTTTGTC ATTTCCCTCG CAATATTCAG TATTAGACAT TTATAGTTTG 2580 

2Q GAAAATGCGT GATAATTAGT TGTATTCAGT TATTAAGTAA TAAATTATTG GAGGCAGAAC 2640 

ATCATGAAAT TAACATTAAT GAAATTTTTT GTGGGGGGAT TTGCAGTATT ATTAAGTTAT 2700 

ATTGTATCTG TAACACTACC TTGGAAAGAA TTTGGCGGTA TATTTGCaAC GTTTCCGGCA 276 0 

25 GTATTTTTAG TGTCTATGTT TATTACAGGT ATGCAATATG GTGATAAAGT CGCTGTGCAT 282 0 

GTAAGTCGTG GCGCAGTGTT TGGTATGACA GGGGTATTAG TTTGTATTTT AGTTACATGG 28 80 

ATGATGTTAC ATATGACGCA CATGTGGTTG ATTAGCATTG TTGTTGGTTT CCTAAGCTGG 2940 

30 

TT CAT CAGTG CAGTATGTAT TTTTGAAGCG GTAGAATTTA TAGCACAAAA AAGATTAGAA 3 000 

AAGCATAGTT GGAAAGCTGG AAAATCGAAT AGTAAATAGT GTGAACGTAA TCTCTTAACT 306 0 

AGGACTAACT TTGCAAGCAT TGAATAGCAT GGAAAAGTTG CATCATTAAT AAGTGAAATT 312 0 

35 

CAAGTTGGCA TTGAGAAAAT TACAAGCGCG TAATCATACa GGTCTGTCTT AAGGGAGTCT 3180 

TCGAACCCCG ATGTTGTCGT ATGTCAAAAC ATTTAGTCAA TCATAAAGGT GACTTGATTT 324 0 

40 AACTTTATCT GATAGTCTGA TTGTAATGAT TGTACTAATT GACTGGAGGC GTATGTAATT 33 00 

GAATCTGAGT AAACAAATTA AAAAGTATAG GGAACGAGAT GGTTATTCAC AAGAATATCT 3 3 60 

TGCTGAAAAG TTATATGTAT CTAGGCAGAG TATTTCTAAT TGGGAAAATG ACAAAAGCTT 34 2 0 

^ AC CAG AC AT A CATAACTTAT TAATGAyGTG TGAATTGTTC AATGTAACTT TAGATGATTT 34 80 

AGTAAAAGGG ACCATTCCAT TTGTACCTGA TATTAAAGCG CAACGAAGTC TTAACTTATG 3 54 0 

GACATATGTG ATGCTTATTT TCATGACATT AGCTGCAATT TTAATGGGAC CTTTAGTTGT 3 60 0 

50 

TTATTGGAAT TGGACTTGGG GTGTAACGGT GGCAATCATT TTGGGAATAG GTTTTTATGC 3 66 0 
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TGGTTAGCAG 


AACAATTAAA 


AGAACATAAT 


ATTCAATTAA 


CTGAGACTCA 


AAAACAACAG 


240 




TTTCAAACAT 


ATTATCGTTT 


ACTTGTTGAA 


TGGAATGAAA 


AGATGAATTT 


GACAAGTATT 


300 


5 


ACAGATGAAC 


ACGATGTATA 


TTTGAAACAT 


TTTTATGATT 


CCATTGCACC 


TAGTTTTTAT 


360 




TTTGATTTTA 


ATCAGCCTAT 


AAGTATATGT 


GATGTAGGCG 


CTGGAGCTGG 


TTTTCCAAGT 


420 


10 


ATTCCGTTAA 


AAATAATGTT 


TCCGCAGTTA 


AAAGTGACGA 


TTGTTGATTC 


ATTAAATAAG 


480 


CGTATTCAAT 


TTTTAAACCA 


TTTAGCGTCA 


GAATTACAAT 


TACAGGATGT 


CAGCTTTATA 


540 




CACGATAGAG 


CAGAAACATT 


TGGTAAGGGT 


GTCTACAGGG 


AGTCTTATGA 


TGTTGTTACT 


600 


15 


GCAAGAGCAg TAGCTAGATT 


ATCCGTGTTA 


AGTGAATTGT 


GTTTACCGCT 


AGTTAAAAAA 


660 




GGTGGACAGT 


TTGTTGCATT AAAATCTTCA AAAGGTGAAG AAGAATTAGA AGAAGCAAAA 


720 




TTTGCAATTA 


GTGTGTTAGG 


TGGTAATGTT 


ACAGAAACAC 






780 


20 


GATGCTGGAG 


AGCGCCAGAT 


GTTCATTATT 


GATAAAAAAA 


GACAGACGCC 


GAAAAAGTAT 


840 




CCAAGAAAAC 


CAGGGACGCC 


TAATAAGACT 


CCTTTACTTG 


AAAAATAATG 


CATAATCCTT 


900 




TACAACTAAC 


ATAAAAGGAG 


CGAATGGATA 


ATGAAAAAAC 


CTTTTTCAAA 


ATTATTTGGT 


960 


25 


TTGAAAAACA 


AAGATGACAT 


CATTGGACAT 


ATTGAAGAAG 


ATCGCAATAG 


TAATGTTGAA 


1020 




TCCATTCAAA 


TTGAACGTAT 


CGTTCCCAAC 


CGTTATCAAC 


CAAGACAGGT 


GTTTGAACCA 


1080 


30 


AATAAAATTA 


AAGAACTTGC 


TGAATCAATA 


CATGAACATG 


GTTTACTACA 


ACCTATTGTT 


1140 


GTAAGACCGA 


TTGAAGAAGA 


TATGTTTGAA 


ATTATTGCTG 


GAGAGCGCCG 


ATTTAGAGCA 


1200 




ATACAATCAC 


TAAATTTACC 


TCAAGCAGAC 


GTTATTATTC 


GTGATATGGA 


TGATGAAGAG 


1260 


35 


ACGGCTGTTG 


TTGCATTAAT 


TGAGAATATT 


CAAAGAGAAA 


ATTTGTCTGT 


TGTTGAAGAA 


1320 




GCGGAAGCCT 


ATAAGAAATT 


ATTGGAAATT 


GGTGATACAA 


CGCAAAGTGA 


ATTGGCAAAA 


1380 




AGTTTAGGTA AAAGTCAAAG 


CTTTATTGCA 


AATAAGTTGC 


GTTTATTGAA 


GTTGGCGCCG 


1440 


40 


AAAGTACTAC 


TTCGCTTAAG 


AGAAGGTAAA ATTACTGAAC GTCATGCGAG AgcGGtATTA 


1500 




TCATTGTCTG 


ATAGCGAACA 


AGAAGCGTTG 


ATTGAGCAAG 


TCATTGCACA 


AAAGCTAAAT 


1560 




GTGAAcAGAc 


TGAAGATAGA 


GTACGCCAAA 


AAACGGGGCC 


CGAAAAAGTC 


AAAGCACAAA 


1620 


45 


ACCTTCGCTT 


TGCACAAGAT 


GTCACTCAAG 


CACGAGATGA GGTAGGCAAA AGTATCCAAG 


1680 




CGATTCAACA 


AACAGGATTA 


CATGTTGAGC 


ATAAAGACAA 


AGATCATGAA 


GATTATTATG 


1740 


50 


AAATAAAAAT 


TCGAATATAT 


AAACGTTaGT 


AGTAGGATGT 


CGTATACATG 


ATGACTAACA 


1800 


CATAAAAGAC 


AAAGCTAAGA 


TCATAACAGC 


TTTGTCTTTT 


TTTTTTGTTT 


TACGTGAAAC 


1860 




ATAAAAATTT 


ATATTTATAT 


GTTGATCAGG 


CTGGTACATA 


AATCAATGTT 


CTATGCTCTA 


1920 
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TGAACCTTAT AGCGTTCACT TAGCAGGAGA ACTTAAAAAC TTTAATATTG AAGATCATAT 43 80 

CGACAAAAAA GAAGCGCGTC GTATGGATAG ATTTACTCAA TATGCAATTG TAGCAGCTAG 444 0 

AGAGGCTGTT AAAGATGCGC AATTAGATAT CAATGAAAAT ACTGCAGATC GAATCGGTGT 4 500 

ATGGATTGGT TCTGGTATCG GTGGTATGGA AA C ATTTG AA ATTGCACATA AACAATTAAT 4 56 0 

GGATAAAGGC CCAAGACGTG TGAGTCCATT TTTCGTACCA ATGTTAATTC CTGATATGGC 4 62 0 

AACTGGGCAA GTATCAATTG ACTTAGGTGC AAAAGGACCA AATGGTGCAA CAGTTACAGC 4 6 80 

ATGTGCAACA GGTACAAATT CAATCGGAGA AGCATTTAAA ATTGTGCAAC GCGGTGATGC 4 74 0 

AGATGCAATG ATTACTGGTG GTACAGAAGC ACCAATTACT CAT ATGG CAA TTGCTGGTTT 4 800 

CAGTGCAAGT CGAGCGCTTT CTACAAATGA TGACATTGAA ACAGCATGTC GTCCATTCCA 4 860 

AGAAGGTAGA GATGGTTTTG TTATGGGTGA AGGTGCTGGT ATTTT AG T AA TTGAATCTTT 4 92 0 

AGAATCAGCA CAAG CTCG AG GTGCCAATAT TTAXGCTGAG ATAGTTGGCT ATGGTACTAC 4 33 0 

AGGTGATGCT TATCATATTA CAGCGCCAGC TCCAGAAGGT GAAGGTGGTT CTAGAGCAAT 504 0 

GCAAGCAGCT ATGGATGATG CTGGTATTGA ACCTAAAGAT GTACAATACT TAAATGCCCA 510 0 

TGGTACAAGT ACTCCTGTTG GTGACTTAAA TGAAGTTAAA GCTATTAAAA ATACATTTGG 5160 

TGAAGCAGCT AAACACTTAA AAGTTAGCTC AACAAAATCA ATGACTGGTC ACTTACTTGG 5220 

TGCAACAGGT GGAATTGAAG CAATCTTCTC AGCGCTTTCA ATTAAAGACT CTAAAGTCGC 52 80 

ACCGACAATT CATGCGGTAA CACCAGATCC AGAATGTGAT TTGGATATTG TTCCAAATGA 534 0 

AGCGCAAGAC CTTGATATTA CTTATGCAAT GAGTAATAGC TTAGGATTCG GTGGACATAA 5400 

CGCAGTATTA GT ATT CAAG A AATTTGAAGC ATAACTATAA nAATCTTCAG TAACGTTGTT 5460 

TTAGTTACTG AAGATTTTTT CaGTTTCTTT ATACTAAGAT GAGCGACAcA CAATCGTCAT 5520 

AATAAAATAT GAATATTTAT TAATAATAA 5549 
40 (2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 

^ (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 189: 

50 

AG ATT AT AG T AAGATTGATA GTTTGGCGAC TGaAGCgCGa GaAAAATTAT CAGaAGTAAA 6 0 
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45 



CAACATGAAG 


CATTTAATTT 


TACAGTGATG 


ATTATAAAAT 


AATTGCCTTG 


ATACAAAGAT 


2580 


TACTCGTAAA 


TGACATCTTT 


GTATTAAGGC 


TTTTTCTAAA 


TTTAAAAGTG 


ATGGGTTAGA 


2640 


GGTCATTGAG 


CTTTAAAATA 


TTCAAAATAC 


AAAACATTAA 


TGGCCAAAAA 


TAAAAGCCGC 


2700 


CTTTATCTGG 


GCAGCTTCAA 


TAATAAGAAA 


GACATATTTC 


ATTTTATACT 


AAATAGTTAT 


2760 


TGTGATGAAT 


CTTTCGG CGG 


TTTAATTACT 


GCAGCAAAAA 


TTGCTGTGAA 


AATCGTGAAC 


2820 


AATACTGCCA 


TGATAATTGG 


ATTCACTACA 


TTTAAGCTGT 


CTCCACCTAC 


TAGGCTATTA 


2880 


AGTACAAAGT 


TAACCATTTG 


CATTAATAAT 


AATGCCCAAA 


AGAATGTTAC 


GAGGTGTTTC 


2 940 


ATGTCATTCT 


ACCTCCACTT 


TAATTATATA 


TATTTTATTT 


TAAGTGAAAG 


TTAGAAATTT 


3000 


GTATAGTAAC 


ATCTCATATA 


TTTTGACCAT 


ATTATACAGT 


TTAAATAAAT 


GATTTTATCT 


3060 


GAATGGCTAT 


TCTAAATTAA 


GCGCATTAAA 


ACCAATTTCA 


TACTGAAATT 


TGACGATAAT 


3120 


AAAGCATTAA 


AATTTTATTA 


ACTAGTCAAT 


ATTCCTACCT 


CTGACTTGAG 


TTTAAAAAGT 


3180 


AATCTATGTT 


AAATTAATAC 


CTGGTATTAA 


AAATTTTATT 


AAGAAGGTGT 


TCAACTATGA 


3240 


ACGTGGGTAT 


TAAAGGTTTT 


GGTGCATATG 


CGCCAGAAAA 


GATTATTGAC 


AATGCCTATT 


3300 


TTGAGCAATT 


TTTAGATACA 


TCTGATGAAT 


GGATTTCTAA 


GATGACTGGA 


ATTAAAGAAA 


3360 


GACATTGGGC 


AGATGATGAT 


CAAGATACTT 


CAGATTTAGC 


ATATGAAGCA 


AGTTTAAAAG 


3420 


CAATCGCTGA 


CGCTGGTATT 


CAGCCCGAAG 


ATATAGATAT 


GATAATTGTT 


GCCACAGCAa 


3480 


CTGGaGATAT 


GCCATTTCCA 


ACTGTCGCAA 


ATATGTTGCA 


AGAACGTTTA 


GGGACGGGCA 


3540 


AAGTTGCCTC 


TATGGATCAA 


CTTGCAGCAT 


GTTCTGGATT 


TATGTATTCA 


ATGATTACAG 


36 00 


CTAAACAATA 


TGTTCAATCT 


GG AG ATT AT C 


ATAACATTTT 


AGTTGTCGGT 


GCAGATAAAT 


3 660 


TATCTAAAAT 


AACAGATTTA 


ACTGACCGTT 


CTACTGCAGT 


TCTATTTGGA 


GATGGTGCAG 


3720 


GTGCGGTTAT 


CATCGGTGAA 


GTTTCAGATG 


GCAGAGGTAT 


TATAAGTTAT 


GAAATGGGTT 


3780 


CTfiATGGCAC 


AGGTGGTAAA 


CATTTATATT 


TAGATAAAGA 


TACTGGTAAA 


WRA TV TV TV Tl ffH/^t 14 

CTGAAAATGA 


3840 


ATGGTCGAGA 


AGTATTTAAA 


TTTGCTGTTA 


GAATTATGGG 


TGATGCATCA 


ACACGTGTAG 


T rt A A 

3 900 


TTGAAAAAGC 


GAATTTAACA 


TCAGATGATA 


TAGATTTATT 


TATTCCTCAT 


CAAGCTAATA 


U Q C C\ 


TTAGAATTAT 


GGAATCAGCT 


AGAGAACGCT 


TAGGTATTTC 


AAAAGACAAA 


ATGAGTGTTT 


4020 


CTGTAAATAA 


ATATGGAAAT 


ACTTCAGCTG 


CGTCAATACC 


TTTAAGTATC 


GATCAAGAAT 


a f\ a 

4080 


TAAAAAATGG 


TAAAATCAAA 


GATGATGATA 


CAATTGTTCT 


TGTCGGATTC 


GGTGGCGGCC 


4140 


TAACTTGGGG 


CGCAATGACA 


ATAAAATGGG 


GAAAATAGGA 


GGATAACGAA 


TGAGTCAAAA 


4200 


TAAAAGAGTA 


GTTATTACAG 


GTATGGGAGC 


CCTTTCTCCA 


ATCGGTAATG 


ATGTCAAAAC 


4260 
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TACCAGAAGA GAAGTTTTGG ATATTGCATT ATTCTTTTTT AAGAGCGTTA ATATTAGGAT 78 0 

TTTTTATAGG ATTTATTCCA ATTAATATAT ATAATGATTT AAAACTGAAT AATTTACAAT 84 0 

5 

TTATTTCAGT ATTAACTTGT TACACAGTTA TGGGTTTTGT ATCTTCACGT TATTTAACTA 900 

AATACTTGAA TTATAAGTTT GTGTCAGAAA TTTGTTTAGT AATATTTTTA ATAATATATA 96 0 

CATATCAAAG TTTCATAGCA GTTACTATTT CTATGATATT TTTAGGTATT TCTTCAGGGT 102 0 

w 

TAACTCGTCC ACAAACTATA AATAAACTTT CTAGCAGTAG TAACTTAAGA GTGATGCTTA 108 0 

ATTATGCAGA AACGTTATAT TTTATTTTTA ATATCGCATT TTTACTTATG GGTGGTTACT 114 0 

15 TATATACAAT AGGAACTATT CAATACTTAA TATTATTTAT TTCGTTATTA ATTTTTATAT 1200 

ATTTAATAAT AATATTTyAT TTTACAAGGA GAGAGCAACA TGAAAATAAA AACTGAATTT 126 0 

AAAGGGAACA AT AT AC CAT A TGAATACGCA GCAGGTGCAG ATGTGAGTGA TTCTATTAAC 1320 

20 GGGAATCCAA TTAAGTCATT TCCATTTGAA GTAAT7GAAT TACCGGAAGG gACTAAATAT 138 0 

CTTGCTTGGT CTTTAATTGA CTATGATGCA ATTCCTGTAT GTGGCTTTGC TTGGATT CAT 1440 

TGGAGTGTAG CTAATGTAAG TGTTAGTGGC AATTCAATTT CTATAAAAGC AG ATTT AT CA 1500 

25 

AGAACAAAGG GCGACTATGT ACAAGGTAAA AATAGCTTTA CTAGTGGGTT GTTGGCTGAA 1560 

GATTTTTCAG AAATAGAAAA TCACTATGTA GGACCTACAC CACCTGATCA AGAT CATCAA 1620 

TATGAATTAA CAGTTTATGC GTTAGATCAT TCTTTAAATT TGAAGAATGG GTTCTACTTG 1680 

30 

AATGAATTTT TAAAAGAAGT AAATCAACAT AAAATTGATC AAACAAGTAT TAAC CTTATA 174 0 

GGAAGAAAAA TTTAATACTA AATATCTCAT CAATATAAAA TTGTTCAATT AAAAGTACAA 1800 

35 AGAAACAAAG GTTTTAATTT ATATATTAGG TACGGCGTTC GCTATAATGC AAAGAAGTAA 1860 

TTAAATTTAA GAAATGTAAA CTTAGTTATT GTAATGTGAA TTTATTTGAA AAAATAGAAA 1920 

GTATTAACAA TTATAGCTTT TACATTAATT AAAATTTATT TTTAAAAACA AGTAAACAAT 1980 

40 TTACATACTT ATAATTTTTG AAAATTTTCA ATTTGTGTTA TATTGATTTT GTAAGATACT 204 0 

TTAACTCACA AAGGAGAGAG AGTATATGAA ATTAAAATCA TTTATAACTG TAACTTTGGC 210 0 

ACTGGGCATG ATCGCAACGA CTGGCGCTAC TGTGGCAGGT AATGAGGTAT CTGCAGCAGA 2160 

AAAGGACAAA CTACCGGCAA CTCAAAAAGC TAAAGAAATG CAAAATGTTC CATATACAAT 22 20 

TGCAGTAGAT GGCATTATGG CTTTCAATCA ATCTTACTTA AATTTACCAA AAGATAGCCA 22 80 

ATTATCATAT TTAGATTTAG GAAATAAAGT TAAAGCTTTG TTATATGATG AACGCGGTGT 234 0 

50 

AACACCTGAG AAGATTCGAA ATGCAAAATC TGCCGTTTAC ACGATTACTT GGAAAGATGG 24 00 
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ATGTCCGTAA 


CTGATGTAGG 


TACTGTATTA 


CAAATTGGTG 


ATGGTATTGC 


ATTAATTCAC 


540 


GGATTAAATG 


ACGTTATGGC 


TGGTGAGCTA 


GTAGAATTCC 


ATAACGGCGT 


ACTTGGTTTA 


600 


GCCCAAAACC 


TTGAAGAGTC 


AAACGTGGGT 


GTGGTTATTT 


TAGGACCATA 


CACAGGTATT 


660 


ACTGAAGGTG 


ACGAAGTTAA 


ACGTACTGGT 


CGTATCATGG 


AAGTACCAGT 


AGGTGAAGAA 


720 


CTAATCGGAA 


GAGTTGTTAA 


TCCATTAGGA 


CAACCTATTG 


ATGGAcAAGG 


ACCGATTAAC 


780 


ACAACTAAAA 


CACGTCCaGT 


AGAGAAAAAA 


GCTACTGGTG 


TAATGGATCg 


TAAATCAGTA 


840 


GATGAGCCAT 


TACAAACAGG 


TATCaAAGCA 


ATTGATGCTT 


TAGTACCAAT 


TGGTAGAGGT 


900 


CAACGTGAGT 


TAATCATCGG 


TGACCGTCAA ACAGGTAAAA 


CAACAATTGC 


AATTGACACA 


960 


ATTTTGAACC 


AAAAAGATCA 


AGGTACGATT 


TGTATCTATG 


TTGCTATTGG 


TCAAAAAGAT 


1020 


TCAACAGTAA GAGCAAATGT 


TGAAAAGTTA 


AGACAAGCAG 


GCGCTTTAGA 


CTACACTATT 


1080 


GTTGTAGCAG 


CATCAGCTTC 


TGAACCTTCT 


CCATTATTAT 


ATATTGCACC 


ATATTCAGGT 


1140 


GTAACAATGG 


GTGAAGAATT 


CATGTTTAAC 


GGTAAACATG 


TTTTAATCGT 


TTA 


1193 



(2) INFORMATION FOR SEQ ID NO: 188: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 



TGCTAAGAAG 


TCAAAATAAA CTAACTATnA 


AACATCTAGT 


ACGATTATTA 


AAGTGACAGA 


60 


TnATAAAATT 


GAATTATTnA GAGAAGGAGA 


TATAAAGTTT 


GAAGAAATAA 


AAGAAAGACT 


120 


AGGTACAGGT ATTATTTATG AATAAGTTAA 


TACTTGGGAT 


TTATTTATAC 


CGAATTTTTT 


180 


CACGAGCATA 


CTTTTATTTA CCGTTTTTAT 


TAATTTACTT 


TTTGATTCAA 


GGTTATTCCA 


240 


TAATACAATT 


AGAAATATTA ATGGCGTCTT 


ATGGCATTGC 


AGCATTTTTA 


TTCTCTCTAT 


300 


ACAAAGAGAA GTGTTTTAAA ATTTGTAACT 


TAAAAGATTC 


TAATAAATTA 


GTTGTTAGTG 


360 


AAATATTCAA 


AATCATCGGT TTATTGTTGT 


TATTATATCA 


AAATCAATAT 


TTAATTTTAG 


420 


TAGTGGCACA 


AATATTATTA GGGTTAAGTT 


ACTCAATGAT 


GGCGGGTGTT 


GATACCGCAA 


480 


TAATTAAAAG 


AAATATAACA AATGAGAAAT 


ACGTACAAAA 


TAAGTCAAAT 


AGCTATATGT 


540 


TCCTATCATT 


ATTAATTTCA GGGATTATAG GTAGTTATCT TTATGGAATA AATATTAAAT 


600 


GGCCTATAAT 


AATGACTGGT ATATTTTCAA 


TTCTAACAAT 


TATAATTATT 


CGATGCACAT 


660 
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TATTAAAAGT GGTCTGGTTC CCTCCATGGT TGCAACAATT GTTGAACCTG GAACATTTGT 6 00 0 

TGAACGTTCT TCTAAAAACA TTTCAGATGG ATTATTTACA CCTTTTAATC CACTTTGCTT 6 06 0 

CATTTCGAAG ATTCcCATTT CATTCGTTGA ACCAAAACGG TTTTTAACAG CTCGCAAAAT 612 0 

TCGATATGCG TGGTGTTCAT CGCCTTCAAA ATAAAGCACA GTATCaACCA TGTGTTCTAG 618 0 

CAATCTTGGG cCCAGCAATT TGACCTTCTT TCGTTACATG ACCCACTATA AAAGTTGCaA 6 24 0 

TGTTCATTTG TTTAGCAATA TTCATTAAAC TTTGTGTACT TTCACGAACT TGTGAAACAG 6 300 

AACCTGGCGC AGAGCTGATT TCAGGATGAT ATATTGTTTG AATCGAATCC ACTACTAATA 63 6 0 

AATCAGGTTG TTCTTCTTTT ACTGTTTGAT AAATAACTTC AAGATCTGTT TCAGCTAATA 642 0 

CTTGCAATTC ACTTGAATCT TCATCTAATC GCTCTGCACG TAATTTAGTC TGACTAAGCG 64 80 

ATTCTTCTCC AGTAATATAT AGTACTTTTT TCTTTTGAGA TAACGATGCA CAAATTTGTA 6 54 0 

iiR<-<rn»R(^r>m m/-» »\ »mm i\ /-*/~>-* rr»* /~» / ~»m/'-«/-» » rr» r*r*r>r^ » r*r* * » it* h * /** T* n /-*rp » * r* r~* Jv T"«/"">r , «/-»f , ~» r P/'"' T\ c C r\ r\ 

CAATACCTCC ACCTAATACA CGGTTGAATT CTGCTGAATC TGTTAACACT CTCGGCGTTG 666 0 

TTTCATGTTT AATACTATTT AATTTTTGTA CTTTACCTGC TAATTCCTTG GTTTTAACTC 6720 

CATGTTTAGG ATTGGCTGCT TTTTCAACAA TTTCCTCCAT TTGATTCCAA GCGCCACAAT 6780 

TAGGACATTT CCCCATCCAT TTAGGAGATT GATAACCACA AGCCATACAT TCAAAAATCA 6 84 0 

CTTTTTTCTT GGCCArAATT GCAcCTCCAC TTTCTT 6 876 
(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



40 Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

CAACTCAAAC AGCAGAACAA CGTCGTGAGT TGATTAATGG TGTATTTACT GACATTAATC 6 0 

CCATACATTA AAAATATGAT GTACGTGTTA GCAGATAATA GACATATCTC ATTAATAGCT 12 0 

GACGTATTCA AGGCGTTCCA AAGCTTATAT AACGGACACT ACAATCAAGA TTTTGCAACA 180 

ATTGAGTCAA CATATGAATT GAGTCAAGAA GAGTTAGATA AGATTGTCAA ACTAGTAACT 24 0 

CAACAAACGA AGTTATCTAA AGTTATTGTA GATACAAAAA TTAAT CCAGA TTTAATTGGT 3 00 

so 

GGATTTAGAG TTAAAGTCGG CACAACTGTA TTAGATGGTA GTGTTAGAAA TGATCTTGTC 3 60 
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TTAAACTTCA AATTAACTAT TCAAATACGT TAAAATTGAT TCTAATTTTG TATGTCTTGA 4 2 00 

TTGCTATAAG AATAACTTTA TTAATATCTA AAATTTAACA CTTAATGAAC TTGTTTCAAT 426 0 

GATATATTAG CACTATTTGT ATTTTTTGAT AACTAATATG TTTTGCATTT ATTTATAGTT 4 32 0 

ATACTTCAAA TTACAAACTt CGCCATTTCA TATACCTTTT AATATCTATT TTGTTTTCGT 4 3 BO 

CAACTACAGT TTTTATAATG ATACTGTATC TTCGATTTTT TTAGCAAAAA CAATTCTTCC 444 0 

TGAAGATGTT TGCAATAAGC TGACTACTTC TAAATTGACA TGACTGCCAA TAAGATTTTT 4 500 

AGCATTATCA ACAACTACCA TCGTACCATC ATCTAGATAT CCTACTGCCT GACCAGGCtC 4560 

CTTACCCATT TTTGTCAGTA AAATATGCAG TTGATCACCT TGATGTACAT TAGGTTTGAT 4620 

TGCTTCTGAT AAATCATTAA CATTTAATGC TTTGATACCA TGTACATGAC AAACTTTATT 4 6 30 

TAGGTTGAAA TCTGTCGTTA TAATACTTGC ATGATATTGT TTTGCTAATT TTAATAACAT 4 74 0 

20 CGTATCAATA TCACTATGTG TTTTAGTTGG ATGTATAACC TTTGTAGGAT AGTCTAAATC 4 800 

ATACAATTCA TTTAAAATAT CTAAGCCTCT TTTACCCTTT TCaCGTTTAA CACTGTCATT 4 860 

TGAATCTGCA ACAATTTGTA ATTCATTAAT AACACCTTGT GGAATTAAAA TATTGCCATC 4 92 0 

GATAAAACCG CAACGAATGA CTTCTAAAAT ACGACCATCA ATAATTGCGC TTGTGTCGAT 4 980 

AATTTTTGGC GTAgcaCTTT TaGTATGTTG TGACATGGAA CGCGCTATAT TCTCAGGTAA 504 0 

AAACATTAAC ATTTCATCTC GTTTTTTAAG GCCAAATTGG AAACCGAAAT AACATAGTAA 5100 

TATCGTAATT ATGACAGGAA TGAAATGATT AAAAATAGAG TTGCCAATTG ATTCTAATAT 5160 

AAACGACACC ATAACAGAAA TAAGTAATCC GATTATTAAA CCTATTGTTG CGAATAGTAT 522 0 

TTCAACAGCA CTTCTACGCA TAATAAAATG TTCTAAACCT TTTATAGCGT TAGTAACTCG 52 8 0 

TCTAATAAAT ACACCAAAAA TTAAGAACAT AAAAATACTA CCGATAATGC CATCTACATA 534 0 

GTGA3TTTTT AAAAAGCTGG AGTTTTGTAA TCCAAGATCA TTTGCAATTT CAGGAATAAT 54 00 

40 AATTATTCCT AATGCGCTCC CAATAATTAA GTAAATAATA ATAACCATTA GTTTAACGAT 54 60 

ATTCACACAA TGTCCTCCTT TCTTGATGTT TTATGAATGA AGAGCAAATG ACAATACTTC 5520 

ATGTACAGTA GTTACACCTA TTACTTGTAT ACCTTCAGGA TATGTCCATC CGCCTATATT 5580 

ATTTTTAGGA ATAATTACAC GTTTGAAACC TAGTTTTGCA GCCTCTTGCA CGCGTTGTTC 5640 

TATCCGAGAT ACACGACGTA CCTCACCCGT TAAACCAACT TCTCCAATAT AGCAATCTAA 5700 

TCCGTCGACA GCTTTATCTT TAAAGCTAGA TGCAGTTGCT ACAATTACAC TTAAATCAAC 5760 

TGCTGGCTCC GTTAACTTTA CACCGCCAGC TACTTTGATA TAAGCATCTT GTTGTTGTAA 5820 

TAGATAATTT TCTTTCTTTT CCAAAACAGC CATCAACAAA CTTAATCGAT TATGATCAAT 5880 
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ATAAAATTCA AGTATATACT ACCTTGATCT TGTCTATTTC ATTACTTATA TTGTTTTAAA 24 00 

CGGTTTAGCA CTTTTTCTTT ACCAAGTACT TCAATTGTAT TTGGTAATTC AGGACCATGC 246 0 

ATTTGGCCTG TTACAGCAAC ACGAATAGGC ATAAATAATT GCTTGCCTTT TATTCCTGTT 2 52 0 

TCTTTTTGAA CTTCTTTAAT TGTCTTTTTA ATTTCAGCCG CTTCAAATGG TTCAAGTGCT 2 5 80 

TCTAATTTAC TGAATAAGTG CGTCATTAAC TCTGGTACTT GCTCTCCATT AATCACTTGT 264 0 

TGTTCTTCTT CACCAAGAGC TGGCATTTCT TTAAAGAACA TTTCTGATAA AGGTACAATT 2 7 00 

TCACCGGCAT AACTCATTTC TTTTTGATAA AGCGCAATTA ATTTGCGTCC CCAAGATAAA 2 760 

TCCTCTTCTG ACGGCACCTC AGGAATCAAA TTTGCTTTAA TTAAATGAGG TAATGCTAAT 282 0 

TGGAATACTG TTTCAGTATC TTTTTGTTTC ATATATTGGT TATTAACCCA TGCTAATTTT 28 8 0 

TGCTTATCGA AAAATGCTGG TGATTTTGAC AAACGCTTTT CATCAAAGAT TTTGATAAAT 2 94 0 

TCTTCTTTAG AAAAGATTTC TTCTTCACCT TCAGGAGACC AACCTAATAA CG CAATAAAA 3000 

TTAAATAACG CTTCAGGTAA ATAACCTAAG TCACGATATT GCTCAATAAA TTGTAAAATT 3060 

TGCCCATCAC GTTTACTTAA CTTTTTACGT TCTTCATTAA CAATTAATGA CATATGACCA 3120 

AAACGAGGTG GCTCCCAGCC AAATGCTTCA TAAATCATAA TTTGTTTAGG CGTGTTTGAA 3180 

ATATGATCAT CACCACGAAT TACATCTGAA ATTTGCATGT AATGATCATC TATAGCTACT 3 24 0 

GCAAAATTGT ACGTTGGAAT GCCATCTTTT TTTACGATAA CCCAGTCACC AATACCATTT 3 3 00 

GAATCAAATG AAATATTTCC TTTTAC CATA TCATCAAA7G AATACGTTTG GTTTTGAGGT 3 3 60 

ACTCGGAAAC GAATTGATGG TTGGCGTCCT TCTGCTTCAA ATTGTTGACG TTGTTCTTCA 34 20 

GTCAAATGCG CATGTTGACC ACCATAGCGA GGCATTTCAC CACGAGCGAT TTGCGCTTCA 34 80 

CGTTCAGCTT CTAATTCTTC TTCTGTCATA TAGCATTTAT ATGCTTTATC TTCTGCTAGT 3 54 0 

AACTGATCTA TTAATGGTTG GTAGATATGT TGACGTTCAG ATTGACGATA TGGTCCGTAG 3 600 

40 CCATTGTCTT TATCTACAGA CTCATCCCAA TCTAATCCTA ACCATTTAAG ATTATCAAAT 3 66 0 

TGTGATGTTT CTCCATCTTC TAAATTACGT TTTTTATCAG TAT CTTCAAT TCGAATCACA 3 72 0 

AAATCTCCGT TGTAATGTTT AGCATACAAG TAATTGAATA ATGCTGTTCT TGCATTACCA 3 78 0 

ATATGAAGAT ACCCAGTTGG ACTTGGTGCA TATCTTACTC TTATACGATC GCTCATTTTT 3 84C 

TTCACTCCTA AATTAAATAT CAGATTTTCA AGTTAGTTCA TATAAATTGT TCATTTGCTA 3 90 0 

TCTTCGACCG TCATAACAAA TGTCTAACTC GTCTTATTGT TAAAACGAAA CAATGCTTTT 3 960 

TAACATGACC TTAAAATAAT TTCATTGTTT AATCATAACA TAATTCCCTG GGTAATATGC 4 02 0 
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ATTATCAATA 


TTAATGAAAC 


CATTATGCAT 


CCAATAATTA 


GCAAATGGCG 


CATGATTATG 


600 


TGCTTCTGAT 


TGTGCTATTT 


CATTTTCATG 


ATGTGGAAAT 


TGTAAATCTG 


AACCACCCGC 


660 


ATGTATATCA 


ATTGTAGGTC 


CTAGCTCATG 


AAATGCCATT 


ACAGAACATT 


CTATATGCCA 


720 


TCCTGGTCTA 


CCTTCACCAA 


ATGGGCTATC 


CCAACTAATC 


TCGCCAGGTt 


CGCTTTTTTC 


780 


CACAATGTAA 


AATCAAGTGC 


ATCTTCTTTA 


TGCTCTCCTG 


CATCTATACG 


AGCACCCACT 


840 


TTTAAGTCAT 


CTATGGATTG 


ATGACTTAAT 


TTACCATAAC 


CTTCAAATTT 


ACGTGTTCTA 


900 


AAGTAAACAT 


CGCCACCACT 


TTCATATGCA 


TAACCTTGAT 


CCACCAAATC 


TTTAATAAAT 


960 


TGAATAATGT 


CATCCATATG 


GTCCATTACC 


CTTGGATTTG 


AAGTCGCTTT 


TCTAACATTT 


1020 


AACGCACCAA 


CATCTTCATG 


AAAAGCAGCG 


ATATATTTTT 


CTGCAATTTC 


GGGAACAGAC 


1080 


TGATTTAATT 


CTTGAGAACG 


TTTAATTAAT 


TTATCATCTA 


CGTCTGTAAA 


ATTTGATACA 


1140 


TATTCTACAT 


TATATCCTTG 


GTATTCAAAG 


TAACGTCTCA 


CTACGTCATA 


ATTAATTGCw 


1200 


GGTCTTGCGT 


TACCAATATG 


AATGTAGTTA 


TATACAGTAG 


GACCACATAC 


ATACATTTTT 


1260 


ACTTTCCCTG 


GTTCTATAGG 


CTTGAACACT 


TCTTTTTGAC 


GTGTAAGCGT 


ATTATATAAT 


1320 


GTAATCATCT 


TGAATCTCTC 


CATTCCTAGT 


CTTTTCAAGT 


TGTCGTTCTA 


AATGCTTAAT 


1380 


TTGTT CATAA 


ATTGGATCAG 


GTAGATGGCG 


ATGATCAAAT 


GTTTTTCCAA 


CTCGAACACC 


1440 


ATCTTG CTTA 


ACAATATGTC 


CTGGTATACC 


AACAACCGTT 


GAATAACTTG 


GAACTGATTG 


1500 


TAAAACAACT 


GAATTTGCAC 


CAATATTTAC 


ATTTGAATTT 


ATTTTAATAT 


TTCCTAAAAC 


1560 


TTTCGCACCG 


GCTGCTATTA 


AAACATTGTC 


TCCTATATCT 


GGGTGTCTTT 


TCCCTCTTTC 


1620 


TTTCCCTGTC 


CCACCAAGTG 


TCACGCCTTG 


ATAGATTGTC 


ACATTATCAC 


CAATTGTACA 


1680 


TGTTTCTCCT 


ATTACAACGC 


CCATACCATG 


ATCTATAAAT 


AGACGCTTTC 


CAATTTTAGC 


1740 


ACCFGGATGG 


ATTTCTATAC 


CTGTGAAAAA 


TCTTGAAATT 


TGAGATATCG 


CGCGTGCTGC 


1800 


AACATATTTT 


TTTTGGTTGT 


ATAACTTATG 


TGCAATCAAA 


TGACTCCAAA 


CTGCATGTAA 


1860 


ACCTGCATAC 


GTTGTAATGA 


CTTCTAATGT 


TGAACGTGCC 


GCTGGATCCT 


GCTCAAATAC 


1920 


CATTTTTATA 


TCGTCTCTCA 


TTCTTTTTAA 


CAAGATCATT 


TCCTCCTCAA 


TGATTGAACT 


1980 


ACGTAAATAC 


ATAATTGAAG 


TACCTGCGAA 


ATTAAATATC 


AAAAAAGCAC 


CACTAACATA 


2040 


CAAATTGTAT 


TGTTAGAGGC 


GCTTCCGCAC 


GGTTCCACTC 


TGAATTTAGC 


GAATAACATT 


2100 


AATAATATTG 


CGGGCGCTTC 


CAAATTATCA 


AGGAAACTAA 


GTCAACTTAA 


TGCTCATCAC 


2160 


TCTCATTATA 


TATTTAATTC 


ATTTTACGAA 


GGTGCATTCA 


TTAATTTCTA 


CGTTGTACTC 


2220 


ACAGCAACCG 


TACACTCTCT 


GCATCGTATA 


AATTTAATTA 


CTAATCCTTC 


GTITTATATA 


2280 
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CATTTGTTAT TCTGAGTAGC CAATTTGGCA AAGATGAACA AACGTCTGAA CAAACGTATC 6 0 

AAGTTGCAGT CGCATTAGAG TTAATTCATA TGGCAACACT TGTTCATGAT GACGTTATTG 12 0 

ATAAAAGCGA CAAGCGTCGA GGCAAGTTAA CCATATCAAA GAAATGGGAT CAGACAACTG 18 0 

CTATTTTAAC TGGGAATTTT TTATTGGCAT TAGGACTTGA ACACTTAATG GCCGTTAAAG 24 0 

ATAATCGTGT ACATCAATTG ATATCTGAAT CTATCGTTGA TGTTTGTAGA GGGGAACTTT 3 00 

TCCAATTTCA AGACCAATTT AACAGTCAAC AGACAATTAT TAATTATTTA CGACGTATCA 36 0 

ATCGCAAAAC AGCACTGTTA ATTCAAATAT CAACTGAAGT TGGTGCAATT ACTTCTCAAT 42 0 

CTGATAAAGA GACTGTACGA AAATTGAAAA TGATTGGTCA TTATATAGGT ATGAGCTTCC 48 0 

AAATCATTGA TGATGTATTA GACTTCACAA GTACCGAAAA GAAATTAGGT AAGCCGGTCG 54 0 

GAAGTGATTT GCTTAATGGT CATATTACGT TACCGATtTT ATTAGAAATG CGTAAAAATC 60 0 

CAGACTTCAA ATTGAAAATC GAACAGTTAC GTCOTGATAG TGAACGCAAA GAATTTGAAG 66 0 

AATGTATCCA AATCATTAGA AAATCTGACA GCATCGATGA GGCTAAGGCA GTAAGTTCGA 72 0 

AGTATTTAAG TAAAGCyTTG AATTTGATTT CyGaGTTACC aGATGGACaT CCGaGAtCAC 780 

TACyTTTAAG TTTGACGAAA AAAATGGGTT CAAnAAACAC GTAGTATTTA TGnAAAAGTA 84 0 

TTGAAAGCGC TTTACCAACC TGTTAATATA TAATAGTAAT ATAC 884 
(2) INFORMATION FOR SEQ ID NO: 186: 

(A) LENGTH: 6876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

40 AATTTCATCT GCTCGTGCAA AATCTTTGTT TTTCCTTGCT TCATTACGCT CTTCGATTAA 6 0 

TTTTT CAACA TCTTCATCCA ATAATTCATC TGCATTTTTA GATTTTAACG GTACACCTAA 12 0 

AACATCGCTG AAAATTTGAT AAACTGCTTT AAATTTATCA ATTACTTCTG TTGATGTTGT 180 

GTTCTCTAGT ACATATTTAT TCGCAAGTkT TGCTAAATCA TACCAAGCTG TAATTGCATT 24 0 

AGCTGTATTA AAATCATCAT TCATAACTGT TTCAAAACGA TTTAAAATCG CATCAATTTG 3 00 

ATCAATATAT GTCTGTTGAT TTTCAATATT AGTAGCAATT TGTGCGCGCT CTTCAATTAA 3 60 

50 

TTGATAACTA TTGCGAATAC GCTCTAGTcC aCTACGTGCT GATTCTACCA ATT CT AG ATT 42 0 
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TTATACGCAA AAAATTCTCC ATGTTATATA TGTCAATATA AAAATGTGAA TCGTCTACAC 4 8 00 

TTAATTGGAT AAATGGCTAC TGAAAAAGAA CTTTTCATTT TTGTTACGTC ACTAAGTGGG 4 860 

TGTAGTTATA AAGAGATGAG CCGAGTTTTG ATATTTTCAT TAGAATCAAT ATGCCTATTA 4 920 

ACACAATCAG CAATAGTTGA CGAGACGGAA ATAAAAGAAG TCGTAGTTAA GAAATGCATT 4 980 

TCACAACATA CCATTGTAGC CATTTTTATT GTTTTGGATG ATAAACTCTT TTTGGAATTT 5040 

TTAGTTTTTA TAATTTGCAA CTACACTACT TCTTTTACTA ATATTAATGT CTAAGTAATC 5100 

GATAAAAAAT TTTCCATTGA ATAAATGAGA AGTTAAAAAC TTTACTTAAC CTTTCycATT 5160 

GCATTTTCCT ATTCACGATT TTAAGAACCC AACATACTAC AAACGAATTT TAAAAGGCGA 5220 

GAGTAAAGCT TACTTGTTTA TTATACATAT TTAAAATCCA AGAGTCAGAA CAGACTACTC 5280 

C T CTTT AT AA CTATAAAAAA TAGCTATGAA AAAATCTATC GTCATAGATT CCTTCATAGC 534 0 

20 TAATCTTAGT ATGTTTATTT TTATTTTAGG ATGCTATTTA TCAACTCAAC ATATAACTCA 54 00 

CTATTTTTAT AACCTTCTAA TATATCATTA ACTTGT CTAA TAGGTATTTC TGGTACTTCT 5460 

CTAATGTTTT CCAATTTTGT TTTAAATTGT TTTTTTGTTA TTTGCTCTTT ATTTGTAGCC 5520 

AATTGGAACA AGTAAGAATC TAGCATATTA ATTTCTTTAT ATGAATACAT ATATCTTAAT 5580 

AACACTAAAT CTCTAGTTTT TAAGTTAGGC GCTAGTTCTT CTTGTAATTG TTCTATTGAT 5640 

TGTyTCATTA ATAACAATCT CATTTCTAAT TCTTCATTAT TCATTTTATC ACACTCTTtT 5700 

TATATTAATG CTTGACCAAC TTGGGAAACC CAAAACCCTA TGCTTCTTGC AGTAGAATCT 5760 

TTAATACCAG TTCCCATCAA TGCTTGTGAA ACTTGACCTT GTACATTTCC CCATGTAGCC 5 82 0 

TCTTCTTGTT TTAATGCATT ATTCAATGCG GGATTTACAA ATTTATCCCA TCTTTTTTTT 5880 

ATGATTTTCC GGCACGGGGA CTGATTTCTT TAACACCATT AAACACAGAT TTTTTATTTT 5 940 

TAATCATAGC TTTATAGTAT CATGTTGGCT AAGCTATAAA TAAGTCAGTT TCTCTAAAAA 6 000 

40 TTAAATAACT GAATGTAAGA CAATCAACAA wCCAAATTTA TACTTCATCT AAACCACTGT 6 060 

GGTCGTCATC TTTTTGCTTT TCTTTTTCTT TCTCTCGTTC TTGTTCTTTT TTGTACTCTT 6120 

CTTCAAATTC TTTTTCTTTC TTTTCTACTT CTTCTCT 6157 

45 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRAND EDNESS : double 

<D) TOPOLOGY: linear 
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AGAAGTGAAG CGTTGGGAAG ACAAATTGCC TGAGTTAGCT GAACATACTT CTAAACGTGA 3 00 0 

ACGTCGTGCT ATTGAGGCAG AACGTGATAC TGATGaATTG AAAAAAGCAG AATATATGAT 3 06 0 

5 TCAACATATT GGTGATGAAT TTGAAGGTAT TGTCAGCTCA GTAGCTAACT TCGGTATGTT 312 0 

CATTGAATTG CCAAATACGA TAGAAGGTAT GGTTCATATT GCGAATATGA CTGATGATTA 3180 

TTACCGTTTT GAAGAGCGTC AAATGGCATT AATTGGTGAG CGTCAAGCTA AAGTATTTAG 3 24 0 

10 

AATTGGTGAC ACAGTTAAGG TTAAAGTGAC GCATGTTGAT GTAGATGAAC GATTAATTGA 3 3 00 

TTTTCAAATT GTAGGTATGC CTTT AC CGAA AAATGATCGA TCACAGCGCC CAGCGCGAGG 3 36 0 

TAAGACAATT CAAGCCAAAA CGCGTGGTAA ATCATTAGAT AAATCAAAAT CTGATGATAA 34 2 0 

15 

GGGTCGTAAG AAAAAAGGTA AGCAACGTAA AGGTAAAAAC CAACGTAATA ATGATAAATC 34 80 

AGGTAATAGT AAGCATAAGC CATTTTATAA AGATAAAAGT GTGAAAAAGA AAGCACGTCG 354 0 

20 TAAGAAAAAA TAAGCAGCAA TGAGGTGAGT ATGAATGGCT AAGAAGAAAT CAC CAGGTAC 3600 

ATTAGCGGAA AATCGTAAGG CAAGACATGA TTATAATATT GAAGATACGA TTGAAGCGGG 3 660 

AATTGTATTG CAAGGCACAG AAATAAAATC AATTCGCCGA GGTAGTGCTA ACCTTAAAGA 3 72 0 

25 TAGTTATGCG CAAGTTAAAA ACGGTGAAAT GTATTTGAAT AATATGCATA TAG CAC CAT A 3 78 0 

CGAAGAAGGG AATCGTTTTA ATCACGATCC TCTTCGTTCT CGAAAATTAT TATTGCACAA 384 0 

GCGTGAAATC ATTAAATTGG GTGATCAAAC ACGTGAGATT GGTTATTCGA TTGTGCCGTT 3900 

30 

AAAGCTTTAT TTGAAGCATG GACATTGTAA AGTATTACTT GGTGTtGCAC GAGGTAAGAA 3 96 0 

AAAATATGAT AAACGTCAAG CTTTGAAAGA AAAAGCAGTC AAACGAGATG TTGCGCGCGA 4 02 0 

TATGAAAGCC CGTTATTAAG CGATTTAGTT GCTTAATCGG GCTATATTTG ATATAGTTAT 40 8 0 

35 

ATGTGCTTTT GTAAATTACA AAAGTATGAT TTGTTTGATT TATTATTTCG GGGACGTTCA 414 0 

TGGATTCGAC AGGGGTCCCC CGAGCTCATT AAGCGTGTCG GAGGGTTGTC TTCGTCATCA 4200 

40 ACACACACAG TTTATAATAA CTGGCAAATC AAACAATAAT TTCGCAGTAG CTGCCTAATC 4 26 0 

GCACTCTGCA TCGCCTAACA GCATTTCCTA TGTGCTGTTA ACGCGATTCA ACCTTAATAG 4 32 0 

GATATGCTAA ACACTGCCGT TTGAAGTCTG TTTAGAAGAA ACTTAATCAA ACT AG CAT CA 43 80 

TGTTGGTTGT TTATCACTTT TCATGATGCG AAACCTATCG ATAAACTACA CACGTAGAAA 4 44 0 

GATGTGTATC AGGACCTTTG GACGCGGGTT CAAATCCCGC CGTCTCCATA TTTGTAGCCT 4 50 0 

ACAGCCTTTG TGGTTGTGGG CTTTTTTATT TTGTGTTTTT CAGGGGATAA TGCATTGCAG 4 56 0 

50 

AATTTGTTGT GAGTATTGAT ATAGCAGTGT TTGTATAGGT GTTTATTTGA TGGAGGAAAG 4 62 0 



EP 

AGAATAAAAA GAGATTTTAA CATTAGAAAG 
AGAAGAGATT ATTAATCAAC CTGAATATGA 
5 ATTAGGTTTA AGCAGTGCCG ACTCGTTTAG 

ACAATCAGGA TTAATCGAAC GTACAAAAAC 
AGGTCAATCA AAATTGATAA AAGGAACGTT 

10 

AAGACCTGAA GATGAGGATA TGGAAGATAT 
CTTGGATGGA GATACTGTTA TTGTAGAAAT 
AATCGAAGGG GAAGTTAAGT CGATTGAGAA 

15 

TAGTGAAGCT AGACATTTTG GCTTTGTTAT 
TTTCATTCCT AAAGGTCAAA GTTTAGGCGC 

20 TACTAAGTAT GCTGATGGTT CAGATAATCC 

TAAAAATGAT CCTGGCGTAG ATATTTTATC 
ATTTCCTGAT GAAGTGTTAC AAGAAGCTGA 

25 AATTAAAGGC CGTCATGATT TACGTGATGA 

TAAAGACTTA GATGACGCAA TTAGTGTTAA 
TGTAAGTATT GCTGATGTCA GCTATTATGT 

30 

ATATGATAGA GCGACAAGTG TATATCTTGT 
ATTAAGTAAT GGTATTTGTT CATTGAATCC 

35 CATGGAAATC GATGCTAGTG GTCGCGTTGT 

TTCTGATTAT CGAATGACGT ATGATGCGGT 
CATTCGCGAA CAATATAATG AAATTACGCC 

40 TCGTTTGATT CAAATGAGAA AACGACGTGG 

AGTATTAGTT AACGAAGACG GTATACCAAC 
TGAACGTCTA ATTGAATCAT TTATGTTAAT 

45 

TAAGTTAGAT GTACCTTTTA TTTACCGAGT 
ACAATTCTTT GATTTTATTA CAAACTTTGG 
5Q TCATCCAACA ACACTTCAAA AGGTTCAAGA 

CATTTCAACA ATGATGTTGC GTTCAATGCA 
ACATTTTGGC TTATCAGCTG AATATTATAC 

55 
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GAGGGGCATA 


ATGAATTTAA 


AGCAATCTAT 


1200 


ACCTATGTCA 


GTGTCAGATT 


TTCAAGATGC 


1260 


AGATTTAATT 


AAGGTGCTTG 


TGGAGTTAGA 


1320 


AGACAGATAC 


CAAAAAAAGC 


ATAGTTATAG 


1380 


AAGTCAAAAT 


AAAAAAGGCT 


TTGCATTCTT 


1440 


ATTTATTCCC 


CCGACGAAAA 


TTAATCGTGC 


1500 


CCATCAATCA 


AAAGGTGAAC 


ATAAAGGTAA 


1560 


GCATTCTGTA 


ACTCAAGTTG 


TTGGTACGTA 


1620 


TCCGGATGAT 


AAACGTATTA 


TGCAAGATAT 


1680 


AGTCGATGGT 


CATAAGGTAC 


TTGTACAAAT 


1740 


AGAAGGACAT 


ATTTCTGCTA 


TTTTAGGACA 


1800 


TATTATCTAT 


CAACATGGCA 


TAGAAATTGA 


1860 


AGCAGTACCT 


GATCATATTG 


AAAATACTGA 


1920 


ATTGACAATC 


ACAATTGATG 


GTGCTGATGC 


1980 


AAAGTTAGCG 


AACGGTAATA 


CGCAATTAAC 


2040 


AACAGAAGGT 


TCTGCATTGG 


ATAAAGAGGC 


2100 


TGACCGTGTA 


ATTCCAATGA 


TTCCACATCG 


2160 


TAATGTTGAT 


CGTTTAACTC 


TAAGCTGTCG 


2220 


TAAACATGAA 


ATTTTTGATA 


GTGTTATACA 


2280 


AAATCAGATT 


ATTACTGAAA 


AGGATCCTAA 


2340 


TATGCTAGAT 


TTAGCACAAG 


ATTTATCTAA 


2400 


TGAAATCGAT 


TTTGATATTA 


GTGAAGCAAA 


2460 


AGATGTTCAA 


TTAAGACAAC 


GTGGCGAGGG 


2520 


TGCAAATGAA 


ACAGTTGCTG 


AACATTTTAG 


2580 


GCATGAGCAA 


CCTAAATCAG 


ATCGCTTAAG 


2640 


CATCATGATT 


AAGGGTACTG 


GCGAAGATAT 


2700 


AGAAGTAGAA 


GGTCGACCTG 


AACAAATGGT 


2760 


ACAAGCGCAT 


TATGATGATG 


TGAACTTGGG 


2820 


GCATTTTACA 


TCACCAATTA 


GACGTTATCC 


2880 
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TCAAGTACCT CATTACCTAA TCTTGCTCTG AAAAATGCAC CAACAGAAAT GTCACCATCC 108 0 

TGCATTTGAG TAGGTTTTTT TAATAAATCA AACCCTGCTC TTAATTTACC AAGTGG CG AT 114 0 

ATTAATTTTG TAGTAACAAA TGGTTTAATA TCTGTTGGAA TACCCATAAT TGAACCACCT 120 0 

GGAATCGGAT ATAATTTATT TTTCGCAAAA ATATATGATT GTCCAGTCGT ATTTGTAACA 12 6 0 

ATATCTTGTT CTAATCCAAT ATCTTTCGCT AATTCTGTCA TAATCGTTTT TC 1312 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TTTTACAATA AAAATATGAT ATACTACTTG TCGTATATAA GGAACGGAGG ACAATTTATG 6 0 

CATACATTTT TAATCGTATT ATTAATCATT GATTGTATTG CATTAATAAC TGTTGTACTA 120 

CTCCAAGAAG GTAAAAGCAG TGGACTTTCA GGTGCCATCA GTGGTGGTGC TGAGCAGTTA 18 0 

TTCGGTAAAC AAAAACAACG TGGCGTCGAT TTATTCTTAA ATAGATTAAC AATTATTTTA 24 0 

T CAAT ATT AT TTTTTGTACT TATGATTTGC ATAAGTTATC TTGGTATGTA AGGTCCGGCG 300 

ATGTAAATGT CGGG CTTTTT TATTTATAAT TAAGAATGTA ATAGTTTAAC AATAAGCTAT 36 0 

GTAAAATATA TAGCCTAGTT AAGTATGCAA AGGGAGCGTT AGATTTATGC AGATAAAATT 420 

ACCAAAACCT TTCTTTTTTG AGGAAGGTAA ACGTGCCGTG TTATTACTAC ATGGTTTTAC 4 80 

AGGCAATTCG TCTGATGTTC GTCAATTAGG TCGATTTTTA CAAAAGAAAG GTTATACATC 54 0 

ATATDCACCG CAATATGAAG GCCACGCGGC ACCACCAGAT GAAATACTGA AATCTAGTCC 6 00 

TTTCGTTTGG TTTAAAGATG CGTTAGATGG TTATGATTAT CTTGTTGAAC AAGGTTATGA 66 0 

TGAAATTGTT GTTGCTGGTC TATCATTAGG TGGGGATTTT GCTTTAAAAT TAAGCTTAAA 72 0 

TAGAGATGTA AAGGGTATTG TAACGATGTG TGCTCCTATG GGTGGCAAAA CTGAAGGTGC 7 80 

CATTTATGAA GG CTTTTT AG AATATGCACG CAATTTTAAA AAGTATGAAG GTAAAGATCA 84 0 

AGAGACTATT GATAATGAAA TGGATCATTT TAAACCAACT GAAACTTTAA AAGAACTAAG 9 00 

TGAAGCATTA GATACGATTA AAGAGCAAGT TGATGAAGTG TTGGATCCTA TTTTAGTGAT 960 

-'-VAA^A.^AA BA^HArAATA TGATTGATCC ACAATCCGCA AATTATATAT ATGACCATGT 1020 
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AAATCGTTCA -pAGTATCTAC CTGCAATGAA AAATATAAGC CAAATCACTA TAAATGCGCT 4 8 00 

ATTAATCAAA AGCAGCACCC ATTTATCAGC AAAATTATCA GCATCCCCTG CTAAATTATA 4 86 0 

ATGAATAGGC ACTTTGGTTG GTAATTTTGG ATAGGTCACT ACTGTATAGC ACATCATAGC 4 920 

TAAGTAAATA AGTAGACTTA ATATTGTAAA AGACCTGATT TTAGACATTC TATCGCCTcT 4 980 

TcTTTACATT TTATGTATAA CACTCTGCCT ATTTTACCTT TTAATaCATT ACCCCAAcGA 504 0 

TtAAaCAATA tGTAaTGATA CTATAATTGC GTCAGGAGTA TCCGCTTGTT AAATGTGCAT 510 0 

AGCTTATATT TAGCTGTTTA ACATGCCACA TAATGATTCG AATTATT 514 7 
(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CACTTACTTC CACCATTATC ATAACTTTAA AATGGATATA nTTCATCAAA CATTATCTAA 60 

AGGCGTCGCA CCTACACCAA CACCATCCAA CAATTAACTT ACAACTCTGC GATTACTTCT 120 

TCAGCAGCAA CTTTCACnTG CGTAATACAA TCAGGTAGTC CAACCGCTTC AAAAGATGCA 180 

CCAGTTACTC TAAGTCGTGG ATATGTTTGT TTAATATGTG CTTGAATCTG TCTAATTTGT 24 0 

TGAATATGAC CGACATGGTA CTGTGGCATA CTTTTCGGCA AACGATTGAC AATTGTAAAT 300 

TCAGGATCAC CTTTAAATGT CATCATTTGA CTTAAATCTC TACGTACAAT CGATACTAAT 360 

TCATTATCTG TATGATCATC AACCACAGTA TCACCTGGTT TACCTACATA CGCACGAATC 420 

AAAACCTTAC CTTCCGGTGT AGTAAATGGC CATTTTTTCG ATGTCCAAGT ACATGCGGTA 4 80 

40 ATGfCTGTAT CACTCGTTCT CGCAATTACG AAGCCAGTAC CATCATGGGT ATTTTCAATG 540 

TCTTTTTCAT CAAATGCCAA TACAACAGTT GCAACAGTCG TACTATCCAT CGTTTTAAAG 600 

TAATCAAATG CTGGATCTTG TCCGAACCAA TTTAAAAACA CTTGATGTGG TGTCGTTACT 660 

AATACGCCAT CATACACTTC TTCTAGTTGA TCATTGTAAA CAATTTTATA TTGTTTTTGA 720 

GATGTAATTA TATCATCCAC TGACGTATTG TAGCGTATTG TCACACCTTT ATTTTTAACA 7 80 

TCTTGTTCTA ATGCTTCAAT AAATGAGCTT AAACCATGCT TAAATTGTTT GAATTGTCCT 84 0 

TTCGGTGCGC CAGGATATAA TTGTCTTTGT TTCAGACGCT TATTTTTCTC ATCCTTCATA 900 

CCTTTTATCA GACTTCCGAA TGCCTCTTCT TTTTCTTTAA AATTAGGAAA CGTACTCATC 960 
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TGGAGAGAAT AATGGGAAAA TTGCTCCCGC TTTAGCAATA CCTTGTCCAA TTGCTACAGT 30 00 

CAAACCACCG TATGTCATAA CTTTAGCAAT AGCTAGGATA GCTGAAATTG TAAGGATCGG 3060 

TAACCATAAT TCTTTAATTG CTTCGACCAA TAAAGCACCT GCACTTTTCC ATTTTAACTT 3120 

CGTAATTAAA ATTGTAATAA TTACTGTTAA TAAAATCGCT GTCCCAGTTG CACCAATTAA 3180 

ATCGAGACGC AACGCAATTC CTTTAGGCGA TAAATCACTC ACAGTATTTG GAATTGGCAA 324 0 

TTTTATTACT AAACTTTCAA GTGCACCTCC AGGTTGGAAT AATTTTTTGA AGAATGGTGC 33 0 0 

ACTCCATACT AATACAAAGG CAGTTAAAAT TACGAACGGA CTCCAAGCAA AGACAATTTC 33 6 0 

TTTAGGCGTT CGTTTTTGAA TTTTATGTTC AGACGCTTCC AATCTGAAAA TGTTTTTCGG 34 20 

TTTAAATTTA CGACAAACAA ATGCTAACAC CACCATTGTT GCTAGTGATG GAATAATGTC 34 80 

TGCTAGTTCT GGACCATGGA ATATTGTTAA TAATAATTGT AATCCAGTAT ATGTACCACT 3 54 0 

CACTGTTAAA ATGACAGGTA AAATTTCTTT AATACCTTTC ATACCATCTA CAATGAATAC 3 600 

TAAAACAAAT GGAATAATAA AGTTTAAAAT TGGAAGTGTT AATG CTGAGT ATCTCGCAAC 3660 

ATCTAATGTT GTAACGCCTC CACTTAAGTT AAACGTATCA ATAATACTAA CTGGTAAACC 3720 

AATTGCACCA AAGGCACCCG C CG CACCATT AGCAATTAAA CATAACATCG CTGCTTTTAA 378 0 

TGGTTCAAAT CCAAGTTGAA TTAATAATAC TGCACAAATC GCAATTGGCA CACCAAATCC 3 84 0 

TGCTGCACCT TCTAAAAATG CGTTGAAACA AAATCCAATT AATAATAGTT GGATTCTTTG 3 90 0 

GTCCACTGAA ATACTTGCAA TACTATCTTG AATAATAGAA AATTGTCCTG TTTTAATAGA 3 96 0 

AACTTTATAT AACCAAACTG CCATTAAAAC GATATATCCT ATTGGGAAAA TACCGGCAAC 4020 

AACGCCTTCT GTAATCGCAC CTGCTGATAC ACGCGCTGGT AATTCAAATA CAAATAAAGC 40 8 0 

CACAATCAAT GTAACAACCA AAGTTGTCAA TGCTGCATAA ATGCCTTTCA TTTTAAAAAC 414 0 

GGTTfiAGCAT AATAAAAATA AAATAATAGG TACTGCTGCA ACTAAGGCTG ATAATCCGAC 42 0 0 

ATTATCGAAT GGATTTACAG TAAGTAGTGT CATAATGACT CCCTCTCTTT ATATAAAATA 4 2 60 

TTTATCATTC TGATTAATCT ACAACCTATT TCAAC1TATA TTTTGCGATG ATCACATATT 4 32 3 

TAAAATGTAA CACTCCTATA TGTGACAGGC AATCGAATTT TTACAAAAAG TTCACAAAAT 43 8 0 

ATACACAATA TTTAACTATA ATAmATAATA TATCaTlltTA ATTATAAATA CTAGATATTA 44 4 0 

TTTATAATAA TCTCAGGAAT TCGCTTCAAA ACTGCATCAT GAGAGTTTAT ATTTTTATTG 45 0 0 

AGAATCTCTC ATTTTATGAA TTGTAGGAAG TAAACAAAAT ATGACAAGCG TCAAACCAAT 4 5 60 

-i\na^fl-r. ^r\^^ATA m ^AAAr^ATAH ^AAATTGAAT TGATGATGGT GTTGTATTTG 4620 



EP0 786 519 A2 



10 



15 



20 



25 



AAAGTCTTAA AGACGACCCA AGCCAAAGCA CTAACGTTTT AGGTGAAGCT AAAAAATTAA 12 00 

ACGAATCTCA AGCACCGAAA GCTGATAACA ATTTCAACAA AGAACAACAA AATGCTTTCT 12 6 0 

ATGAAATCTT GAATATGCCT AACTTAAACG AAGAACAACG CAATGGTTTC ATCCAAAGCT 1320 

TAAAAGATGA CCCAAGCCAA AGTGCTAACC TATTGTCAGA AGCTAAAAAG TTAAATGAAT 13 8 0 

CTCAAGCACC GAAAGCGGAT AACAAATTCA ACAAAGAACA ACAAAATGCT TTCTATGAAA 144 0 

TCTTACATTT ACCTAACTTA AACGAAGAAC AACGCAATGG TTTCATCCAA AGCCTAAAAG 1500 

ATGACC CAAG CCAAAGCGCT AACCTTTTAG CAGAAGCTAA AAAGCTAAAT GATGCTCAAG 1560 

CACCAAAAGC TGACAACAAA TTCAACAAAG AACAACAAAA TGCTTTCTAT GAAATTTTAC 1620 

ATTTACCTAA CTTAACTGAA GAACAACGTA ACGGCTTCAT CCAAAGCCTT AAAGACGATC 1680 

CTTCAGTGAG CAAAGAAATT TTAGCAGAAG CTAAAAAGCT AAACGATGCT CAAGCACCAA 174 0 

AAGAGGAAGA CAATAACAAG CCTGGCAAAG AAGACAATAA CAAGCCTGGC AAAGAAGACA 180 0 

ACAACAAGCC TGGTAAAGAA GACAACAACA AGCCTGGTAA AGAAGACAAC AACAAGCCTG 186 0 

GCAAAGAAGA CGG CAACAAG CCTGGTAAAG AAGACAACAA AAAACCTGGT AAAGAAGATG 1920 

GCAACAAGCC TGGTAAAGAA GACAACAAAA AACCTGGTAA AGAAGACGGC AACAAGCCTG 1980 

GCAAAGAAGA TGGCAACAAA CCTGGTAAAG AAGATGGTAA CGGAGTACAT GTCGTTAAAC 2 04 0 

CTGGTGATAC AGTAAATGAC ATTGCAAAAG CAAACGGCAC TACTGCTGAC AAAATTGCTG 2100 

30 CAGATAACAA ATTAGCTGAT AAAAACATGA TCAAACCTGG TCAAGAACTT GTTGTTGATA 216 0 

AGAAGCAACC AGCAAACCAT GCAGATGCTA ACAAAGCTCA AGCATTACCA GAAACTGGTG 2220 

AAGAAAATCC ATTCATCGGT ACAACTGTAT TTGGTGGATT ATCATTAGCC TTAGGTGCAG 22 80 

CGTTATTAGC TGGACGTCGT CGCGAACTAT AAAAACAAAC AATACACAAC GATAGATATC 2340 

ATTTTATCCA AACCAATTTT AACTTATATA CGTTGATTAA CACATTCTTA TTTGAAATGA 24 00 

TAAGAATCAT CTAAATGCAC GAGCAACATC TTTTGTTGCT CAGTGCATTT TTTATTTTAC 246 0 

TTACTTTTCT AAACAACTTC TGAAACGCCT CAACACTTTC TACTCTGATT ACATATATGA 2520 

CATTTTTAGG CATTAAAAAA TCGAACTAGA CAAGATGCTC ATTGCATTTC GTACTAGTTC 2580 

GATTCATGAA TAATTAGATT TAAAATGTCA TTTGAATCCA AGTGACAACA TTATTTATAT 264 0 

TTAGAATATT AACGTTAGTA TAAACGTCCA AACACAAATA AAAGCAACAA ATATAATACT 2700 

GTATTTTAAC GTCATTTTTA ATAATGCAGA TTCTTCACCA ACTTTTTTAA CAGCTGCAGT 2760 

CGCAATGGCA ATTGATTGTG GTGAAATAAG TTTCGCTGCT ACACCACCTG CAGTGTTAG C 282 0 

TGCCACAAGT AATGAACCGC TTGTTGAAAT TTGTTGTGCC ACTGTCGCTT GAATAGGTGC 2880 
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GCAATTTCTT CATGACTTAA ATGACCCTCT TTTTTACCTT TTTCAATTAA TTGCTTCTTA 4 60 

ACATCTTCTA ATGTTAATGT CGGATCAATT GTTTGTTTTT TAATTTTAAC TGTGTTATCA 54 0 

GACATGAAAC GGCCTCCCGA TTTTAAATAT GAACATTCGA AATTTATTCA ATATTGCTAT 60 0 

TTTAAACGAA ATTCTTAATT AATTCCATCC ATATTTTnAA TTTTATTTTA CAAATTGGGA 66 0 

ACTAAATCCC CAATATTTAT TTTTCAATAG TGGTGGTT 6 98 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

ACTTGATGAT GTATACAATG TATTTCAAGA ATATTATCAA AAAACATCTA ACATTAAGTT 60 

TTGTAGAATT CACAATTCTA GCTATTATCA CTTCTCAAAA TAAAAACATC GTTCTTCTTA 12 0 

25 AAGATTTAAT TGAAACAATC CACCATAAAT ACCCTCAAAC TGTTAGAGCT CTCAATAATT 180 

TAAAAAAGCA AGGCTATCTA ATAAAAGAAC GCTCAACTGA AGATGAAAGA AAAATTTTAA 24 0 

TTCATATGGA TGACGCGCAG CAAGACCATG CTGAACAATT ATTAGCTCAA GTGAATCAAT 3 00 

30 TATTAGCAGA T AAAG AT CAT TTACATCTTG TTTTTGAATA ATATCTCTAT TACGCAAGTG 3 60 

TGCTGTATTC TAAAGTGCAC TTGTGTTTTC TATTTTTTAA TAAAACCTCA GCACATAATG 420 

AACAACTTTC TATTTTCTAT ATCACTTAAA ACCATTTCCG AAATTAAACC TCAGCACATT 480 

35 

CAAAGCCCCA CTTTATTCTT AAAAATATTT TTTAACTCAT ATGTATTAAA CCGCTTTCAT 54 0 

TATAAAAAAT ATCTCTATAT TtTATCTGtT TtTATTAATC GAAATAGCGT GATTTTGCGG 60 0 

TTTTAAGCCT TTTACTTCCT GAATAAATCT TTCAGCAAAA TATTTATTTT ATAAGTTGTA 66 0 

40 

AAACTTACCT TTAAATTTAA TTATAAATAT AGATTTTAGT ATTGCAATAC ATAATTCGTT 72 0 

ATATTATGAT GACTTTACAA ATACATACAG GGGGTATTAA TkTGAAAAAG AAAAACATtT 78 0 

ATTCAATTCG TAAACTAGGT GTAGGTATtG CATCTGTAAC TTTAGGTACA TTACTTATAT 84 0 

45 

CTGGTGGCGT AACACCTGCT GCAAAtgctG CGCAACACGA TGAAGCTCAA CAAAATGCTT 900 

TTTATCAAGT CTTAAATATG CCTAACTTAA ATGCTGATCA ACGCAATGGT TTTATCCAAA 96 0 

s 0 GCCTTAAAGA TGATCCAAGC CAAAGTGCTA ACGTTTTAGG TGAAGCTCAA AAACTTAATG 102 0 

ACTCTCAAGC TCCAAAAGCT GATGCGCAAC AAAATAACTT CAACAAAGAT CAACAAAGCG 108 0 
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CGATGCCCCT ATAACGGATC GATTTGATGA CAATGACAAa GAaCATCTTG AaGCAATTGT 300 

TAAAATTATT GAAAaGCAAC TCGCATAAAA GGACATCAGC ATTTTCAATA AAGTGTTGAC 360 

5 

AGTTAGCAGG AAAATGTTAC AATAATCTTT GTGTGAATTA ACGAAAGTAG CAGTTGTATA 42 0 

TTATTGAGCG CTATGTTGTT CCCAATGCGG ACGTGTCACG TAACTGTCGC TATAAGGTGA 4 80 

AGACACATAA AACAATATAT CTTAGTAAGC ATGCAACACT CTTTTTTGTT TATTCATAAC 54 0 

10 

AACAAAAAAG AATTAAAGGA GGAGTCTTAT TATGGCTCGA TTCAGAGGTT CAAACTGGAA 6 00 

AAAATCTCGT CGTTTAGGTA TCTCTTTAAG CGGTACTGGT AAAGAATTAG AAAAACGTCC 660 

TTACGCACCA GGACAACATG GTCCAAACCA ACGTAAAAAA TTATCAGAAT ATGGTTTACA 720 

15 

ATTACGTGAA AAACAAAAAT TACGTTACTT ATATGGAATG ACTGAAAGAC AATTCCGTAA 7 80 

CACATTTGAC ATCGCTGGTA AAAAATTCGG TGTACACGGT GAAAACTTCA TGATCTTATT 84 0 

2Q AGCAAGTCGT TTAGACGCTG TTGTTTATTC ATTAGGTTTA GCTCGTACTC GTCGTCAAGC 900 

ACGTCAATTA GTTAACCACG GTCATATCTT AGTAGATGGT AAACGTGTTG ATATTCCATC 960 

TTATTCTGTT AAACCTGGTC AAACAATTTC AGTTCGTGAA AAATCTCAAA AATTAAACAT 1020 

25 CATCGTTGAA TCAGTTGAAA TCAACAATTT CGTACCTGAG TACTTAAACT TTGATGCTGA 1080 

CAGCTTAACT GGTACTTTCG TACGTTTACC AGAACGTAGC GAATTACCTG CTGAAATTAA 114 0 

CGAACAATTA ATCCGTTGAG TACTACTCAA GATAATACGG TCAATACCAA CACCCACAAT 1200 

30 TGTGGGTGT 12 0 9 

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 98 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
~ (D) TOPOLOGY: linear 



40 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 181: 
AAATCCCTTt GTtaAAgTsC AAAtTTTTCc AACrgCTTTA AtArGACCCA TATTACCtTC 60 
TTGGATTAAA tCmAGGaATG AcATACCACG ACCaCGTATC TTTTAGCAAT ACTTACAACT 120 

45 

AAACGTAAGT TCGCTTCTGC AAGTCTTGAT TTTGCTACTT CATCACCTTG TTCAATACGT 180 
TTGGCTAATT CGATTTCTTC TTGTGCACTT AATAAGTTAA CACGCCCAAT TTCTTTAAGG 240 
TACATACGAA CTGGGTCATT TATTTTAACA CCTGGAGGGG CACTAAGATC ACTTGGATTC 300 

50 

AGTTTCTCGT CAGTATCTGA ACTATCTTTT TCATTAACTA GTGAAATATC ATTATCATTT 36 0 
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TAACAATGTA TCTCTATCAA GATAAGATGT TnCAATATCA TCTGACCAGC CCATTCTTCT 600 

TAAACCAGCT GCAGCTAAAA TAATCGCATC ATAATCTTCA GTTTGTAACT TTTCTAATCG 660 

5 

TGTATCTATA TTACCTCTAA TCCATTTAAT CTCTAAATTA GGATACTTAG ATAATATTTG 72 0 

TGCACCACGA CGTAATGAAC TAGTACCAAT AATACTGCCT TCTGGCAATT GGGATAGTGG 78 0 

TGTATGTGTT TTAGAAATAT ACGCATCAAA AGGTAATTCT CTATCAGGGA TACAACCTAA 84 0 

10 

TGTTAAACCT TCCGGAATTA CACTTGGTAC GTCTTTAAGC GAGTGTATTG CCATATCGAT 90 0 

ATTTTTTTCA AAAAGTTCAT GTTGTATTTC TTTAACAAAT AAGCCTTTGC CTCCGACTTT 96 0 

15 AGACAATTGT TTATCTACTA TACGATCGCC TTTCGTGaCA ATTTCTTTAA TTTCAATTTC 102 0 

TAGATTTGGC TCGACAGCTT TTAATTTATC AATAAATTGC TGGCTTTGTG TTAAAGCTAA 108 0 

TTTACyTCTT CTGGAGCCAA CGACTrATTT ACGCATGTTC AATTCCTCCT AGGAACGGAT 114 0 

20 TGCTCTAGAT TATTTTCTCA ATTCACAAAA TGTGTTGCAA AAAATAAATT AATCATATTT 12 00 

AAGCAAAATA AAATAATGTT ATAGTATATT AAATATCTTG AATTCAACCA TTTGTTGATT 1260 

CTAAGTAAAA TATAACTTCC ATATAATACT GTAATAATTG AAGAGAGTAT TACCTTCGGG 13 2 0 

25 TCAATGAATA TACGTTCACC AACTGAAATT ACACCCCACT GTGTACCTAA AATAATACTA 13 8 0 

AATATGAGAA TTATCCACCC ACTTAACGTT GAGTAAAACA CAATTGATTC AAGTGTAGCA 144 0 

ACGCTACCAA TTCTAAAGTA TTTTTGATCA AAACGTTTTT CCTTCAAATT ACGGTATTGC 150 0 

30 ATGATATACA GTAATGCATT GACAAAAGCT AAGGCAAAGA AGACATAACT TAACACAGCT 156 0 

AGACCGATAT GGACTAACAG TAACTCGTCT ACAACAGCAA TTTTCTGAAC CTTATTAGTA 1620 

TAATGTGTCG GTTGAAATGT ATTCATCCCT AAnAGTGTTA ACCCTATTAA ATTCCAAGGA 16 80 

35 

aaaacacag 1689 
(2) Information for seq id no: iso: 
(i) sequence characteristics: 

40 (A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 180: 

nTGGnTGGCT TTTCCTATTG GACCAAATGG ACCnTTTACC TGGCCnTTCC CAGGACACCC 60 

^ CGCTTGTGCC CACATTCCAA TCGGAAAAGG TGTATGTGGT ACAGCCGTTT CAGAACGTCG 120 

TACACAAATT GTAGCTGATG TTCATCAATT CGAAGGACAT ATCGCTTGTG ATGCTAATAG 180 
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TGAATACGTA TAAAATAAGT GGGATTCAAT CGTTTTTCAT AACGATTCAA TGGCTCTGTT 222 0 

GTTTCGTATT TATGATTCGT TGTATGTATG GTTGTAATAC CATTATGTGT GCCAATCCCA 22 8 0 

5 ATAATATTTT GTTGCTTTAA CATGTGAATT TTATCGTCAA TTTCAACAGG TAAGCTTTGA 2 34 0 

TCAAAATTCG CCGACATATC ATTCGCAATT GCACTTGCGT TATTATCATC TTTGGCTTTA 24 0 0 

GTCGCACGCA CTTTATTGAC TGCTTGTTCA ATACGTTTTT GACCAAACGG TTTCAAAATA 24 60 

W TAGTCTGTCG CATTTAATTC AAATGCCTGT ACTGCGTATT GGTCATGTGC AGTTGCAAAA 2 52 0 

ATAATCGCAG GTGGCTCTTT CATCTTTTGA ATCTTAGCTC CTAATTCGAT CCCATTTTCA 2580 

TCCATTAAAT TGACATCTAA AAATATAATG TCATATTGAT TGATCAGTAG TGCTTCCAAT 2640 

15 

GTTTCTTTTA CATTTTCTGC CTCATTAATT TCTTCAAAAC CACCAATTTC ATTTAATAAA 27 00 

TATGTTAATT CATTACGTGC TAATGGCTCA TCATCTATGA TTAATGCTTT CATATTTATT 2760 

CCTCCTCTTG TCTTTCATAA GGAAGTACAC ACCAAAAAGT GGTACCGCTC GATGTCGATT 2820 

20 

CAAATTGTAA TGCTGCGGAT TTTCCAAATA ATCCTTTTAG GCGTAAGTTT AAATTTTCTA 2880 

AAGCACTACC AGTTCCAGAC TCTGATTCTA CAGATGTnTC TCCCaACAAA TGCATTTTAT 2 940 

, 5 CTTTAGAAAT ACCCTGACCA TTATCTTGTA CAATAATACG TACATGTGTT GCAGTTTCTT 3 0 00 

TAATCACTGA CACGTCAATA TCGTT 3 025 
(2) INFORMATION FOR SEQ ID NO: 179: 

30 (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

ACAGAATTTC ACAGCATTTT TAGATGAAAA AATAAGCCAG TCATAGCGTT GATTTAACAA 6 0 

40 ATGAATATCA AAATTTAGTG GCTTTATATC AATAAAGGGT TTGTGAATAA TTGATACTAA 12 0 

ATCACTTTGC ATGTCATTTG TTTGTGTCAT AACTACAACT GGCTTCATAT TTAAACGTCA 18 0 

CTCCATTATT TAATGTTGTT CATTTAAGCG TTTTATAATT TCATAAGCAC CTTGCTCTTT 24 0 

45 

TAATTTGTTA CTCACTGTTT TGCCTAACTC AACCGGATCT GTTCCGTTCA TTGTATATTC 300 

AAATCGTTCT TTACCATCTG GGGTCATAAT TAAACCTGTA AATTCGATTT CGTTTTGATC 36 0 

TGAGATTGTA GCATATCCTG CAATTGGCAC CTGACAACTA CCATCCATTT CTGCTAAAAA 42 0 

50 

CGTTCGTTCA GCAGTCACAC ATTTTGCAAC CTCATCATTA TGTACTTTGC TTAATAATGT 4 80 
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CGAATATTCT TATTATCATA AAGAAATCGT GAAATATTTA AATGATGATA TTGCTAAGGG 42 0 

CTCTATCTTC GACTATTTAG AATCAAACAT GAAACTTCGT ATTGGTTTTT CAGATATTTT 4 80 

CTTTAATGTA GATCAACTCA CTTCAAGTGA AGCTTCATTA CTACAATTGT CTACAGGTGA 54 0 

ACCATGTTTA CGTTACCACC AGACTTTTTA TACAATGACT GGCAAACCCT TTGATTCATC 60 0 

TGACATCGTA TTTCATTATC GTCATGCACA GTTTTATATT CCTAGTAAAA AGTAATAAAT 66 0 

ACATAAAAAC GTCTATATCC CAGTTATAAA CTGGAGTATA GACGTTTTTT TACGATAATA 72 0 

ACAATGGCTC AAATTGCTAT TATCTTGCTT AGGTTTTTCG TTTTAGAAGA ATATTGCTAC 780 

AAAGACAGGC ACAACTGCTA CAACAACTAC ACCAACTAAC ACTAAAGCTA TACTTGCCAT 84 0 

TGATTCTTCT ACAGGTCCTA ATTCTTTGGC TGGTGCTACA CCTAATGTGT GACCACTTGT 900 

TCCAAGTGCT AATCCTCGGG CAATAGGGTT AGTAATTCGG AAAAGCTTTA AGAATTTATT 960 

ACCTAGGGCA TAAATAATGA CACCATTTAA AATAACTGCT AATGATGTTA ATTCTTTTAT 1020 

ACCACCGATA CCAGCTGATA CTGGTAACGC AATCGCTGTA GTTGCTGCTT GAGGTAACAT 1080 

TGATAAAATA ACAT CATTGG CAAATTGTGC TAACTTCGCA AAAGTTAAAA TAATTAATAA 1140 

CGCTACAACT GTACCGATAC CAATACCTCC GATGATACGA TGCCAATGTT TAACAAGCAC 12 00 

TTCACGCTTT TTATATAACG GAATCGCAAA ACAGATTGTT GCCGGTTCTA AGAAGAAGTA 126 0 

AATAATGTCT CCACCTATTT TGTAAGTCTT ATACGGAATG CCTGTTAAAT AGAGGAAGGC 132 0 

30 CACACCAAAT ACCATACTGA CAAATAGCGG TGCGAATAAG AAGAAACGAT TAGTTTTTTC 13 8 0 

AAATAATATG GTCGCTAAGA AAAATGGTAT AACGGATAAC AGTATTCCGA AGTAAGGTGT 14 4 0 

GTTTaGTGCT AAGTGGTTAA TCaTGAGCTT GTGCCTCCTC TATTTTGATC TTTTTTGTGA 1500 

CTTTGTCACC TTTAGATCTC GAAGTAACTT TCATAATAAT TTgTGTGACA TAGCCAGTAC 1560 

AAATiAGTAA TAGTATTGTT GAGACGATTA TTAGTCCAAT GATTAAAAAT GGTGCTTGGC 162 0 

TAATGACACC TAAAGAGTTA ACAACTGAGA TACCGGCTGG TACGAAGAGT AAGCCAATGT 16 8 0 

TATTTGTTAG TGTCGTTCCT ACTTTTTCGA CTTCGCCTAA CTTAACAGCA CCAGTACATA 174 0 

ATAATACAAA TAATAATACT AAACCGATTA CTGATGCAGG CATAGGAATT GGCATAAATG 18 0 0 

ATTCAATTAT TTTCGATACA AAGAGTACTA AAGCAATTAC AATGACTTGG TGAAAAAAGT 186 0 

GTGCTGGTTT TGATGCGTCT TTTTGTTGTT TCACGACCAT TGCCTCCTAC GTTTGATTTA 192 0 

ACTAAAGTAT AGATGGCTCA CTTCGATTTG CGTGATTTTT AGTCCGAAAT ACAAAATATC 198 0 

ATAGGTAAAA TGCATAAAAA AAAGGATTAC TGTTAAAGTA ATCCTATCGA CGCTTTAAAA 2 04 0 

TCTTTCATAA ATGAACGTCC AACTTGCATC TTGACACCAT TTGTCAATAT TACCATATAA 210 0 
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AAATACAACC AGGTGACGAT GTAGGTCGTG CATTCAGCTT TGAAACAACA GAATATATAT I86 0 

TAGATCAATT GCCATGTTGG CTAACGTATA CTAATGCTGA AACACACAAA GTTATCGATG 192 0 

ATAATTTACA TCTATCTGCA ATGTATTCAG GGATGATTAA AGGAACCGGG CCACGTTATT 1980 

GCCCTTCAAT TGAAGATAAA TTTGTTCGAT TTAATGATAA GCCGCGACAT CAACTTTTCT 204 0 

TAGAGCCTGA AGGTCGTAAT ACAAATGAAG TATATGTGCA AGGATTGTCT ACAAGTCTTC 2100 

CTGAACATGT GCAcGTCAAA TGTTAGAGAC GATACCAGGT CTTGAAAAAG CAGATATGAT 216 0 

GCGTGCCGGC TACGCAATTG AATATGATGC GATTGTGCCA ACGCAGTTAT GGCCTACACT 2220 

TGAAACGAAA ATGATTAAAA ACTTATATAC TGCAGGTCAA ATTAATGGTA CATCTGGTTA 2280 

TGAAGAAGCA GCAGGACAAG GATTGATGGC AGGTATTAAC GCTGCAGGTA AAGTGTTAAA 234 0 

CACAGGCGAA AAGATATTAA GTCGTTCAGA TGCATATATT GGTGTCTTAA TCGATGATCT 24 00 

TGTAACTAAA GGTACTAATG AACCTTATCG TTTACTAACA TCACGTGCAG AATATCGTTT 2460 

GTTAcTACGT CATGATAATG CTGATTTGAG ATTGACGGAT ATGGGATATG AACTTGGTAT 2520 

GATTTCTGAA GAAAGATATG CACGTTTTAA TGAAAAACGT CAGCAAATTG ATGCGGAAAT 2 580 

TAAGCGTTTA TCAGATATTC GTATTAAACC AAACGAACAT ACGCAAGCGA TTATTGAACA 2 64 0 

ACATGGTGGT TCTCGCTTAA AAGATGGTAT TTTAGCTATC GATTTATTAC GCAGACCTGA 2700 

AATGACTTAC GATATAATTT TAGAACTTTT AGAAGAAGAA CATCAATTGA ATGCAGATGT 2760 

TGAAGAACAA GTAGAAATAC AAACAAAATA TGAAGGTTAT ATCAATAAAT CACTACAACA 282 0 

AGTTGAGAAA GTTAAGCGTA T 2 841 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3025 base pairs 
Z (B) TYPE: nucleic acid 

{C} STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

ATCTAATTTC AAACCCGGTG ATAAATTGCC AAGCGTGACG CAATTAAAAG AACGTTATCA 60 

AGTAAGTAAG AGTACTATCA TTAAAGCATT AGGCTTATTG GAACAAGATG GTTTGATCTA 120 

TCAAGCACAA GGCAGTGGTA TTTATGTGAG AAATATTGCT GATGCCAATC GTATCAACGT 180 

CTTTAAGACT AATGGTTTCT CTAAAAGTTT AGGTGAACAC CGAATGACAA GTAAGGTACT 24 0 

TGTTTTTAAG GAGATTGCAA CGCCACCTAA ATCTGTACAA GATGAGCTCC AATTAAATGC 300 



55 



ATAGAGTnCT GGnACTTACT ATGACATATG 
CAAAACGTGC CTTTTTAAAT GGTCGTATTG 

5 TTATTCGCTC GAAGACAGAT AG AG CTTCT A 

TAAGTGACTT AATCAAAAAA CAACGTCAAT 
TGAATATTGA TTATCCTGAA TACGATGATG 

10 AGCAGTCTAA AGAAATCAAA CAGGAAATTA 
AAATTATGCG TGAAGGTTTA TCTACAGTTA 
CGATGTTAAA TAATTTAATA CAAGATAATA 

15 

CTAGAGATGT CTTAGAAGAG TACGTCAATG 
CTGCTGGTAT ACGTGAGACA GAAGATATAG 
AGGCTCTTAG CCAAGCAGAC TTAATTTTAT 

20 

AAGAAGATTA CACATTATAT GAAGTGGTTA 
AAATGGATTT AGAGCAAAAC ATAGATATTA 
CATTAATTCA AACTTCAATG TTAAAACAAG 

25 

gAGATTTGTT CTTTGGTGGa GAAGTACAAA 
GACATATTTC ATTATTAAAA CAAGCAAGAC 

3Q AATCTGGTGT GCCTATGGaT ATGGTACAAA 
GAGAAATTAT TGGTGAGACT GCAAgTGATG 
GCTTAGGTAA ATAGTAATTG AAATAGACGG 

35 CAAATAAGGA GGTTTATATT GTGGTTCAAG 
ATGCAGGTGT AGAAGCAGGT TTAGCATCTG 
CAATAAATTT AGATAATATT GCATTTATGC 

W AAGGTATCGT TGTTCGTGAA ATTGATGCTT 
AAACACACAT T CAAATG AG A ATGTTAAATA 

GAGCGCAAgc AGaTAAAGTA CTTTATCAAC 

45 

AAGGTGTACG TACAAATATT GGTACAGAGT 
GAACATTTTT ACGTGGTGAA ATCATTTTAG 

50 

ACCAATTACC ATCAATCACA TTATCAGACA 
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GCGCTAGAAT GGCTGAGCCA GGTGAATTTA 60 

ATTTATCTCA AGCTGAAGCA GTTATGGACT 120 

AAGTTGCGAT GAATCAAATT GAAGGTCGTC 180 

CTATATTAGA GATACTCGCT CAAGTGGAAG 24 0 

TTGAAGATGC GACTACTGAA TTTCTTTTAG 300 

ATCGTTTATT AGATACCGGT GCGCAGGGTA 3 60 

TTGTTGGTAA ACCAAACGTA GGTAAATCAT 4 20 

AAGCGATTGT AACTGAGGTA GCAGGTACTA 480 

TTCGTGGCGT GCCATTAAGA TTAGTTGATA 54 0 

TAGAGAAGAT TGGTGTTGAA CGCTCTAGAA 600 

TTGTATTAAA CAATAACGAA GCATTGACwC 66 0 

AAAATGAAGA TGTAATCGTA ATTGTTAATA 720 

ATGAAGTTAA AGATATGATA GGTGATACGC 780 

AAGGTATTGA TGaATTAGAA ATACAAATTC 84 0 

ATCAAGATAT GACTTATGTT TCTAATTCAA 90 0 

AAACGATACA AGATGCGATT GATGCAGCAG 96 0 

TTGATTTAAC TAGAACTTGG GAAATATTAG 102 0 

AACTCATCGA TCAGTTATTC AGTCAATTCT 108 0 

AATACCGTCT TAAGAAGGCT AGTAAGATAT 114 0 

AATATGATGT AATCGTTATA GGTGCGGGAC 1200 

CAAGACGTGG TGCTAAAACA TTAATGCTAA 12 6 0 

CATGTAACCC ATCTGTAGGT GGACCAGCTA 13 2 0 

TAGGTGGACA AATGGCAAAA ACAATCGATA 1380 

CAGGTAAAGG ACCTGCTGTA AGAGCACTAA 144 0 

AAGAAATGAA ACGCGTGATT GAAGATGAAG 1500 

ACGAACTTAT TATAGAAGAT AATGAAGTTA 156 0 

ATTTATCTAA AGCAGTAATT ATTACAACGG 162 0 

GTAATATGAA GTATTCAAGT GGACCAAATC 1680 

ATTTAAGAGA ACTTGGTTTT GATATTGTTC 174 0 
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GGTGCAACCA ACGCTATGTA CCCGG CATAT TTAGCCAATG CTCTACGTTT AGACATTAGA 792 0 

AG TAT CAT CG CCATAATCAC AAGTATAGCA ATTAATAAAT AAACCAAACT CATTATTAGC 7980 

CTCCTTTGTT TCTATAATTG TAATGAAATA TAAATACTAT GTTCACACTC ATTTTCTAAA 804 0 

CCGATAAAAT TTAGTGTTTC AATAGCAGAT TGATGCCCTA AATACTTTTG AATGACTGGT 8100 

ATAAGTATAC CTTTTTGATA AGCATGATAT GCAAATGTCT TACGCAATGT CGTTAGTCCT 816 0 

ACATTATCTA TACCAGCTTC AATTGATGCT TGGTGAATTA TTCGATATGC TTGCTGTCTA 8220 

GATAATACTT GATTTGTTCG TAGTGATTGA AAAAGAACGT CTTCATTCGA AAGACTCCTG 8280 

TCCTCTATAT ATTGAAGTAG TTCTTTCGAT AATGTTTCTG GTAACCTAAT TTTAATCAA 83 3 9 
(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58B base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

CCCGATTTTT TTACGTAATC TAATACATAC GGCAAAATCA ACTTTAATCA AAAAAGACTC 60 

ATACACAATG CCTTTAAAGC ACATGTATGA GTCCTTTTTA GTAGTTTATA TCAAAAAATA 120 

30 

GTTTAATGTA TAAATTAGTT TTTGTTTACA GATGCGTCGT AGATTGATTC TACAGCATCA 180 

CCTAAAGCTT TATCGAATTC TTCTTTAGAT TGATCAGCTC TTAAATCACT AGCTAATGCA 240 

CGTGAGAAAC TTGCGATAAG TTCAGCGTTA TCTTTAAGTA ATTCATTTGC TTTTTCTCTG 300 

35 

CTGTAACCAC CTGATAATAC AACGACACGA ACAACATTAG GATGTTCAGC TAACTCTTTG 360 

TATAAGTTTG GTTCAGTAGG AATTGTTAAT TTCAACATTA CTAATTGATC AGCATTTAAG 420 

CTATCTAAAC CTTTTTTAAG TTCAGCTTTT AATACTTTTT CAATTTCAGC TTTGTCTTTT 4 80 

40 

GCATTAATAT TAACTTCTGG TTCGATAATT GGAACTAAAC CTTTAGCAAT AATTTGTTTA 540 

GCAACTTCAA ATTGTTGTTC AACAACGTCT TTGATACCTT GCTCATTT 588 
45 (2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
50 (D) TOPOLOGY: linear 
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GACAGTTACG ATTGCTAATA TAATTTCTAA TGCCCCAAAT TCAGAAACAT GTAACTGATG 612 0 

TACTTTAGGA AGTCCaATTC GAATATAACC ATATCCAATG ATAATCATAA ATATGCCTAA 6180 

GGTCATAATA ATGTACTGGT TTAAACGATC TTGCATAACA CGTTTAAATC GCTTCGTAGC 624 0 

AAACTTTTCA AAATGTCGAT ATACCATCTC ATAGCTTTTT GAAACTGAAA TCTGTCTAAT 63 0 0 

TTTACCTGTG AACACTTTTT TCCAATCTAC TTTGATTGCT AGTACACTAC CCAATAAAAT 6 36 0 

AATGATGATG GTTAAAAGAA GCGGTATGTT AAATCCATGC CATTGCGAAA CATGTGGTGC 64 2 0 

CAATTGATCA ATTTGATGAT TACCACCTGA TACAGCTCTT AATGCnAGAA CGATAATCCC 64 8 0 

CTTCCCAAAT ATATnTGGTA CAAAAAAGAT TACAGGTACT AGCACCATTA aTATAAGAGA 654 0 

TGGTAAACTA aACAACCATG GTTCGTGGAT ATTTTTTTTA GTAAAAACCT TAGAATCATA 660 0 

TTTTGtCCAA AATACTTCTT TTACCATGTA TAGTGCATAT GTGAATGTAA AAACA CTCGC 666 0 

AATAACACCA ACAAACACGA TAGCTATCAT TGAAATCAAA CTAAATTGGG ATAATTGTCC 67 20 

AGTTTGTGTT AATGCATCTA AAAACATTTC TTTACTTAAA AATCCATTTA AAAATGGTAC 678 0 

TCCAGCCATA GATAGAGCCG CTATCGTCAT GACTAGATTC ATTTTAGGAA ATAGTTGACG 6 84 0 

25 CATTCCACTT AAAATTCGTA TATCCCTTGA ACCTGCTTCA TGATCTAAAA TACCTACTCC 6900 

CATGAAAAGC GCACATTTAA AGATGGCATG ATTCATTAGa TGAAATAGcG CACCArATAA 6 96 0 

TACmAATACA TAAATaGATG CTATTGCGTC TTGTTGGTGT TGAGCATATC CGCCACCTAT 702 0 

30 ACCCACCATA GCCATAATCA TCCCAAGTTG ACTGATTGTA GAGTACGCTA GGATACCTTT 708 0 

TAAATCCCAT TGTTTTAAAG CTGTAATTGA ACCAAATAAC ATTGTTATTA AACCAACAAA 714 0 

CGTAACGATA TATACGTACA TATTGCTAnG ACCTAATAAT GGTGTAAATC GAAGTAATAG 720 0 

AAnGATACCA GCTTTTACCA TCGTGGCTGA ATGTAAATAA GCACTTACAG GTGTAGGTGC 72 6 0 

AGCCATTGCT CTAGGTAGCC AGTATGAAAT GGAraTTGTG CTGATTTTGT AAATGCACCT 73 2 0 

AATAAAAACA TAAAAATCAT AGGGATAAAC AATCCATGAT TCTTAATATG ATCTGCTTGT 738 0 

CCTAATATCT CTGTGATGTT ATTCGTTCCT GTCATGATAT ACAGCATAAT AAAACCAACT 74 4 0 

AATAACGCCA ATCCACCAAA TACTGTAATC ATAAATGATT GAATCGCACC AAATTGACTG 75 0 0 

TCACCATTGT TATACCAATA TGAnATCAAT AAAAATGATG ATmCACTCGT TAATTCCCAA 7 56 0 

AAaATGTACA TCmATATCGT ATTGTCTGAT AATACaaTAC CAATCATACT GAACATAAAT 76 2 0 

AACGTTAAAT AAAAATAAAA CCTTGGTAAA TTGTCTTTTC GAGAGGATAA ATATTGAGTT 76 8 0 

G CAT AGAAG A ATACTGCAAT TCCAATAAGT GAAATAATAA GAGAAAACAT TAAACTTAAA 774 0 

CCATCTAAAC GTAAATCTAA ATTAATATCT AATGTCTTAA TCCATGGAAT AGAGGTAGAA 78 0 0 
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AAGTTTAATG TACCTACTGT TTTATAAAGT AAACCTATAC CTAATAAGAA TAGCCATGAA 4 32 0 

CCAATAATAT TCAAGACAAC ATAAATAATT GCAGCACGTA ATTGTTCTAC AGATTGTCCA 4 3 80 

AGTGTAATGA GTACAAATGA CGCTAGTAAC ATAATTTCAA ACATGACGTA TAAATTAAAT 444 0 

AAATCTGATG TTAGAAAAGA GCCTATCACG CCAACACTTA AAAATAATAT GAACGATGGC 4 500 

AAGTGATAAC GATTTG CTTT ATGTTCGCCA CGCCCAAATC CGTATGCCAT AATTAAAGTA 4 56 0 

ATCACAAACG AAGCGGTTGT AACCATAATT AAACTTAAAG AATCTCCTAA AAA CTG TATA 4 62 0 

CCAAAGGGCG CTGACCATCC TCCAAAGTCT AGCGTAATTG GACGGTGACG CTGAACATAA 46 80 

ATTAATAGCA TTAATGAAAT AATTGTGGTG ATAGTCATTG TACCTAAGTA TAAATATTTA 4 740 

GAAATACGAT CATTATTTTT TAAAAATACA AGGATTAAGG CACAAAGGAA TGGTAATAAC 4 8 00 

ATTGGTAAAA TCAATAAGTT ACTTAGCATC ATCTTCCCCC CTTAGGCCTT CAATTTCATC 4 860 

TTCTTTTGTT ACTTTATAAG TTCTATAAAC AAGTACAAGT AAAAACGCAG TCATCCCAAA 4 920 

CCCTATAACT ATTG CAGTT A GTACAATAGC TTGTAACAAG GGATCAACAA ACAATTGGTT 4 980 

TCCACCAGTT ATTAGTGGTT CTGATCTACT AGAACCATAC GTTCCCATAC TCATAATAAT 504 0 

25 GAGATTACCA GCATGAGTAT ATATTGAAAT TCCGATTACA ATACGAATTA AATTGATTGA 5100 

TAAAATCATA TATGTTCCTA TAAACACTAA AAATCCTATA ACTAGTAATA ATATTAAATT 516 0 

CATGATCGAC CTCCGCTAAG CGACAACATC ACTGTGACAA TAACACCAAC AACTGAGAAT 5220 

AAAATACCTA ATTCAAAAAG TGTTATTGTA CTTACATGAA TTTGTCCTAA AATTGGAAGT 5280 

ATCCAAGTTG TTTCATATTG AGACAAAAAT GGTTTTCCAA AAAACATAGG TATTATCGCA 5340 

GTAATAGATG ATACCAATGC TCCAATAATC ATTAAAATTC TAAAATCAAT CGGTAAACTT 54 00 

TCTAAAACCT CTTCAACATT AAAAGCCAGA AACATTAAAA TAAACGCTGA ACTAAATATT 5460 

AAACCACCAA TAAACCCACC ACCAGGATTA TTATGACCTG CGAAGAAGAC ATAGAATCCG 5520 

AAAGTCAATA AAATAAATAC AACAAGTTTC GTGACCGTTC TTAACACGAC ATCATTCTCT 5580 

TTCATCTTGT CCCCTCCGAT CTTGATAATT TAATAATGtg TAAATACCTA GCCCAGTAAT 5640 

AATTAACACT AATCCTTCAA ATAATGTATC TAATGCTCTA AAGTCACCAA GTATCGCATT 5700 

TACAATATTT TTACCACCTG TTAGTTTGTC AGCTTTTAAA TAAAAGTCTG ATATTGATGA 5760 

TAAACCATCT GTTTGTTGTG TAATAAAAAT TAATGATACA ACAATAAGTG CCATCAAGAG 5820 

TGATACAGAA ATTTTAATTA TTTCTCTTTT TTTGTTAGCG TTAGATCTTG GCACGTTTGG 5880 

50 TAATCTTGAA AAACTGACAA TAAATAGTAT CGTCGTTATT GTTTCAACTA CTAGCTGAGT 5940 

CAATG CTAGA TCAGGGGCTT TCATTGCTAT AAAGAATAAG GTCACAACAA ATCCGATGAC 6000 
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CTTAAGAAAA CATCTTGGAA TTTCACGATA CCTATTGCAC TAATAAGAGC AATAAAACTA 2 52 0 

CCTAACAACA ACATCACAGC AGCAATAAGA CTAAAGATTT CTTTTGTTAT TTCCATTAAA 25 8 0 

5 

CACATGCCCC CCACCAATAA AGCGTGATAT TGAAACAGAA CTTACAAAAG ATATAATGGC 2 64 0 

AATGAGCATG ATTGAATCTA AGAAAGAAAC GGTGCCCATA AGTACACTTA ACACACCCAC 2 700 

AATTGACATT ACGACAGCAC TTGTTGTATC AAATGTAACG ACACGATCTG CTGTTGTAGG 276 0 

10 

TCCCTTGATT AATCTAAATA AACAGATGAT TAATGCAATT CCAAAAATAA TGAGTGAACT 2 82 0 

AATAATCATA ATATGTGTTA TTGTTTGTAT CATCGCGACA CCTCCAATAT TAAGTCTTCA 28 8 0 

;5 TAATGCTTAA TACTTCTTAA CAAACTATCT TTTTCTTTTT CTGACACGTC GATACTATGA 2 94 0 

ATAAAAAACT TTTTAGAGTC TTGAGAAATT CGTATTACTG TAGACCCTGG AGTTATAATA 3 000 

ATTAAAATTG TTAAAAATGT TATTGACCAA TCACTTGTTA GTCTTGTTTC ATATGAAAGT 3 06 0 

20 AATCCAGGGT TCATATCTTT TGTTTTAAAA AGAATATAAT TAATCGTGCT AATGCTAGAT 3120 

GTTATTAATT GATATAAATA AACACCTAAA AATTTAATAG CTACCCATAT TTTTCTAACA 3180 

TAAAAATCAT CGCTGAAAAA CCTGTGTAAT ATATAAATGA CAATTAAACC AATTAGATAT 324 0 

25 CCAGAAAAGA AAGTCGAGAA TTTAAAATGA TCTTCATCTT GAAATAATAC CCATAAGAAT 33 0 0 

GCAATGATAA TATTTAAAAC TATTTGATTC ATTTAGTCCT CTCCTTTCAA ATGCGGATTT 3 36 0 

ACAAGTTTTT GATATAATTG ATCACTCGTG TTCAACTCAG TTGCATCACT TGTAACATTT 34 2 0 

30 

AACACAACAG GTGCAGCAAT TCCGATTGCG ATAACCACAA CTACTAAAAT ACTTAAAATT 34 8 0 

CTTTTTCGAT ATAGCGGGAT TTTCTTAAAA TTAACTTCCT CCCCATCTTT ATCTCCAAAA 354 0 

TACATATAAA AAAGTATCCT AAATAAACTG TACATTGCAA TTAGACTAGT AATAATCATT 3 6 00 

35 

AACGCTAGTC CAATATAATT GCCATTTTGC AATGCACCTT GGAAAATAAG TACTTTCCCC 3 6 60 

GGAAAGCCAC TAAATGGAGG CACGCCGCCA ATAGCAAAAA TCATTATAAT AAACGCAACT 3 720 

CCAAATAAAG GTTCTTTTTT AGCTAAGCCA TTCAAATATT GATATTGTCG ATAGCCTGTA 3780 

40 

ATGTAAACTA AACTACCAAT AATAAAAAAT AGCAATGTTT TTACAACAAT GTCATTTACC 3 84 0 

AAATAAAATA TTGCACCATT AATACCTGCA AACGTGTTTG TTCCTAAACC TAAAATGATA 3 90 0 

45 AATCCTATTG AGATTATGAC TTGGTAAGCT GCAATCTTTT TAATATCTTT ATAAGCAATG 3 96 0 

ACACCTATAG CGCCGATGAC CATAGTTATA GCAGCCATAG TTGCTAGCAA TGGATGTATG 4 02 0 

AGATCATTAT GTTGATCAAA TAGTAAAGTG AAGAATCGAA TTAATGCATA GGCCCCTACT 4 08 0 

50 TTGGTCATTA ACGCTGCAAA TAATGCTGCA AGCTcAGTAT TTAACACAGC GTAGGCTTTG 414 0 

— ^r.n-n^-:, T^ARAA^A^ OA^rcr^^T -r^r^A^TA.A AT^PGACTAA ^AAGATTAAT 4?^ 
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ATTGTTCCAT GCACACCACA TAACGTCATA ATTAAAGCGT ATAAACTTCG CTTTGGTGGT 72 0 

TTCTCAGTCG TTGGATTATC ATCATCATTT TTAGTCATCA TTTTTTGGAA TGGACTGATG 780 

5 GCTAAATAAA AATAAGGATA TAAGACATAA ACCCAAACAA ATCTAAATAG ATAGACAGCT 84 0 

AAAGCAACAA CAATAGTGAT GCCTATTAAA AAGATTAAAT TGTGCGGTTC TGTTTTGATA 900 

ATTTTAATAA TAACTTCAGG TACTAAAAAT CCTAATATTG AAAAAACAAA GCCATTTAAA 960 

10 

ACATAACCTA GTATATTCCA TGTATGATTG TAACTCATTT GCAGTTGTGT ACGTACTTGC 1020 

ATAATTCTGT CACGTTCGAA ACCATGTACA AGTCCTGCAA CTACTGCTGC AATGATTCCT 1080 

GATGCGTGaA ACAATTCAGC AATTAAATAC GTAACAAATG GTGTTAACAA TTGAATAATT 1140 

75 

GT AAA CAT AT TAATGTTTTC ATATCCTCGA CGCATCAATG TTAATCGGAA CCTTACTAAT 1200 

GCCATACCTA TAAGTAAACC AACCACTGCG CCACCAATTG ATGCAATTAA AAACAACTGA 1260 

2Q ACAGCATCAA CAAGTGAAAA AGCACCTGTA ACTAATACTC CAACAGCTAT TTTAAATGAA 132 0 

ATAATACCAG CAGCATCATT CAATAATGAC TCACCTTCAA GAATTGTCAT TGCTCCTTTT 1380 

GGCAAGACCT TTCCTTTAGT GATTGCTTGC ACTGCTACTG CATCAGTAGG ACAAAGAATG 144 0 

25 GCAGCAATTG CAAATGCTGC TCCAATAGGT AAATCTGGCC AAATCCAATG AATAAATAAA 1500 

CCTACACCTA TCACAGTAGT AATGACTAAT CCTAATGCCA TCATCATCAC TGGCTTAATA 1560 

TATTTCCTTA AATGGACTCT AGAAACATTA ACACCTTCTA CAAATAACAA AGGCGCAATC 1620 

30 ATTGTTACCA TAAACAATTC AGAATCAAAA TTAAATTGAA CAGGGATTGG GGTAATAAAT 16 8 0 

AGTAACATGC CCAAGAAAAT TTGTATAAAT GCTAGGGGTA CTTTAGGTAT GAAAGTATGG 174 0 

ACAAACGAAC TTAGTATCAC AACAG CTATA AATATAAGAA TTGTTTCAAA TATTTCCAAA 18 0 0 

35 

CTTTCACCTC TCTAAAAAGT ATTGTTTAAT TGAAAATTAA GTATCACATC TCGTTGTAAT 18 6 0 

TATACTTTAG AGGATAAATT GAGTTAGCGA CCACAAAAGC ACTTTAATAT AGATATATGT 1920 

CTACGATTGC AGTACTTAAA TTTGCAATTA TTTAATTTTA TTTTATCACT AATTGTTTGT 1980 

40 

ATAAATAAAC AACTTGCTTT CACATAACAA CATTAACTTA TAATACAAAA AATGAGCACC 204 0 

TTAAAATCGA CTAACCAATT TCaAAGTACT CTTTTAATGA TTAATTTTGA AAACAGATTT 210 0 

4 $ TCaAAAGCAT TGTTATGCTT AACAATTTAG CCAACACTTC AATCGTTTTG ATACCATTTC 216 0 

TTACGATGCT CTTCTCGT7T TTCAGCACGT AATTGTAATG CTTCTGTAGA GTTTTGTTCA 222 0 

TTTGAACTTA ATAATATTGA TGCATGTGTG TGAGCATCAT TTTTTCGATA CATATAAGCG 22 8 0 

50 CCGTTGCGAT AAGCAGCGCG AGCGACTAAG TGCATGCCGA CTGGTGAAGT TAAATTAATA 2 34 0 

AAAACAAGTG ACAGTAATAA ACGCACACTG AAAAATCCTG TATTCACAAT AAAATAAATT 2400 
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GATCGTATTG TTTTAAACCA TCCACACCAA CACTAAAATC AGCAAATTGC TTCACAAATT 3 84 0 

TCGCTTTATG TTCAACACCA TAATTTAACA TATCGTGATA AACCAATACT TGACCATCTG 3 90 0 

TACCTTTTCC TGCACCAATA CCAATGACTG GAATTGTTAA GTGCTTGCTA ATTTCTTCTG 3 96 0 

CTAAATCATT TGGAATTGCT TCAAGTACTA ACGCAACTGC ACCAGCTTGT TCTACATTTT 4 02 0 

TCGCGTCTAA AATAAGTTGc TCCGCTGCTT CTTTCGTTGC ACCTTGTAAT TTATACCCCA 4 0 80 

TAACGCCAAC ACTTTGAGGT GTTAATCCTA AATGTGCAAC AACAGGAATA CCAATTGCCG 414 0 

TTGCTTTTTC AATAAATGGT GTAATATGCG CTCCTTCTGC TTTAATTGCA TTTGCATTCG 4 2 00 

TCTCCTGATA AAGCTTTAGA GCATGATTTA AGTCTTGTGT CATAGAGATG CCTACTGCAC 4260 

CAATCGGCAT ATCAACAACT ACAAATGTAT TTGGTGCGCC TCTTCTTACT GCACGACCGT 4 3 20 

GATGAATCAT ATCTGCTAAC GTCACTTGTA CGGTACTTTC ATAACCTAAT ACAGTCATAC 43 8 0 

CAAGTGAATC CCCAACAAGA ATCATATCAA TACCCGCTGC TTCCACTTGT TTAGCACTTG 44 40 

GAAAATCATA AGCTGTTACC ATAGAAATTT TAGTTTGCTT TTGTTTCATA TCTATTAATT 4 500 

GACTTACTGT TTTCAATGTT ATTCAACCTC TTTTTGCAGT ATnATTAGA 454 9 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

TTATCTTTTG TTGTTTCCTT AGACAAACGA CTAACCACAT TATAATGGAC TAATTTATTA 60 

ATTTTATTTA ATTCCATTAA GTTATCCGTA ACACTAAGTG AAGATGCGGA GTTCACTCTC 120 

GTTTGTACTC TTCGTTTTAA TAAAGCACCT CGTAATAATA CAATCATTCT TCTTATTAAT 180 

GATGCTTGTC TATATACCTG TGTTCTTTCA GCATAACGCA TATAGTTTTC AAGTACACTA 24 0 

TTCGTTATTT GTCCTTCATC TACTAAAGAC TCTAATGTTT TkGTTTCTAC ATTAAAAGCA 3 00 

ATTTTTTGTA GACGTTCTAA TTCTTTAGAG TTTTCATCAT CTTTCTCTAC AGTTTTTAAA 3 60 

AATGCTAATT TATCATGATA TTCTTTAATC ACGTTACCAT ATTTAAAACT TGTTTCGAAA 420 

GTAGATTTTT GATTTAGATA ATCAATAACT TGTTCTAATA TATAAATTCT AGCAACTTTA 4 80 

AACGACATAT TGCCAATTAC TGTTTTAGGT GCAGGTTTCG TTAATAATGG CAATAATACT 54 0 

T^CGrAAfTA TAAA<"TAAT AATAATCATA crArJATHrAA ~AAATAATAA r-^r-rrT. c - - 
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CCTGATTTTA ATCATATTGA AAATGGTCAA GTAGCTACTG GTTGGTTACG CGTATCACAT 2 040 

CGTGAATTAG ATCAAGAAAA ATCCTCAATC GCGCAACCTT GGCATAAACA TGAAACAGAA 2100 

TTAAAGTTGT CACAAGATGA GATTGTACCT GTTGAAATCG AATTGTTACC TTCAGGCACG 216 0 

CTATTTAAAC AAGGCGAAAC ATTGGAAGTT GTTGTAAAGG GTAGTGAAAT TGTAATTGGT 2220 

AATAGTACTC CTGGCATGAA AACACGTTAT GAACATGAAG AAACCGTAAA TAAAGGCATG 228 0 

CACATGATTT ATACTGGTGG TAAATATGAT TCACAATTAA TCATTCCTAT CGTTAATTGA 234 0 

TATGCAGCAA TTACGGTCGC TTTTGATTAA AAGTGACATA GTGATAGGAC TGTATAACAA 24 00 

GAGAAAGCCA CACGCTTGGA ATCTTAAACC AAGGTGTGGC CCTTTTTATT ATTGATGGCT 24 60 

ATTTAATTTT ATAACACTAT CGTATTTTCT TTTTCATGAA TCATTTCAAT AATGACATTA 2520 

TCTTCATTCA TTACTGCTAC TTTAGGTGCA TGGTTTTTAA TTTCTTCTTC ATTCAACTGT 2580 

GCATAAGTCA TGATTATGAC TACATCGCCT ACTTCAACAA GTCTTGACGC TGCACCGTTT 2 64 0 

AAACAAATTT TACCACTACC TCTTTCACCA GCTATTACGT ATGTTTCAAA ACGTGCACCA 2700 

TTATTATTAT TCACGATGGC TACTTTTTCA TTTGGCAAGA TGTCTACCGC TTCCAATATA 2760 

TCTGAATCAA TCGTAATGCT ACCTACATAA TTTAAATTTG ACTCAGTCAC TCTTGCTCTA 2 820 

TGAATTTTAG CATTCATCAT TGTTCTTATC ACTTTATTCA GCTCCAATTA TTATATTATC 2 8 80 

TATTAAACGC GCTTTTGAAA ATTTAACAGC TAAcGAGATA AATATGCGTC CAGTTATTTC 2 94 0 

30 GTGTTGTTCT ACTAATTGAG GATAACTATA AACAGCAACT TCTTCAATGC GTTCACTTAT 3000 

ATGTGATTCA AGATATTCAG TAACCCTGTC TATAATTACT TTACTTTGAC GTTCACCGTC 3 060 

TTGATACAAC GCTTGTGCTA ATAGCAAACT TTTACTTAAA TGTACCGCTT CTTGTCGTTC 3120 

TTGCTCCGTT AAATAAACAT TTCTTGAACT TTTCGCCAAA CCATCTGCTT CTCGAACGAT 3180 

ATCA^TACCA ATAATTTCAA CGGCATGATT GAAGTCTTTT ACCmTTTGCT CGaCAATAGC 324 0 

CAATTGCTGG GCATCTTTTT T AC CAAAATA AGCATAATCC GGCATAACAA TATTAAATAG 33 00 

CTTATTAACT ACTGTTACCA CCCCATCAAA ATGCCCTGGr CCGtTCGCTC CTTCTAACAC 3 360 

ATCAGCTAAT GGGCCTACTT TGACATCAAT ACCTAATTCA CCTGGATACA TATCTTCTAC 34 20 

TGCAGGATGA AAAACAATGT CCGCTCCTAC TTCTGATACT AATTCTAAAT CTTTATCAAT 34 8 0 

TTGTCTCGGA TAAGCATCGA AATCTTCGTT TGGACCAAAT TGTAATGGAT TAACAAATAC 3 54 0 

ACTCACAATT GTAATATCAT TTGTACTAAC TGATTCGCGT ACCATCGTTA AATGTCCATC 3 600 

ATGTAAGGCA CCCATTGTTG GGATAAAACC AATCGTTGTG CCTGAGCGTT TGGCTGCTTT 3 660 

AACAATGTGT TGCATCTCTT TTACCGTAGT AATCAGCTTA GTCATTGTTA TTAAC CTCAT 3 720 
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AATCTTTACC CATACGAAAC ATCAATTGAT AAAATGCGAT GTCTTTTTCT ATCATTTCTA 24 0 

TTAAAACGGT CATAATTTGA TGTATGTTAT CCGTGGATAA CTTAACTGCT CCATTTAACT 3 00 

TCTCATCATG AATGAAGTCT CTTATTTCCT CCAACTGCTG GTCCTCTAAT TTTTCAAGCA 360 

AATCATACTT AT CAT AAT AA TGCGTATAAA ATGTACTACG GTTAACATCA GCTAAATCTG 420 

CAATTTGTTG CACAGTAATC TCTTCTAATT GGTGTTGATG TAAAAGTTCA AT AAA TG CAT 4 80 

TTCTCATTGC AACTTGTGAT TTTCTAATAC GTCGATCTAT AGTCATTTAT ATCAAGTCCT 54 0 

CCCCAATGAT TATAAACGTT ATGTTCATTA TCCCACAAAT CTCCAACATT GATGATTGGC 600 

ACACAATGTT TACCTGTTTA ATATAGGTGA TACAAACAAA CAGAAAAAGG TGATAACAAT 66 0 

GAACCAACAT TTACTAGGAA ATCCAAAATT AACTGTAACT CATGTCAATG AAGTTAAAGC 720 

CGGTATTAAC CACATCGTTG TCGACAGTGT TCAATATGGA AATCAAGAAA TGATTATGGA 7 80 

AAAAGATGTC ACTGTGGAAA TGCGCGATGG CGAAAAATTA TATATTAATA TTTTCAGACC B4 0 

AAATAAAGAT GGCAAATTCC CTGTAGTTAT GTCTGCAGAT ACTTACGGTA AAGATAATAA 900 

GCCTAAAATC ACAAATATGG GTGCCCTTTG GCCAACATTA GGTACCATTC CGACATCTAG 960 

25 TTTTACACCT GAAGAATCAC CAGACCCAGG ATTTTGGGTG CCAAATGATT ATGTTGTAGT 10 2 0 

TAAAGTTGCA TTACGCGGTA GTGACAAATC CAAAGGCGTC TTATCTCCAT GGTCAAAAAG 1080 

AGAAGCGGAA GATTATTACG ArTGATTGAA TGGGCAGCAA ATCAGTCATG GAGTAATGGA 114 0 

AATATCGGGA CAAATGGTGT TTCTTATCTT GCGGTGACTC AATGGTGGGT CG CAT CATT A 12 00 

AATCCACCAC ATTTAAAAgC AAtGATTCCT TGGGAAGGCT TAAATGATAT GTATAgAGAA 1260 

GTAGCCTTTC ACGGAGGTAT mCCAGATACT GGCTTTTATC GTTTCTGGAC TCAAGGTATT 13 20 

TTTGCGAGAT GGACAGATAA TCCAAATATC GAAGATTTGA TTCAAGCACA ACAAGAACAT 13 80 

CCTCTGTTCG ATGATTTTTG GAAACAGCGT CAAGTGCCAT TATCACAAAT TAAAACACCT 14 4 0 

CTACTAACAT GTGCTAGTTG GTCTACACAA GGTTTGCACA ACCGTGGCTC TTTTGAAGGA 15 0 0 

TTTAAACAAG CTGCATCTGA AGAAAAATGG CTATATGTGC ATGGACGTAA AGAGTGGGAA 156 0 

AGTTACTACG CTAGAGAAAA TCTCGAACGC CAAAAATCAT TCTTTGATTT TTACCTTAAA 162 0 

45 GAAGAAAATA ACGATTGGAA AGATACGCCT CATGTCATTT ATGAAGTTAG AG AT CAATTT 16 80 

TATAAAGGCG AATTCAAATC AGCGTCACGT GTCCCTTTAC CTAACGCAGA ATATACACCA 174 0 

TTGTATTTGA ATGCTGAAAA TCACACATTG AATCATGCAA AGATTAGTAG CGCGCATGTC 180 0 

50 GCACAATATG ACTCTGAAGA TAAACAACAA GATGTAAGTT TTAAATATAC GTTTGACAAA 18 60 
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TCTAATAAGT ATGATTTGAT GACTTCTTTT AATCGTTTGC CAGCTTCATC TGAACCAATA 12720 

ATAATCGCCA TAATAAGACT CCTTTTTACT TTAATTTTGA AATACCTTTC TTAAAATGTG 127 8 0 

5 ACATATTTAT TTGTAGGTTA TGAAAATCTT GAGAAAAGGC TTTCAATTTG ATTACGTTTA 12 840 

AATTATAAAC ATAAACAAAC AATAAATCAA CATAATATGT TTATAATATG TTTGTTTATG 12 900 

ACGTATTTTC AAACAATAAG TGAACATTCA TATTGTGGTG TTGTTTTAAT TAGGTATTCG 12960 

10 

TCTGAAATTG TAGTAAAACT TTGTCGAGGT TCCCGTTGaC ATAAATTTGC ATAAAAAAtA 13 020 

GCCCATAAAT GAATGCAAAT TCACATTCAC TTATGAGCAT ATAGATACAT ATTTTAACAA 13 080 

TGCAGTTATA CTTTTAATTT AGTCGACTAC TTCAATATAT GTTTTAATCG TTTCTACTTT 13140 

15 

TTCTTCATCT TCATAGTCCA TGACCACTGC AGTCAATTCG TTTAACTGAC AAAATGATGT 13200 

AAAATCTTCT TTGCCAACTT TCGTATGATC GATTAACAAG TATTTTTCAA TTGAATTACT 1326 0 

20 TAGTGCCAGT TGTTGCGTAT AGGCTTCATC TAATGTAGAT GTCATCACAG CACCTTTATT 13 320 

TACTGCGTTA CTACTAAAGA ACATCTTGCT AAATCTTAGT TTTTCCAACA TGGCGTTCGC 133 80 

cATTTCACCT ACAAATGCTT CTGTAATATG GCG CATTTCA CCACCAATTA AATAGACACG 1344 0 

25 AAAATGTGCT GTTTGTTTTT CTAACAAAAT TTTATACACC GGCAAACAAT TCGTAATAAT 13 500 

TGTGAGCGTA TGATGATTGA CTTCTTCTGC TAATAGTTCC ACTGTTGTTC CTGGTCCGAA 13 560 

AAACAAAGTA TCCCCATCTT CAATTAATGA TGCAGCTTTT TTAGCTATAA ATCGTTTTTC 13 620 

30 TGCAATTTGA CGGGTATGTT TTTCTTTATG CGATATTTCT TTATACTGAA ATGTTGAATT 13 6 80 

ACTGCGTGCA CCACCATGAA TCTTCGTTAA AATCCCTTTA TTTTCCAATT CAATTAAATC 13 740 

TCTTCGAACT GTCATATCAG ACACATTTAA ACCTTCGACG ATTTCATTCG TTCTTATCGT 13 800 

35 

GCCCTTTTTA TTCACTAGTT TAGCAATTTC GTCCAAACGT TCATGTTTAT TCAATGTAAA 13 860 

ATTGCSrrC 13 86 8 
(2) INFORMATION FOR SEQ ID NO: 174: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 54 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
50 TTAAGTCAAC TTTGTCTATA CGGTTTGGAT CtTCTaCCCA ATGTCTTATA AAAGACAATC 6 0 

CCGCACCTGA AACATAACTC ATGAAATAAG AAAATGGTAT ACCATTAATT TGATCATTTT 12 0 



55 



EP 0 786 519 A2 





TCATCATAAT 


TATTTAAATT 


GACATAACCT 


GTTTGTGCTT 


CTTGTGCATT 


CAGCATGCCT 


10920 




AAAGTATTGG 


CI TITITTAG 


TAAATCGTGG 


TCGTTTTCAT 


GATTAAGAAT 


TGCTGAAGTA 


10980 


5 


ATTCCAGCAA 


CTGTAGAATC 


ACCTGAACCA 


ACCGGATTTA 


ATACACTTAT 


TGTCGGAATA 


11040 




TTCACTCTAT 


AGAATGTATG 


ATTGTG CTTA 


GCGAATGCAC 


CTTGTGCACC 


TAAAGACACA 


11100 


10 


ATAATCCACT 


CAATCCCTTC 


GAATAAGGGT 


TGTGACACTG 


CCTGTTTCAA 


ACTTTCTAAA 


11160 


CTTTCATCAA 


GTGGCTGGTT 


AAGCAATTGA 


TATAGTTCAG 


AAATGTTTGG 


TTTAATGACT 


11220 




GTAGGTTTGT 


ATGGATTTTC 


CAAAA CTGTT 


TGCAAAGTtG 


CACCCGAGCA 


ATCTAATATC 


11280 


15 


ACAGGCACAC 


CTTrGTTTTG 


GCATCGTTCA 


ATGATTTGTG 


CATAATAATC 


TTGATTTAAT 


11340 




CCTTTAGGTA 


AGCTACCTGA 


AATAGCAACT 


GCTTCAACTT 


TTTCTAATAA 


TTGTT CAAAA 


11400 




TGTTTAATAA ATCCTGCAGC 


CTCTTGATTA 


TCAATCTCCG 


GTCCCTGCTC 


TAAAATTTCT 


11460 




v7 X L X \J X 1 V7wV_ 




AATTGCAATG 


CAGTTTCGTG 


TTTCACCCTT 


AATGTTATAA 


11520 




f\r\.i. VjwiiuL. x. 




ATGATCTAAT 


TTTTTAGCAA 


TAAATTGACC 


TAATTCACCG 


11580 




CCAATAAAAC 




GACTGGCTCA 


CCTACTTGCG 


CAAGTACTCT 


TGTTACATTT 


11640 


25 


AAACCTTTAC 




TTTACTTACT 


TCTTG AACAC 


G ATT AA CATC 


ATCTAATTTC 


11700 




AATG CTGTT A 




AATATCAACG 


GATGGATTTA 


ATGTTAAAGT 


TAAAATCATA 


11760 




TGTGTCGTCC 


CTT AAT CGTG 


GTATTCGCCT 


CTGTCCCATT 


TTTCTAAGAA 


TTCATCAAAG 


11820 


30 


AAATGTGGAT 


CAGCTTGATC 


TGCATTGCTT 


GTTTCTAAAT 


GTTTAATTTT 


AGCGATTAAT 


11880 




TTTTTGTTCT 


CTTCAGTTGG 


TTTATATTCA 


GCATTAATAA 


ATGCATCGAT 


AATATCGCAC 


11940 


35 


ATTAATAACT 


CACCTATAAT 


ACGTCCACCG 


AAGCCAATAA 


CGTTCGCATT 


TAATTCTTCT 


12000 


TTAGCGTATA 


ACGCTGATGT 


CATATCACGT 


ACTAGTGCTG 


AACGAACGCC 


AGGTACTTTA 


12060 




TTTACAGCAT 


TGTTAATACC 


AACAC CTGTT 


CCACAAATAC 


AAACACCTAA 


GTCTGCATTA 


12120 


40 


CCGCTAACAA 


CTTGTTCGCC 


AACTTTTTTA 


CCAAAAATTG 


GATAATGTGT 


TCTTGTGAAA 


12180 




TCGTATGTTC 


CTACGTCAAT 


GACTTCATGT 


CCTTTTGATT 


TTAAAAATTC 


AGATACACGC 


12240 




ATTTTTGTAT 


CTGTAACAAT 


ATGGTCGCAT 


CCTAATGCAA 


TCTTCATAGT 


AATTTTTCCT 


12300 


JS 


CCTTAGCACA 


TTTTATTAAG 


CATATCTACG 


CGGATTTGGT 


GTCTACCACC 


ATCGTATTTA 


12360 




CCTTCAACAA 


AACCTTTAAC 


GACATTTTTC 


GCTAATGTGT 


CTCCAACAAT 


TTCAGATCCC 


12420 




AT AG TG AT CA 


TTCTTGAATT 


GTTATGGCCT 


CTAGTCATAT 


ATCCAGAGCG 


TTCATCTGAT 


12480 


SO 


ACTTCAGCAG 


CAATCATGCC 


TTTGATTTTT 


GTAGCAACCA 


TAAAGCTACC 


TGCACCAAAT 


12540 
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AAATGCTGAT AAGAATGCTG TTAATAAACC TTTAGTTCCC ATAAATGCAC TTAAGAATCC 912 0 

AC CATCTTTG GCTGGATCAG AAGCTAAGAA CAAGAATCCA CACATCGCTG CTAGCATTGT 918 0 

AGAAATAAAG TTAATTTGAT TTGTACTTTC TAGCTTACGG TTAAATGAAT CTGTTAAAGA 924 0 

TTTCGCTGTC GTTCCTGCTA CTAAAAATGC TACAAGCCCC ATCGTATAGT TATATGGTTT 93 0 0 

CATTAAAATG GCTTCCATGC CTTTATCCCA TTTAAAACCA AATATATTTG GCACATATGC 93 6 0 

AATTAATAGA AAGATACTTG AGAATAAGAT GACAGGCATT GCAGAAATAA ATCCATCACG 94 20 

GATGGCTCTT AAATATATGT TACGTGATAA TTTCTCGAAA AATGGCTTCC CTTTTTCAAT 94 80 

TTGTGCGATC AATTTTTGCA TCATTGTCAT CACCCTCTTT TATAAAATTC TAATAAATGC 9540 

TTCATTAAAT CTTTCAGTAA AATTGTTGTC ATTAAATGAT CTTGACCATG CATCATCGTT 960 0 

ACACTGTATG CAATATCATC ACCTTGCGCT TCTTTAGCCA ACAGGCTTGT TTGTGCTCTA 9660 

TGCGCTTCCG CAATGCAATT GTTTCCTTCT TCAATCAGTG CATCTGCTTT TGCAAAATCT 9720 

CCAGCTTGAG CTGCTGTTAA TGCTTCTAAA AACTTAGAAC GTGCATCCCC TGCAAATGCA 97 80 

ACAATTTCAA AACCTAATAA TTGGACTTCT TCTCTATTCA TAGCATTAAT CCCCTTTTAA 984 0 

ACTTATTTTC TTTGTTTCCA AGATGTCGCA GTATCTTTTA ATACTTTATT TAAGTCATCA 9900 

ATATTTTTGA AACCAGTTGT ACGTAACCAT TCACGAGCAG CATCTTCACC TTGTTCAATG 9960 

TATACTTGAA CAGCACCAGA CCATGTAGCA CGGCCACAAA GTACCCCGTT GAATTTAGCA 10020 

30 CCAGCTTCGT GAGCAAATTT TAAAGTTTCT TGGAATAATT CCGCAGAAAC ACCAGCACTT 10080 

AAGTAAATGT ATGGTAAATG AGTTGCTGCA TCTTGATCTT TAAAGTGTTG TGCCGCTTCC 1014 0 

TCTTTTGTAT AAACCACTTC ACCTTCAGCA AAGCCTTCTA CATATTTCAT GTTTACTGGT 10200 

ACTTCAACTT TCAATACATC AACGTTAAAG CGTGGTTCTG AGAATAATTT CATTGCTTCG 10260 

TTAACCTTTC TAGGCTTAAC TTTTGCGAAT TcAACAcTAC CGTTATcAGG AATGTTGTCA 10320 

TCGTATGTTA ATACTTCTAA AAAGAATGGA ATATCTTCTG CAACACATTC TGAACCGATT 10380 

CTTTCAATGT ATGCTTTCTT TTGAATGTTA ATTTCTTCAG CATCATCAAC AT CATAGTAA 10440 

AGTAAGAATT TAACAGCATT TGCGCCTTGT TCTTTTAAAC GTTTTGCAGA CCACTCTACT 1050 0 

AAACAGTCAG GTAAACGACC TTTAGCGTTT ACGTCATATC CAGTTTTTTC ATAAGCAAGT 10560 

AATAATCCAC AATCTTTGTT ACGTGCATCT GAAGCTGGTA AACCATATTC AGGATCTAAT 10620 

AAAATTGAAG ATGCATATTG TGTTAATTCT TCCGCAACTA ATACTTTTAA TTGTTCAATT 106 8 0 

TGAGCTACAG TTGGTTCTTC AGTTTGATGT TTTGCCATCA TGCGTTTTAA AGCACCACGT 10740 

TGGTCAAATG CTAATGCAGA AATGATACCT TCGTTGTTAC TTAATTGTTC AATTGATGCG 10800 



55 



35 



40 



45 



50 



EP0 786 519 A2 



10 



15 



20 



CTACTAAATA TTGACCATCA CCAATAGGTC CAATTTCATT GAATGTAGTC CAATATTTTA 73 2 0 

CTTCTGGGAA TTCTTTAAAA CAATATTCAG CATAATCTAC AAAGTAGTCA ATCGTTTTAC 73 8 0 

GATTTAGAAA ATCGCCATCT TTGTGTAaCA CTTCTGGTGT ATCAAAATGA TGCAATGTTA 744 0 

CAAATGGTTC AACATGACGT TTATGACACT CTGCAAATAA CTTATGGTAA TACTCAACAC 75 0 0 

CTTTAGGGTT AACTTCGCCA TATCCATTTG GGAAGATACG AGACCATGCA ATTGAAATTC 7560 

GGATACCATT AACACCGAAT TTTTCACTTA ATTCTAAATC CACTGGATAT CTGTTATAAA 762 0 

AATCACTCGC TGGTTCTGCA GTGTACCAAT AGTTTTCTTC TAAATACGTA TCCCATGCTA 76 8 0 

CGCGACCTTT ACCATCCGTA TTTGTCGCAC CTTCTGCTTG ATATGCTGCT GTTGCTCCAC 774 0 

CAAAAATAAA ATCTTCAGGT AATGTTTTAG TCATATGAAA AACTCCTATT CTTAATTTTC 78 0 0 

AAATTGTTGT TGAACGAAAT CAAGGGCTGC TTGGCCATCT CGTGTCAATT TGATATATTC 78 60 

AGCACCTTGA GTCTTCGCTA ATTTAATACC TAATCTATCT GTATCTTGCT TAATATCTTC 7 92 0 

ATAGTTAGAC GCAACTTGTG GCGCTAAAAT GATTAATTGG TACTCTTTCA TAATGTCCAT 7 98 0 

ATGTGCGCCA TATCCGCCAG cTGCCGCTTT CACTGGCACA TGATATTCTT CAGCTGCTTT 804 0 

25 ATTAAGTGCA TTGGCTAATA ATCCACTTGT ACCACCACCG GCACAAAGTA CTAAGACATT 810 0 

TGTTTGTTCT GTGATATTTG AAGCTTTAGC TGCATCGTCT GATACACCAC TTGCCGCTAA 816 0 

AATTGAATCA GCTTTTTTCG TATCAAAGTT TGCTGCAACT TTTTCTTTTA AATCTGAATT 822 0 

ACTTTCTTTA CGTCCTTCTT CTTCATCAAG AATTTCACTA TCATAAACTT TTAGGAATGG 82 8 0 

GTAGTAAATA ATAATATCTA CAACAATCAA AGTAATAGCT AGTACGAATG ACCATAAACC 834 0 

AAAACCTGTA CCCATGATAA TGCCCAATGG ACCTGGTGTT GTCCAAGGTA AATTCACACT 84 0 0 

AAAACTATTC ATTCCTAACA CTTCAACGAA AAGTTTGAAA ATCCATACGT TAACAATTGG 84 6 0 

TGCTAATACA AATGGAATAA AGAACACAGG ATTCAATACT AGTGGTGCAC CAAATAAAAT 8 52 0 

TGGTTCGTTT ACACCAAAGA ATGTTGGTAC AACTGATGCA CGTCCAATCG CTTTGTTTCG 8 580 

TTTAGATTTC GTCATCCACA TAAACATGAA CGGGACGACC AATGTTGCAC CCGTACCTCC 8 64 0 

AAATGTAACG ATAAACATTT GTGTACCTGA TGTAATAATT TTATCTGCGT GTTCTCCAGC 8 700 

TTGAAGCAAC TTGAAGTTCG CTTCGATATT CGCATATGTA ATGGCTGCAA TTGCTGGCTC 8"? 6 0 

TACAATTGAC GGACCATGAA TACCTACAAA CCAGAATAAT GCAAAGGCAC CAAAGATAAT 8 82 0 

TGTGACACCA AT C CATC CAT CTGCTGCTGT AAATAATGGT TCGAATAATT TTAAAATACC 8 880 

TTCCGCTACA TTTGATTTAA AGCTGTTGCG AATGACTAAA TCTAATGCAT AAAGAATGAT 8 94 0 
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ATCTTTTAGC ACGCGTAATT GCTGATAAGG TTGATTCATT CGACTTGGTT TACCATCTTT 5520 

ATCAACTGTA ATTTCATTGA CATCTTCATT CATATTTAAA ACACCATTAA ATGTCCCTGC 5580 

AATATTCACT TGTTTGTTTA ACTGTGGCAG TGACTTGTCG TTACCATATG TCATCATATA 564 0 

TTGTGCAAAT GTTAAGTTCC CCATTGAGTG ACCGACAAAG TTGAATTTAT CGAAATTGTA 57 00 

TTCAGATTGT AACTTAGTCA GTACATTTTT AAACCACGCA GCATTCTTAT CCAAATAGCC 576 0 

TTGTCTGTTA TTTTCAAGTT CAATTTTCAC AATAGGATTC ACTGCATCTT TTCTTAGTTT 5820 

CCCTTTAAAT GTCACTGCAC CATCCTTTGA AACGTAAGCA GTGATGATAT CTTTAGTTAC 5880 

CCCT CT TTTT TCTGCTTGCT TCACCATAAA CTTTTCAGAA TTGGCACTAC CACCAAATCC 594 0 

ATGTAAGAAC AATGTTGGAA TTGGCTTTTT AACAAATTGC TGTTGTTGTA TTTTAAATGT 6000 

TTGTGCCTGT CGTTGACTAA ACACCACCAT AATAATAGAG CCTATAATAA TAGCGACCGC 6060 

TAACAATGTC GTAATAATTA CAAAAATTTT CTTCACACTT TTAACTCCCA TTCATGTCTT 6120 

TTATATAAGT ATAAAGGATG TGATTAAAAA TGTCCTTTAG TTGATTTTGA ATACATCATT 6180 

AACTTTTAAG ATGACTTTGG AAAGTTGTCC GTTAACGTTT GTTAATTGAT TGCTTCTTTA 624 0 

GCTTTCAATG GTGTGTCACC CATTGATTAA TATATAAATA TGTATATGCA TGTTTAATTT 6300 

ATCTCTTTCT ATAAATAAAG ACCTACCAGC ACTCGACTGA TAGGCCTTTT AATATCTATA 6360 

ATTATTTAAT TTCTTTTGTT TCGGCTAACT CTTTGTACCA ATAAGCACTT TTCTTAGGAT 6420 

30 AACGTTCTTG AGTCTCAAAA TCGACATAGA ATAAACCATA TCGTTTTTCA TAACCATTTG 64 80 

ACCAAGAGAA CACATCCATT AATGACCAAA TAAAGTAACC TTTAACATTT GCAC CATCTA 6540 

TAATAGCATC TGCAATAACG TTCAAATGTT GTCTTACATA ATCAATACGT GCATCATCAT 6600 

35 GAACTGTTTT TTCAGATTCA ATAAATTCAT CTTTATATCC TAAACCATTT TCAGTGATAT 6660 

AAAXCTTATG aTAGTTAGGA TAATCTTTAA CAACACGCAT GaTTTGATCA TATAAACCTT 6720 

GAGGATAGAT CATCCAGTCC CAGTCTGTGC GAGGTACGTC GACATCAAAT TCACGTTGTC 6780 

CGACACCTTT AAGTTGGTAT TTAGAACCGC CTTTATCACC TGTCGCATTA TGCGTGATTT 684 0 

CAGATTCTCC ATCGTAACCT CTCATCCAAT CACTCATGTA GTAATTGATA CCTAAGAAGT 6 90 0 

CGTTTAAATC TTTGGCTGCA TCTAAAATGG CATAATCTTC ATCTGTAATG TTTAATTTAC 6960 

CGCCATTAAC AGATAAGATA TGTTGCACAC CTTCCATCGT TTCACGAGAA TACTTACCTA 7020 

AATATGTTGC ATCTAAGATG AATTTATTAT GGATGATATC TTCTAATTCT GCTGCACGAA 7080 

CATCTTCAGG ATTTGATGGA TCGAACGGAT ATTTTGTTGG CAATGCGTGT ACAACACCAA 714 0 

TTTCTCCTTT GTATCCGCCA TCTTTAAATA ATTTTACTGC TCTAGCATGA GCCACCATCA 7200 
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ATTTCCTTCT TCCAATTTAA AGGGCTTCTC AAATCCTATC ATTTTCATAT CGTTTCACCT 3 72 0 

CATTTATGAA CTTATTTCTT ATTATACAAA ATAGAAGCCA TGTGTGCTTA TATCGCAGCA 3780 

TCATGACTCC TTTTTCATTT GAATATATAA ATAATTACAG ACGACTTTCG TATTAAATTT 3 84 0 

TAGACTTATT TCTACCATGT TGCTGAACAA ATTTACTTTA GATAAAAAAT TATTAAATTT 3 90 0 

TGGTCAATTA ACAAAGTTAG TTTGTTAAAA CGTgATACTT TATTATTCCG TTACTTTAAT 396 0 

AACTTGTTTA CCAAAGTTAT CGCCAGTaAA TAAATTTTTA AATGCATGTG GCGCATTTTC 4 020 

AAAACCATCT TCAATGGTTA CTTGTGACTG AATTTTACCT TCTTGAACCC ATGTTGCAAG 4 08 0 

CTGTTCACTA GCTTCTTTAA AAGCATTAGC GAATTCACTT ACCAAGAAGC CTCTCATCAT 414 0 

TACTTGCTTC TTAATAAGCG TACCTTGAAT ACGTGGTCCG ATATCGGCTT CAGGATGATT 4 200 

ATATGACGAA ATTGCGCCAC ATACTGGTAC ACGTGCAAAA CGATTTAAAT GCTTAAATAC 4260 

TT CAT CG CCA ACTGTTCCAC CAACATTTTC AAAATAAACA TCAATACCAT CTGGTACTGC 432 0 

TTGTGCTAAC GCTTCTGCAA AATCCTCTTT CTTATAATCA ATACCAGCGT CAAAGC CCAG 43 8 0 

TGTCTCTGTT AAATAATTTA CTTTTTTGTC GCCACCCGCA ATACCTACTA CACGGCAACC 444 0 

25 TTTAATCTTA GCAATTTGAC CTACAACTGA ACCTACAGCA CCAGATGCAG CTGAAACCAC 4 500 

AACAGTATCA CCGGCTTTAG GTTGTCCAAT ATCAAGCAGA CCATGATATG CTGTTTGTCC 4 560 

TGGCATTCCT AAAACACTTA AATATAAATC AAGTGGTACA TCTGTCGTTG GAACTTTAGT 4 62 0 

AATTTGATCC G CTTGGAC AT GATTAATGAT TCGCCAAGGC AACATACCTA CAACGACATC 4 6 80 

TCCTTTTTTA TAATCTGCGA GTGTCGAATC AATTACTTTT GCAACGACAT GGCTAACAAT 4 74 0 

CGGTTTACCA ATTTCAAAAG GCTGTACATA CGAATCTGCC TTAGTCATAC GTCCTCTCAT 4 8 00 

ATATGGATCC ACTGAAATAT ACAGCGTTTG TACAAGTACA CCATCGCTCT CAAGTTTaGG 4 860 

CGTGTCAATC TCTTCaATTT TGAATGTATC CTCTTGAGGC ATGCCkTCTG GTATTTTGTT 4 92 0 

AAAAAGAATT TGTTTATTTT GCATCATTAA TCACCTTTCT TTATTTGAAA CTTTTACTTA 4 980 

TTTGTTACTT AAGCGTTAAG TTTGAATTGT GTCtTCGTGA TGTCTGTATG CAAATACATT 5 04 0 

CTTAG TTGTT ATATTTTGAC TTAAGCACTG ATTCATTCAT GTAACTTCAA CCACATTATA 5100 

45 TTTGCTATAA TCATAAATTT AAAATGTTAC GACTTAGACA TTTTATGGAA ACTCTCAAAC 516 0 

AATAGATAAT TTTTGAAAAG CTCTAATATT ACAAGCTTTT TTGCCCCAGA AAAACTAGCA 5 22 0 

GTTGCTTTAT TTTTTCCATA AGAAGTCGAT TAACTCATTA GCAACATTTT CATTCTCATG 52 8 0 

AAGCTG AC T A TGTTGTGCAG GCTCACCTTC ATATTTAGAT TCTCGATAAC TTTTCGGACT 534 0 
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CATTGCCTTT TCACCTAATA AAGGCGTCAA 
ATGAAAGCTA AAATATGCAA ACACTAGCGC 
5 CCCCCAATGG AAAAATGAAA ACTGCATTGC 

ATGAATAGGC GTTAATTTGA AGGCATGACT 
TCCTATTCCC ATACCAGCAC TAAATAACAT 

w 

ATCTTCGCCT TCTTCACCTA ATGTAATGTT 
GACAAATAAA ATAACTAAAA CGAGCAATAA 
TGTAGTAATG TTTTGCGTGA GTTTTTCTAA 

15 

AAATATCGTA CATATCACTA AAGATACCCA 
AAATACAAAC CCTTTCTGTG TAATGGTAAG 

2Q TTTGTAATTT TATATAGACA CAATTAATAA 

CACACTCAAA TTATTTATCA TTGAGCAACA 
TCTAGTGCAT TATTGATTTG ATACCTGCAA 

25 TCGCTAGACA CAAAAAATAA GTCGCCTATA 

ACTAATCTAT ATATTAATTA CTAATTTTCC 
TGCTTGATAT AAGTTTTCAG GTGATAAACC 

30 ATGCCCTGAT TCTATATTTT TCGTAATATC 

AGGCGTTCGA TGAATTGGAC GCGCAAACAT 
TTTTAATGCA TTTAAATCTT GATCTTCATT 

35 

TGGTTTTATC AGTTCAATCA TAGTATTGTA 
ATAATCTACT AATGGAATTT CTTTAAATTG 
GATATCTGCG CCCATCTTTT CACACCACTC 

40 

GACAGTTAAA CCATACCGTT TAGCAATTTG 
ATTAATGATT AAGACAGACT TCCCTTCGTT 

45 ATCAAAAAAC GTTTCATATG CCGTAATACC 

ACTTATGTTG TGTGGTGCTT TTGCAACTAT 
TGATCCTTGT CTATTTGGCG ATCCAGCATA 

50 AACGTCTGGT CCGATAGCTT CAACAGTACC 

TTGAGTGACT TCCATTTGTC GTTGCTTTGT 



TGTAGCGCTG ACTAAGCCAG GATATCCTTT 192 0 

GACAATACCA TAGACTGCCC ATGCATGAAT 1980 

ATCATTAATT GCAGATTGCG TGCCAGCTTT 2 04 0 

GATTGGTTCT GCCGTTGTCC AGAACACAAG 2100 

AGCAAACCAA GACGGCAATG AGAATTCAGG 2160 

TGCGTATCTC GAAAATAAAA TATACACACA 222 0 

ATAATACCAA GAAAAATGTA GCGCAATAAA 22 80 

CTGTTTCGGA AATATTGCTC CAAAAGCAAC 234 0 

AAACACTAGA CTTACTGATT TATTTTTCAT 2400 

TTCATACCCA TAACTGCAAC ATTTTAATCA 2460 

TGCCTCATCT TTTAAAAATG ATATATAAAA 252 0 

AAGTATTTTA TTGTATTTAA GTAATGCCTT 2580 

AGTTGCCATA TTTCCGTTTA GAATCAATAG 264 0 

CAGTATTTTC TGCATAAGGC GACTTTACTT 2 700 

AATCATTGAT TGTTTTTCCA ACAATTGATG 2 760 

TTCAAAAACT TGTGTCGTTG TTGGTTGGTA 2 820 

TTCTAAATAC TCATGTTGTT TAATCATATC 2 380 

AAATTCATGT GTAAATGTTA TACTTTTTAA 2 94 0 

AAAAGCTACG ATAGTCGTAA TATGCCCTAA 3 000 

ATACAAGTCT GTATTATAGG TGCAAAATAT 3 060 

ACGCACTAAA TCCTCTTTAT GATTCAATAC 3120 

TGTTGTTTCT TGTCGTGATG CTGTTGTAAT 3180 

AGTGGCTATA CTGCCTACAC CACCGGCACC 324 0 

TTCAGCAGGA TTCGTAGAAA TTTTAAATGT 33 00 

AGTTAGCGGT AGACTAACCG CTTCATTAGC 3 3 60 

AGCTTCTGAC ACCAATTGAT ATGTCGCATT 3420 

AAATACAACG TCACCCGGAC TAAATAATGT 34 80 

AATAGCATCA AACCCAAGTA CACGAGGTGC 354 0 

ATCTACAGGA TTTACACTAA TGCTATTTAC 36 00 
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ATTAATCACT TGTTGTGTAG AGTCTTGTCC 
CCTCCCTTAC AACACTCGTG GACCAGAAGT 
5 ACGTACTGGC ATTTCTGTGA AATATTCTAC 

AGTTAAAGAT TGAACTTGAA CATTATTTGG 
AGATTTATTT TTTTTGTTAT ATAGTTTCGC 

w 

TTTTGCAACC GTATCGAATG CCGTCTTTTC 
ATCATCGAAG TCTTTGTATA ATCCAGGTTT 
TAATATAAGA CCTATTAATA TCAATAGTGC 

15 

TTCAAATTGA AGGAAGTAGT CTTGATATTC 
TAACAGGAAC CCCACGaTTA CTACTATTAA 

20 CACCCCTAAC GACACCTCCT TAGTTAAAGT 

TCAGTTTTTT TCTTAAACAT AATAAATTAA 
TGAAGTACCA TTTAGTAATA TATTCCCTAG 

25 ATTyTCGTCA ATATGTTTTG AAGAACGATA 

TAAAACCTTA TTACTATTTC TCGGTTTCTC 
CATCGCTATA ATGACCACGG AAAATGGTAA 
TTGAGTACCA CCTGTGTAAA TCATGATGAT 
CACTTTGACA AATGACTTCG GATTAATATC 
AGTTGCCGAA TCCGCTGATG TAACAAAGAA 

35 

TAATACAAAA CCTAGCGGAT AATGTTGTAG 
TTTA2CAATA TCGGCAATAT GATTATCTTG 
AAAGAATATA AAGCAAACTA ACGCCGGGAC 

40 

CGTACGTCCT TTTGACACAC GTGCAATAAA 
CCATGCCCAG TAAAAGATTG TCCAATTTTG 

45 AATGCGTAAA C7CATACTAA AGAAATTTGC 

ATTTAAAATG TATAGCGTTG GCCCAACAAT 
TAACATGTTG ATATTACTCA ACGTTTTGAT 

50 AGTAAATAAT ATGGTTGCAA TGACAATCAA 

A"TAAATAA^ B&A^T^AAA'" ^rr^yry^^r^ a/r 



GTTTTGGTTA TGATTGTTAG CCATGATATA 12 0 

TTTCTGATCT CTCACATTAA CTTCTAACTT 180 

ATTCTTTTTA ATATCCGAAC GAATTGCTTC 24 0 

TACGAAAAAG TCAGTTTTAA TGTCGATATA 3 00 

AACTACATTT GGTTGTCTTA CTTGATCATA 3 60 

AACAGCTTTA CGAGATACGT AAACATGACC 42 0 

TCGATGCGTA GGTTTGAAGA TACTAAATAC 4 80 

AGCAAGTGAA ATAAGTAATG GTTGGAACCA 54 0 

AGTTATACGT CCATCTTGGA TATACATGAA 6 00 

TAAGCCAAGG ATAAAGTTTT TAAGTCGTTT 660 

TAATTTAAAA A CAT ATT AAA TATGTAC CCA 72 0 

TAACTTTAAA TTTATTTTTA ATATATAAGA 78 0 

TTTTTGTAAA TAAAACCTCA TTATTAATTA 840 

TTCTAAAATA TCTGGGTCAC GATGTTTAAT 900 

CTCACTCAAA GATTTTATAA GCGACCATAT 96 0 

CGCAGCAATG ATTAATAAAT TTTGAATTGC 1020 

TGCAAATAAT GCCATAATGA TACCCCAACT 10 8 0 

ACCACTTGAA CTCAACATAC CTAAAACATA 114 0 

AATCATAATA ACAAGTAAAG TAATTAAG CT 12 0 0 

CGTCGCAAAA GTTGCTGTTT CTGTCGCAGC 1260 

TAAGTAAATT GCTGACGCGC CGAATACCGC 13 2 0 

AAAAAGTACA CCTAGAATAA ATTCTTTAAT 13 80 

TATACCTACA AATGGTGCCC AAGATATCCA 144 0 

TAACCATTGG AATTTTTGAC CACCTGTCGG 150C 

AATATAATTA CCTAGACCAT TCGTAAATGT 156 0 

AAAAAGACCA ATAAGTACTA CAAAAGCAAG 1620 

AC CTTT AT CG ATACCTGACC ATGCTGACCA 16 80 

GATTACTTGC ATCGTGAAGT TACTCGGTAC 174 0 
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CATCAAGTTC ACCGTAATCT TTTAACTTTC CGCCTTCAAT CCAAGCAATC TTAGTACAAA 234 0 

ATTGTCTCAC TTGTCCTAAG TTATGACTAA CGAAAAAGAT GGTTTTGTTT TGCTCTTTAA 24 00 

5 ACTCGTAAAT TTTATCTAAA CATTTTTGTG CAAAAGTTTG GTCACCTACA GATAAAGCTT 24 6 0 

CGTCAATGAC TAAGATATCT GGATTAACTG TGATATTAAT TGAAAAACCA AGTTTTGCAC 252 0 

GCATACCACT TGAATACTTT TTAACTGGTT GATAAATAAA CTCACCAAGT TCACTAAATT 2580 

w 

CAATAATCTT AGGTGTCATC GCTTTAATTT CTTTTCGCTT AAAGCCCATA CATAACATTT 264 0 

TAAATTCGAT ATTTTCAATC CCTGTAAGTT GTCCACTCAA GCCAGCACTA ATTGCGATAA 270 0 

CGCTGACTTC ACCATTACGA TCCACTTTGC CAACAGTAGG CGACAAAGAA CCGCCAATGA 276 0 

75 

TATTGCTCAA CGTTGATTTG CCGGAACCAT TGATGCCAAC AAGCCCTATG ACGTCACCTT 282 0 

CATATGCTTT TAAACTAATG TCATCTAAAG CGAAAAATGT TTTGTTTTTA TGTTTGGGAA 28 80 

2Q TGAGCGCATC TTTCATACGT TCTTTATTTG TACGATAAAT ACGATATTCT TTTGTTACAT 294 0 

TTTTAATGTT TACCGAAACG TTCATTTGTA GACCTTCCTT ATTCACATTT ATCTAGATTA 300 0 

TAATATACTA CTCAACAGTT GTTAAATTTT AAAACCTGTT GTAAAGTGTA TAGAAGATTT 306 0 

25 TGTTATTATC AGAGTGGGTG TTTTGACACA AAATGTTAAT CATCAATGAT AACAATGATA 312 0 

TTTAAAAACT AAACTTATTT CAACTTACAT GATTGTATAC TATAATGTAT TTGTAATAAA 318 0 

CTAATATTTT AAAGAACTAG ACAATAATTT TG AT AG CATC CATGTATAGT GATAGTATTT 324 0 

30 ACAACAATTA TTATAATACT ATTTAGTTAA GTAGAGAAAT AGTTAAACAT TTGAAAGTGT 3300 

GGTTTAATGG AATGTCAGCA ATAGGAACAG TTTTTAAAGA ACATGTAAAG AACTTTTATT 3 3 60 

TAATTCAAAG ACTGGCTCAG TTTCAAGTTA AAATTATCAA TCATAGTAAC TATTTAGGTG 3420 

35 

TGGCTTGGGA ATTAATTAAC CCTGTTATGC AAATTATGGT TTACTGGATG GTTTTTGGAT 34 80 

TAGGAATAAG AAGTAATGCA CCAATTCATG GTGTACCTTT TGTTTATTGG TTATTGGTTG 354 0 

GTATCAGTAT GTGGTTCTTC ATCAACCAAG GTATTTTAGA AGGTACTAAA GCAATTACAC 36 00 

40 

AAAAGTTTAA TCAAGTATCG AAAATGAAcT TCCCGTTATC GATAtACCGA CATATATTGT 3660 
(2) INFORMATION FOR SEQ ID NO: 173: 

45 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 13 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

50 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



55 



10 



15 



20 



25 



EP0 786 519 A2 

CTGTGGACTC GGACGCTGGA AAGTCAATTT AGCAATCGTC CAACTAGATT GTAGAACTTC 54 0 

GCCTAATAAT ACACCTAAAA TATATTGATA ACTCATTGTG ACAAGTAGTT GAATTTCTAC 6 00 

TATATTTTCA TCTTTTAATA TAAAATACAA CATGATAGAA ATTAAAGTTA TAACAACAAT 66 0 

GGG TGAGCCT TTTCtAGATG TTAAAATTAA AAAATAAATA AATATCAATA AATAGGTAAA 72 0 

TATAAAGAAA CTAGGTATCT GATAATGGCT CGACGCTAAA CCTATCAATA ACATAATAGG 78 0 

TGGCATAAAA TAACCACCAA TCGTTGTAAG CCATTGGCCT GCTAGATGTC TAGATTGTGT 84 0 

AATTGCGAAT CCTTGTTGTA ATGTCTGTTG TCGCTCTCGT GGACTTGTTA CAATGACTAA 90 0 

ATCTTTTGCA CGGCCACCAG CGAGTTTATT AAACAGTACA TGACCAAATT CATGTGTTAA 96 0 

AACAGGGATA TAGTTTAAAA TGACATCTAA ATAGTTCAAA ACAGGCTTAT GTCTATATTG 102 0 

ATGAATAGCA ATATAACAAG CTGCAACAAT AACGATAATG TATATATTAA GTTGAATTGT 108 0 

CGTATTAAAA AAGTTTGATA AATAATTCAT TGTTAACCTC ATATAAGATA TTAATTTAAA 114 0 

GTTTGCTTAT CACTTATTAT AAATGATATT GGCATCAATA GCGTTAGACT TTAGACTTAC 12 0 0 

CTTAGTTAAA CTAATTTTAA TTTTTGAAAA GGTGAATATG TGTTAAAATA AAGCAAAATC 12 60 

ATTTCGATAT AAATAGGATG AATATAAATA CTGTTAATAT TGATTACACT AACATAATAA 132 0 

TGAAATAAGA TAGGAGATTC CTGTTATGAC TGTTGAAGAA AGATCCAATA CAGCCAAAGT 13 80 

TGACA T TTTA GGGGTCGATT TTGATAATAC AACAATGTTG CAAATGGTTG AAAATATTAA 144 0 

30 AACCTTTTTT GCAAATCAAT CAACGAATAA TCTTTTTATA GTAACAGCCA ACCCTGAAAT 1500 

AGTGAATTAC GCGACGACAC ATCAAGCGTA TTTAGAGTTA ATAAATCAAG CGAGCTATAT 1560 

TGTTGCTGAT GGGACAGGAG TAGTCAAAGC TTCGCATCGT TTAAAGCAAC CTCTAGCGCA 162 0 

TCGTATACCT GGTATTGAGT TGATGGATGA ATGTTTGAAA ATTGCTCATG TAAATCATCA 16 8 0 

AAAAGTATTT TTGCTAGGGG CAACTAATGA AGTTGTAGAA GCGGCACAAT ATGCATTGCA 174 0 

ACAAAGATAT CCAAACATAT CGTTTGCACA TCATCACGGT TATATTGATT TAGAAGATGA 18 0 0 

GACAGTAGTG AAcGnAnTTA AACTGTTTAA ACCTGATTAC ATATTTGTAG GTATGGGATT 186 0 

CCCTAAACAA GAAGAATGGA TTATGACACA TGAAAACCAA TTTGAATCTA CAGTGATGAT 192 0 

GGGCGTAGGT GGTTCTCTTG AAGTATTTGC TGGGGCTAAA AAGAGAGCGC CTTATATCTT 19 80 

TAGAAAATTA AACATTGAAT GGATATATAG AGCATTAATA GATTGGAAAC GTATTGGTAG 2 04 0 

ATTAAAGAGT ATTCCAATAT TTATGTATAA AATAGCCAAA GCaAAAAGAA AAATAAAAAA 2100 

50 GGCGAAATAA TCATGATGAC AAAAATAAAA CCGAGGAAAT CCTTAAATGG AGATTCTCGG 2160 
TTTTTTCGGT TTATTTAATA ACGAAGCGGG ACTCATCGAG TTTGTTTOTA AATTrTTTT'*' 
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CATCCATTTG TAATTTTAAA GCAGTTATAG CTTTTAATGC ATCAGCCTTA TTACGATTAC 10260 

TTACTTTTCG ATAATTTTGC ACTAAAGCAG TGACGCGTGC AAGATCATCA TTAATCGTTT 10320 

TTTCAGCATC TGGCTTTTTA ATAGGATGTA CATCTAAATC ATGTATTGTT TGTAGATTTA 10380 

ATGATGCTGT TTTAT C AACT TGTGCATTGC TACGATCTTG ATCAATTTGT CCAATAGCAG 1044 0 

TGTCATAAAT ATTTTGTAAC TGTGCTAATA TACTATTTCT TTCTTCTACC GTTGCTTGAA 10500 

TATTCGCTTC AATTGCTTGT TTTTTATCGT TGAATAATGT TGTCAATTGT TCTCGAGCAG 10 560 

ACGCCTTTCT GTTAATAACA GGTTCGATTT CACGAATTTC GTTTTTCTCA TCATGCAATA 10620 

AATATGCCAC ATCTG CATTA GTCACTGCAC TAGCAATTTG TTGTTTAGCT TTAATTAACT 10680 

CTTTTTCAAC TTGTGCTATT GCAATATTTT GTTCTTCATC TGTCGCTTCG TTATTTGCTT 10740 

TAATTAAATT AATTTTATTT GTAGCGATAT TTTGAATTTG TTGTAATGCT GTTGCTTTAA 10800 

CTGTTGTCGC TGGTTTAATT TTTGAAATAA TATTTTGAGC ATTTATACTA TCTTGATTAA 10 860 

CTTGGGCAGT CTTATCTGCA TGATTGATCT GATCAATAGC CTGATTAAGT GCTTGTTCTA 10 920 

CTAAATGTTT AGCAGCTAGT CTTTCTTCTT CAGTTGATAA ATCGCTTTGA TCGATTAGTG 10 980 

CATTTTGAGC TTCGGCTTTT ACACCAACAG ATTGACGCGC TGCTGGTTTA ACTTGAACTT 11040 

TAGGTAAAAT CACTTTGATG TTGTCGTTGC CATCAGTCnC AGTnCGATCC ACTTCTGCAT 11100 

TCGTTTTGTT TTGTGCAATG TCATTT 11126 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

TTGCCCCGCA CGGCGGTGTG nTTCCTAGAA ATAATGAATA TAAAGaGAAA TATATAACAA 60 

CGATTTTGAA TTATGAACCT GGTGATATCG TTACAATCAA ACGTGTGAGA GATAAGACCG 120 

ATTTGCTAAT ATATTTGTCT AGTAAAGATA TTTCTATTGG TAATGAAGTG GAAATTGTAT 180 

CGAAAGATGA AATGAATAAA GTAATTATCA TTAAACGTAA TGATAATGTA ATTATTGTCA 240 

GTTACGAAAA TGCAATGAAC ATGTTTGCTG AAAAATAAAA TAAAGAAGCC ATAAAGATAT 3 00 

CCATGATTGA ACTGATAAAG ACATATGGAT AATTGCTTTA GGCTTCTTTT TTATTAGTTA 36 0 

ATTTATCAAG TGAGTATATT TGAGTAAAAT ATTCACTGCA TAAAGATTGA AGATAATCCA 4 20 



55 



10 



15 



20 



EP 0 786 519 A2 

GTAAACTCAA AATATCTGGT GG COG TAG AT GTAACGTAAC TAATCGATTA CCATATGCTG B4 6 0 

AAATTATTAA GAACATTCCT GGaAATGGGA AATTTTTATA TAGTCCCTTT TCAATTTTTG 8520 

ATAATGAATC CATCATAGAT TTTTTTGAGT CTAGGGGTGT TAAATTAAAA GAAGAAGATC 8 580 

ACGGGCGTAT GTTTCCAGTT TCCAACAAAG CACAAGACGT GGTTGATACA TTAGTGACAA 8 64 0 

CTATCGAACG CCAACATGTA ACGATTAAAG AAGAAGAAGC TGTTAGTAGA ATCGAAGTTA 870 0 

ATACAGACCA AACTTTCACT GTACATACTC AAAATAATAG TTATGAAAGC CATTCGCTAG 876 0 

TGATTGCTAC AGGTGGTACA AGTGTCCCTC AAACTGGTTC AACTGGTGAT GGTTATAAGT 8 82 0 

TCGCACAAGA TTTAGGTCAT ACCATTACTG AGTTATTCCC GACCGAAGTT CCAATTACAT 88 80 

CAGCTGAACC TTTCATCAAA TCCAATCGTC TAAAAGGTTT AAGTTTAAAA GATGTTGAAT 8 94 0 

TGTCAGTACT TAAGAAAAAT GGTAAAAAAC GCATCAGTCA TCAAATGGAT ATGTTATTTA 90 00 

CTCATTTTGG TATCAGTGGT CCAGCTGCAT TAAGATGTAG TCAGTTTGTT TATAAAGAAC 90 60 

AAAAAAATCA AAAGACACAG CACATTTCTA TGGCAATCGA TGCATTTCCT GAATTAAACC 9120 

ATGAACAATT AAAACAACAC ATCACATCAT TATTATCGGA CACACCAGAT AAAATCATTA 9180 

25 AAAACAGTTT GCATGGTCTA ATTGAAGAGC GCTACTTACT GTTCATGCTG GAACAAGCAG 9240 

GAATCGATGA AAATACCACA TCACATCACT TATCAAATCA ACAATTGAAC GACTTAGTAA 93 00 

ATATGTTTAA AGGGTTTGTA TTTAAGGTGA ACGGGACATT AC CT AT AG AT AAGGCATTTG 93 60 

30 TCACAGGTGG TGGTGTGTCA CTTAAAGAAA TTCAACCTAA AACAATGATG TCTAAATTAG 9420 

TTCCGGGATT ATTTTTATGT GGTGAAGTAT TAGATATACA TGGTTATACT GGTGGTTATA 94 80 

ATATTACAAG TGCACTCGTA ACAGGACATG TCGCTGGATT ATATGCCGGA CATTACTCAC 9540 

35 

ATGCATCAAT GGAATAATAG TATAAAATTT GGTTCGATTC TCTTTAGTAG ATCAACTTTT 96 00 

TCATTCAAAT AAAAATGACC TTAATATAAC TGAGTCACTA AAAAGTGTCG TTATATTAAG 96 6 0 

GTCATTTCGT TAATTATGAT TCTTTTTCGT TTTTAGTACG TCTTCTAGCT AACAAAGCCG 972 0 

40 

CACCTGTAAT CAGTGCAAAT TCTTTCAATG GTAAATCCAT TCCTTCAGAA CCTGTATTTG 97 8 0 

GAAGTTCTTT TTCAACTTTG CGCGATTCAT GTGTCTCTTC TTTTTTAATA GGCGTACAAA 9 84 0 

CTTTTGGAGC TGGCTGAATT TCTTTTGGTG ATACTTTCGT CGCTTCAGCT GGTAATTTAA 9 9 00 

45 

TTGCTAAAAT TTCATCAACA ATGAATTGCG TGTGTTGTTT GATGTCATTT AATGTCGCAT 99 6 0 

CTTCATCAAT CATTCTATTG CCATCTGCAA CATATTGATC AATTAATACT TTTACTTTAG 10 020 

50 CTAATTGTTC TGGTGTTGCG ATCGCTTTGA ATTTCGCATA TGTTTGTTGA GCAATGTTAT 100 80 

CAATTCGCAG TAAGCTATTT TCTTTTTCAG TAATTACTGC TTCTATATrG CTTAATHr'AA - ~ 
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ACTTGATAAC 


TACAGAAGCT 


AAAATAACAT 


AAACTGTTAA 


TTTCTGTTTA 


TCTATACCTT 


6660 




GTAACATTGA 


TGCCGTTACA 


CTTAATAGTG 


AAATTAGTAT 


TGCTACAGGC 


GCATAATAGA 


6720 


5 


ATAATAAGCG 


ACT AC CAT CA 


TGGTTAGGGT 


CATGACCTAA 


AACAATTGGA 


TCGTAACCAT 


6780 




AGAAAACTGT 


GAATAATGGT 


TGTGCCAAGG 


CCATAATTCC 


AATACTAGCT 


GGAACAGTTA 


6840 




TAAACATTAA 


TACACCAATA 


GATGTTCTAA 


TTTGATGATG 


CATTTCATGT 


AAGCGACCTT 


6900 


10 


CTGCAAATGT 


TTTTGTAATA 


TAAGGAATTA 


AACTCACTGC 


AAAACCAGCA 


CTTAATGATG 


6960 




TCGGAATCAT 


TACAATTTTA 


TTAGTTGACA 


TATTTAGCAT 


ATTAAAGAAT 


ATATCTTGTA 


7020 


15 


ACTGTGAAGG 


TATACCAACT 


AAAGATAAAG 


CACCGTTATG 


TGTAAATTGA 


TCTACTAAGT 


7080 


TAAATAATGG 


ATAATTCAAA 


CTTACAATAA 


CGAACGGTAT 


ACTATAAGCA 


ATAATTTCTT 


7140 




TATACATCTT 


GCCATATGAC 


ACATCTATAT 


CTGTGTAATC 


AGATTCGACC 


ATACGATCAA 


7200 


20 


TATTATGCTT 


ACGCTTTCTC 


CAGTAATACC 


AGAGTGTGaA 


TATACCAATA 


ATCGCACCAA 


7260 




CTGCTGCTGC 


AAAAGTAGCA 


ATACCATTGG 


CTAATAAAAT 


AGAGCCATCA 


AAGACATTTA 


7320 




GTACTAAATA 


ACTTCCGATT 


AATATGAAAA 


TCACGCGTGC 


AATTTGCTCA 


GTTACTTCTG 


7380 


25 


ACACTGCTGT 


TGGCCCCATA 


GATTTATAAC 


CTTGGAATAT 


CCCTCTCCAT 


GTCGCTAATA 


7440 




CAGGAATAAA 


GATAACAACC 


ATACTAATGA 


TTCTTATAAT 


CCAAGTAATA 


TCATCGACTG 


7500 




ACCAACCGTT 


TTTATCATGA 


ATGTTTCTAG 


CTAATGTTAA 


TTCAGAAATA 


TAAGGTGCTA 


7560 


30 


AGAAATACAG 


TACCAAGAAA 


CCTAAAACAC 


CGGTAATACT 


CATTACAATA 


AAACTCGATT 


7620 




TATAAAATTT 


CTGACTTACT 


TTATATGCCC 


CAATAGCATT 


ATATTTCGCA 


ACATATTTCG 


7680 




AAGCTGCTAA 


TGGTACACCT 


GCTGTCGCAA 


CTGCAATTGC 


AATATTATAT 


GGTGCATAAG 


7740 


35 


CGTATGTGAA 


CGGCGCCATA 


TTTTCTTGTC 


CACCAATTAA 


ATAGTTGAAT 


GGAATGATaA 


7800 




AAAGTACGCC CAATACCTTG GTAATTAATA TACTAATGGT AATTAAAAAG GTTCCACGCA 


7860 


40 


CCATTTCTTT 


ACTTTCACTC 


ATTACGAATC 


TCCCTATCTC 


ATGTTTATTA 


AAGTTTTGTA 


7920 


AACTAAAAGC 


TGTTTCTCTG 


TAAAATCATT 


TTTCATTATT 


ATGAATATAT 


CACAAAACTT 


7980 




TATTTCATTG 


TCGTATATTC 


AATGAATTAT 


CATAACAAAA 


TTATCAACAC 


ATTGTCATTG 


8040 


45 


AATACTAGAT 


TTTGATTAGA 


ATATTACGAA 


ATTTCATATA 


AACATTATAC 


TACTATTTGA 


8100 




GATGAACATC 


GCATAACAGT 


AGAAAAATCA 


TTCTTATCAT 


ACACATACAT 


CTTCATTTTT 


8160 




TATGAAGTTC 


ACATTATAAA 


TATATTCAAC 


ATAATTGTCA 


TCTCATAACA 


CAAGAGATAT 


8220 


50 


AGCAAAGTTT 


AAAAAAGTAC 


TATAAAATAG 


CAATTGAATG 


TCCAGTAACA 


AATTTGGAGG 


8280 




AAGCGTATAT 


GTATCAAACA 


ATTATTATCG 


GAGGCGGACC 


TAGCGGCTTA 


ATGGCGGCAG 


8340 
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TAAAGAAATA CGTTTTCTTT AGATATTAGT ATTTCTTATG AATGAGTTTC ACGCATGTAT 4 86 0 

TCTTCTTTCT ATATGCATAT TAGCTATGAC TAACGATAAA GAACCTGAAA CACTAATAAA 4 92 0 

TGTCCTATAG TTTACAATAT TATATTGGCA GTAGTTGACT GAATGAAAAT ACGCTTGTAA 4 980 

CAAGCTTTTT TCAATTCTAG TCAACCTTGC CGGGGTGGGA CGACGAAATA AATTTTGCTA 5 04 0 

AAATATGATT TCTGTCCCAC TCCCTTATCA TTTCTGTCCT ACTCACATCT TATTCTTTAT 5100 

CAGATAATGC ATTTTTATTC TTTTTTAAAT CTTCTTCAGT GACGATACGT AAATTATTAT 5160 

TTGGTGTGCG CCACCTTCAT CATCAAATTT ACCTTTTTCA ATACTTTCGT CAGTCTTATT 522 0 

GTCATATTCG GTAAATTTTG ATTTTTCTTC TTTGAAAAAT GCTTTTGGAT TATTTTTTAA 52 8 0 

TCTATTAGCA TATTCTTTCG GATTTGTTTT TACTTCTTTA ATTGTTTCAT TAGCAATTGT 534 0 

TCCTAATTGC GTCGCTTTAT CCTTAGCATT ATCTTTATAG CTTTGAGGAT CTTGTTTATA 54 00 

TTTATTATAT TCcTGcTTTC AGCTTGTCAC GACTATCTTT ACGTGTAACA AGTACAGCTG 54 6 0 

CTACAGCGCC ACCTATACCT AAAATCGCTT TAAATAAATT ACCTTTTGCC ATATCAATCG 552 0 

TCTCCCTTTT ATTTATAATT TAATTTGTCA AAATCATTTT CAGTTAATAA ACGATATTCT 55 8 0 

25 CCTGAATCTA AATTGCTGTC CAATTCTAAA TCAGCAATTT TGATACGTCT TAAATGTAAT 564 0 

ACCTCATTTT GAATGCTATG AAACATTCGT TTAACTTGAT GATATTTTCC TTCATAAATT 57 0 0 

GTTACGTGTG ACGTTTGATT ATCAATATAA GTTAATATTG CAGGCTTAAC CTTGCCATCA 57 60 

30 GTCAGTGTtA CACCCTCTTT AAAAGCTTGA ATGTCGTCTT CAGTGATAGG ATTTGCTGAA 58 2 0 

ATAACTTCAT ATTTTTTAGA AACATGTTTG TTTGGACTCA TTAATTCATG ATTAAAATCA 58 8 0 

CCATCATTCG TTATCAATAA AAGCCCTTCT GTATCTTTAT CAAGACGACC AACCGGAAAA 594 0 

ATATTTAGAT GTTGGTATTC AGGTATTAAA TCAATAACGG TTTTTGAATG ATGATCTTCA 6 0 00 

GTTGCTGATA TATAACCTTT TGGCTTATTT AACATAATAT AGACATTTTC AATGTATTCT 606 0 

ATTAATTCTC CACGAACTGT TATCTTATCG TTTTCTGGTT CTATATGTGT TTTTGGTGAT 612 0 

TTAATTACTT GTTCGTTGAC ATTTACAAGG CCTTTTTTAA GTAACTGTTT GACCTCATTA 618C 

CGTGTACCGA CGCCCATATT TGCTAAAAAT TTATCTATTC TCATCGTAAA AACCTAACTC 6 24 0 

TACGTCTTAA TTTTTCAGGA ATTTCACCTA AGAATTCGTC CGCAAGACGC GTTTTAATTG 6 3 0C 

TGATTGTACC GTAAATTAGA ATACCTACTG TAACACCTAA AATAATAATG ATTAAGTAAC 6 3 60 

CAAGTTTAGT AGGTTCTAAG AATAGATTTG CAAGGAAAAA TACTAATTCT ACACCTAGCA 64 2 0 

50 T CATAAT AAA TGAATACAAG AATATTTTTG CAAAATGAAT CCAACTATAG CTGAATTTAA 6 4 80 

ACTTCGCATA TTTTTTAAGA ATATAGAAAT TACATCCAAT TG r AAATAA' r AATG'^aT&'" r . - 
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TCAGTCATAT CATTTGTTTG ATTTCTATAT GCAGTAACTA ACTTTTGTAC AAAAGGATCA 3 06 0 

TTTTTATCAA CATAATGTGG TGGTTGGACT TTACCTAATT TCACTTCAAA GCCATATTGT 312 0 

TGAATCTCAT TTGCAAAACG ATCCATAGCT TTTTCAAATT CAAATCCTTC TGGGTAGCGT 318 0 

AAGTTGATAC CGAAAAGACC TGCGTTTTCA TTATCATATG TAATAACACC AATGTTAGTT 3 24 0 

GTCACGTCAC CCATGACATC TGTATGGAAT TTCATTCCCA TCTTTTCACC AAAATCTGAA 3 3 00 

TTAAATAAGT AG CG ATT ACT AAATGCTACA AACGCTTGTG CATTATTATC AAGATTTAAT 336 0 

GATGCTAAGA ATTTTAGTAA GTAAAGACCC GCATTCACAC CGATAGATGG ATCCATACCA 34 2 0 

TGAACCGCTT TACCTTCAAC TGTTAAAACT AGAATGCCAC TATCAACAGT ACTATCACCT 3480 

TGTAAATGAT TTTGTTCTAA AAAGTACTCA AAGTCTTGAA TAACATCTGT CATATTTTCT 3 54 0 

TTAACAAGCA CTCTTGCTTC TGCATGATCA GGTACCATGT TGTAACGTTC ACCAGATTTA 3 600 

AAAGTTATTA ATTCATAATC AGGTTCATCT TGATCTTCAG TAAGTTTATT TTGAACTAAA 3 66 0 

TCAAATGTTG TAATGCCTTT TTCACCATGA ATACATGGAA ATTCTGCATC TGGTGCAAAA 3720 

CCTAATGTTG GCATTTCTTC TGTTTTAAAA TAGCGATCCG TACATTTCCA AT GAG ATT CT 3780 

25 TCATCCGTAC CAATAATCAT ATGAATACGT TTCTTCCAAT CCACATTCAT ATCTTCTAAT 3 84 0 

ATCTTAATTG CATAATAAGC AGCAATTGTT GGAC CTTTGT CATCAAGTGT ACCTCTAGCT 3 900 

ATGATAGCAT CTTCTGTTAC AACCGGCTCG AACGGATTAC TATCCCATCC ATCACCAGCA 3 96 0 

30 GGAACAACGT CAACATGACA TAAGATACCT AATACGTCAT TTCCTTTACC TGCCTCAATT 4 02 0 

CTTCCTGCAA TATGATCCAC ATCATGTGTT GTAAATCCAT CTCTATGTGC AATTTCATAC 4 080 

ATGTAGTCTA ATGCCTTACG AGGACCTGGA CCAACTGGTG CGTCTTCTGA TGCTTTTGCA 414 0 

TCATCTCTCA CACTTTCAAT TGCTAATAAT CCTTTTAAGT CATTAATGAT TTGATCTTCG 4200 

TATTGTTGAA CTTTTTCTTT CCACATTCGA AATCGACTTC CTTTTTTCTA TAAGTTAAAT 4 2 60 

TCTATTTTAC ATGAAAAGAT ATAAAAACTA CAATAAGATG TCAGAAAATA ATAAAAAGGA 4320 

ACAAAACGAT GCTATTGATA TGACACAAAT CATAAATAGC TGCTTTGTTC CTTTTTTAAT 4 3 80 

TTATATATTT AAAATACACA TATTCAAGAG CTCGAGATAT AAGTCAATGT ACTAGGCACA 444 0 

CAATTTAATA TTGACAGTAA TTAACCGAAC GAAAATGCGC CCCGGGGCCC CAACATAGAG 4500 

AATTTCGAAA AGAAATTCTA CAGACAATGC AAGTTGGCGG GGCCCCAACA TAGAAGCTGG 4560 

CCAATAGTTA GCTTTCAATA ATGTGCAAGT TGGGGTAAGG GCCCCAACAC AGAAGCTGGC 462 0 

50 CAATAGTCAG CTTTCAATAA TGTGCAAGTT GGGGTAAGGG CCCCAACACA GAGAATTTCG 4680 

AAAAGAAATT CTACAGACAA TGCAAGTTGG CGGGGCCCCA ACACAGAAGC TGGCCAATAG 474 0 
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TTTTCCAATG TTGTGCTGTA ACAACCTCGC CTGTTTCTAT GCGTTTCGTC CATACTAATT 12 6 0 

TGGGCACAAT ACCTTCTGCT GATAATGCCG CAATAAATGG ATTTGAATTT CGTTTTAAAA 132 0 

ACAACTTTTG TCCATCTTGT TCAGCCATAT ATGCTTCACC AGATGCACCA CCTGCTGAAT 13 8 0 

CAAGTGTCCA CCCTAATTGA TAAAACTGCT CCAACTCGTC CACCTCACTT TCAATTAGAA 144 0 

AATGGCTCTA GAAATAGGTT TTTCAAGAGC CATATATTCT AATTTATAAC ACCATACTGG 150 0 

TACAAATATT ATGTCCAGAT AATTATTGTA AATCCTCAAC CAATGCCTAC ATTACACGAC 156 0 

TAAATTTAAA TCGTAATGTC TGTCATTGAC ACCATACATT CTATAGTCAC TTACTTGACA 162 0 

TATAATGTTA CCGTGTCTAA AACTACATGT TTTTGAATCT CTGTAGGCGA TAAACTcTAG 1680 

TTTTCAAAAT AATTGCTATC CCATTTTCAT GGTTAGCATA AATTTATGAA CTGTAACATT 174 0 

TACGTACTTA GTAAAATATG ATGCACATCA TATTTGTrAC TCATAGAAAA TTTTATAAtT 1800 

TTTATCATTA TATTTCAACT GAAAATGAGA AACAAAATGG CACTTTTTAC TAATATGTGT TflfiO 

TTTCTAAACA ACACTTTTAA GCTTCGTTTT AAATTATAAC ATAATTCACT TACGAAAGTT 1920 

GATAAATTTA AGTAATTTAA TCTAAAAATA TGATGAAAGA ATTTTAAATA CTGTGTGACT 1980 

CTATATACTT TTCAAATCCT TCTTGTAGTT GACGTGTAAT TGGGCCAACT TTACCATCAT 2 04 0 

TAACTGGTTC ACCATCTAAT TTAATAACAG GTGTAACCTC AGCTGAAGTA CTTGAAACAA 2100 

TAACTTCATC TGCGTTTTTC AAGAAATCTA CAGTAAACGT TTCTTCTTTA AATGGGATGT 2160 

TATAGTCTTC GGCAATTTTT TTAATTACAA TTCGTGTAAT ACCATTAAGA ATATAGTTGT 222 0 

TAATCGGATG TGTATAAATC ACACCGTCTT TAATTGCATA AGCATTACTT GAAGATCCTT 2280 

CAGTTACAGT TtCACCTCGA TGTTGAATTG CTTCAACTGC ATTATATTTC ACAGCATATT 2 34 0 

CTTTTGCTAA TACATTcTCC TAATAAGTTC AAGCTTTTAA TGTCGCAACG TAACCATCGG 24 00 

ATATCTTCAA CGGTAACACC ATTCACACCA TTTTCTAAAT GATCATAAGG ACGATCATAA 24 6 0 

CTCTTTGTAT AAGCAACAAT TGCTGGTTCT ACTTCAGGTG TCGGGAAGCT ATGATT CCTT 252 0 

TCAGCTACAC CACGCGTTGC TyGAATATAA ATTGCCCCAG TTTCAATTTG ATTCATATCA 2 58C 

ACTAATTTAC GAGATAGTTC AATTAATTCT TCTACAGAAT AATTTAAATC TAAACCAATC 2 64C 

TCATTGGCAC TACGTwAAAw TCTTTCATAA TGTTCTGTTA CTGTAAATAA CTTACCATTA 2 7 00 

TATACTCGAA TGTATT CAT A AATACCATCG CCAAATACGT ATCCTCTGTC GTTGTATGAA 2 760 

ACCTTTGCTT CACTTGGACT TACAAACTCA CCATTTAAAA AAATTTTTTC CATATATTAT 2 8 20 

TCCTCCACGC ATAATGAATA AATTGCTTCT AAGTAAATAC TAGTTGCGTT AAATAACTGT 2 8 80 

f r^r r r^p r rj^c^ r ^p r T>p i r r , fo r T> r j'r*p i ^'~r T^r^^^Tv;^ A -r~~> » ji -T- - * ^ * - — - » ■■- ■ , .... 



EP0 786 519 A2 



GAATTTCAAT GTATAATTGT GTATATTACA TTAGAATAAA GCACGAAGGA GCATGATACA 5160 

TGTCAGAAAT AATCGTTTAT ACGCAGAATG ATTGTCCACC TTGTACATTT GTAAAAAATT 5220 

ATCTAAATGA GCATCACATT GATTTTGAAG AGAGAAATAT CAACAATCAA CAATATCGAA 5280 

ACGAAATGAT AGATTTTGAT GCTTTTTCAA CTCCGTTTAT TTTGTTGAAT GGC 53 33 



(2) INFORMATION FOR SEQ ID NO: 171: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

is 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
ATACGTGACC CTTTATCCGA AAATTTCTTT TCATATTCTG TTAAAATATT ACTGCCATCG 60 

20 

TCTTCTTGAT GTAAATTTAG ATTTATTTTT GTAAAATACA TTCCAAATTG AGACATACTT 12 0 

TCTAAACTGT AGGCAAATAG TCCTCTGTTA TCAGTTTTAA AATGTAAATC TCCTTCATCA 18 0 

25 TTTAAGATTT GTTGATACAA CGCTAAAAAC GTATGATACG TTAAACGTCG TTTTGCATGA 24 0 

CGATTTTTTG GCCATGGATC TGAAAAGTTC AAATAAATAC GCGAAACTTC GCCGTCTTTA 300 
AAATATTCAT TTAATTCAAT GGCGTCATTA CAAATAATCT TTAAATTTGT TAAACCCATC 36 0 

30 TCTTTAACTT TATCCAATAC TTTATAAACG ATACTTTTCT CACGTTCCAT TGAAATATAG 42 0 

TTAATATGAG GATTTTGAGC AGCTAATGTT GTAATAAACT GCCCCATACC CGAACCAATT 4 80 

TCAATGTGTA TCGGTTGCGT TTTaTCAAAC CATTCAGTCA TTTTCCCTGc ATGTTGACCG 54 0 

35 TCCATGTCAA CCAATTCAGG ATGATCTTTT AAATAATCTT CAGCCCATGG TTTGTATCGA 600 
ACT CT CAT AT TTTATTCTCC TCTTAAATAA ACATGTTACT ATTCATAACT TCATTTAGGA 660 
ATTTAAGCCA AGTGTTCATA TCCTTATATC TTTTTTGCTC TTCATACCAT TGAACAAGAC 720 

40 

CTATAGATTG AATTACCGTA TACCATTTCA TACGTTTATT TAAATTCAAG CTCTCTTGAA 780 
CACCATATGT TTCAAGCCAT TCAGACCATT GTTGTTGTGG AACATAGTTG TAAAGCAGCA 84 0 

TTCCGATATC AATTGCCGGG TCTGCAATCA TTGCACCTTC CCAATCAACT AAAAATAGTT 90 0 

45 

CATCTCGATC GGATAATAAC CAATTATTAT GATTCACATC ACCATGTACA ACAGTGAAAA 960 
AACGCGAATC TAAACTCGGT ATATGCTCTT CTAAATAGGT TAATGATTTT CTCACAATAT 102 0 

SO GATGTGTTAA AACTTCTCTT GATAAAGAGG CATTAATTTT ATTAAGCATA ATCTCAGGAG 1080 
TAATAGGTTC CATTTCCATA CGCTTTAACA TACTTAATAA AGGTCTAGAA TTGTGTATCT 1140 
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AACTGTTTCT CAACGATATC ATAATCAGTG CTATTGAACC TGGTATTAAC AATACCGTGC 3 36 0 

CTAAATATTT GATTGACTCT GGAAAGAAAC CTACGAATCC TACGAAGAAG AAAACAAAGA 3 42 0 

ATACATTCGT AACTTCCCAA ACTGGGTTTA AATAACGTGA AATTAAGTGA TTAATTTTCT 3 4 80 

TTTCATCACC AGTTAACTTT GAATGCAATG CGAAGAAACC TGCCCCAAAA TCTATAGAAG 3 54 0 

CAATAATGAT ATAGCAAAAT AAAAACAACC ATAACACTGT TATACCTATA AATGCATAAA 3 6 00 

TCATTTTTCT ATTTCTCCTC CTTGCTTCTT GGCTAAACGA TTTACATCTT CATACG CCGG 3 6 60 

TTTATTTTTA AACATACGAA TTAATACGTA TGCACATGTA TACATTAAAA TGATGTACAA 3 720 

TATGCCAAAT AAAATTGTAA CGAaGGTTAT TCCGCCTGCT TGTGTTGCTG CTTCTGCCAC 37 80 

GCGCATATAA CCACGAACAA TCCAAGGCTG TCTACCCATC TCTGTTAAGA ACCATCCAAA 3840 

TTCTATAGCT AGCATTGAAG CTGGGCCTGT TAATAATATT CCATAAAGCA TCCATTTATG 3 9 00 

AGTAGAAAAC TTTCTAAGCT TTTTAAACAT TAAAGTTAAG ACATAAACAC CTGAAATGAC 3 9 60 

AAAACATAAA ATTCCCATCG TTACCATTAA ATCAAAGAAA TAATGGACGA TCATAGGCGG 4 020 

ATGTAAACTT TTTGGAAAAT CATTTAACCC TTGTACTTTA GTTTTGACAC TATTATCTGC 4 0 80 

25 TAAGAAACTC AATAGTCCAG GTAATTCAAT CGCACCTTTA ACTTGCTGAG TCTTTTCATC 414 0 

TAACACACCA AATAATAATA ATTTGGCATG GGAAGATGTA TCGAAATGCC ATTCATAAGC 42 0 0 

TGCTAATTTT TCAGGTTGGA ATTTATGCAA AAATTTTGCA GATAAATCCC CTGCCAACAT 4 26 0 

30 AGAAAGTAAT GTTGAAAAGA ATCCAACTAT CATAGACATT TTCAAAGCTT TCTTATGGTA 43 2 0 

GACAGTATCT TTAGGTTGAC GATTACGCAA TAATTTAAAA GCTGCTATTG ATGCAATAAC 43 8 0 

AAATGCCATC GTCATACCGG CTGTAGTAAT TACGTGAAAT GATCGAACTA TAAACGATGG 44 4 0 

GTT AAA CATC GCTTCTATAG GTTGAACATT GACCATCTTT CCATTCTTCA ACTCAAAACC 4 50 0 

TGCAGGCGTA TTCATAAATG AATTCACTGA AGTAATGAAG AATGCTGAGA AAGAGCCACC 4 560 

AATAATTACT GGTATACTAA TTAAGAAATG TGTCCATTTA TTTTTAAAAC GATCCCAAGT 4 62 0 

ATATAAATAT ATACTTAAGA AAATAGCTTC AAAGAAGAAC GCAAATGTTT CCATAAATAA 4 6 80 

TGGAAGTGCA ATAACGTGTC CACCCATTTC CATAAATGTA GGCCAAATCA ATGATAATTG 4 74 0 

AAGTCCTATA ATTGTACCTG TAACAACTCC CACTGCTACA GTAATTGTAT AAGCTTTAGC 4 80 0 

CCATCTTTTG GCCATAGCTA TATATTGAAG ATCATTTTTG CGAATACCTA AAAATTCTGC 4 86 0 

AATTGCGAAC ATTAAAGGCA TACCAACACC AATCGTTGCA AAAATGATAT GAACTGCTAA 4 92 0 

AGTCATAGCT GTCAAAAACC GACTGATTTC AACTGTATCC ATTTAAAAAC ATCACCTTTT 4 98 0 

TCTTTTTTTG ATGACAACAC AATGAACTTA ATTATAATTG CTATAATOT^ TK' r T-'r~A*T* *" ~ -i " 
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GCTTTTTCTT TATATATGAT GAGCTTGAGA CATAAATCAA TGTTCAATGC TCTACAAAGT 1560 

TATATTGGCA GTAGTTGACT GAACGAAAAT GCGCTTGTAA CAAGCTTTTT TCAATTCTAG 162 0 

TCAGGGGCCC CAACATAGAG AATTTCGAAA AGAAATTCTA CAGGCAATGC GAGTTGGGGT 16 8 0 

GTGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGTA AGTTGGGGTG 174 0 

GGACGACGAA ATAAATTTTG AGAAAATATC ATTTCTGTCC CACTCCCGAT TATCTCGTCG 180 0 

CAATATTTTT TTCAAAGCGA TTTAAATCAT TATCATGTCC AATCATGATT AAAATATCAC 186 0 

CTATTTCTAA ATTAATATTT GGATTTGGTG AAATGATGAA CTCTTTGCCT CGTTTAATTG 192 0 

CAATAATGTT AATTCCATAT TGTGCTCTTA TATCTAAATC AATGATAGAC TGCCCCGCCA 1980 

TCTTTTCAGT TGCTTTCAAT TCTACAATAG AATGCTCGTC TGCCAACTCA AGATAATCAA 204 0 

GTACACTTGC ACTCGCAACA TTATGCGCAA TACGTCTACC CATATCACGC TCAGGGTGCA 2100 

CAACCGTATC TGCTCCAATT TTATTTAAAA TCTTTGCATG ATAATCATTT TGTGCTTTAG 2160 

CAGTTACTTT TTTTACACCT AACTCTTTTA AAATTAAAGT CGTCAACGTA CTTGATTGAA 2220 

TATTTTCACC AATTGCCACA ATGACATGAT CAAAGTTACG GAT AC CT AAA CTTTTCATAA 2280 

25 CTGCTTCATC TGTAGTGTCT GCAACAACCG CATGAGTAGC GATATCACTA TATTCATTCA 234 0 

CTCTATTTTC ATCATGGTCG ATGGCCATTA CATCCATGTC TAATGCATTC AACTCACGAA 2400 

CGATACTACC TCCAAAACGA CCTAGACCGA TGACTACATA TTCTTTACCC ATACTCGCCC 2460 

30 TCCATTAAAT GATTTTCATC AATTCATTGA AAATATAAAT TTAAAATTAT TATAAATGAG 2520 

TACCCCAACT AAATTATCTA AATG CAGTAA TGCAAGTAAA TGAAAGTTGG GGTATCGTCT 2 580 

CAACTTATGA TTTCTTTCCT TCAACATATT CTTTGTCGAA AACAAATAAT CTTAATAATA 264 0 

ATATTAACGA TGGAAGTAAT AAAAGTAAAC CTAAAATAAA GACAATCACT AATGTCCAGC 270 0 

CCATSTCTGG ATTAACATAT GCATCTGTAA TTTTTACAAA CGGATATAAA AGGTATGGCA 2760 

ATTTACTAAT TCCATAGCCA AAGAACGCGA ACATCATTTG TAAAATAACA AATACAAAAG 2 820 

CCAAACCATG TTTTTTCTTA AAGAATGTTA ACAATGAAGC TAATGCAAAG AATAAGAAAC 2880 

TTATACCAAA CATCCACCAA TAGTCAAAAA CAGCTGAATA AAAATGTTCA GAATTTTGAA 2 94 0 

TGCGTAATGA TAGAAATACG AATAAACAAA TGATAATCAT CGGCGGCCCT AAAAATATGT 3000 

GCCATTGTCT TGTTAAATTA TATGCTGGTT CGTCATTTGC TTTTTTAGCA TAATATGTCA 3 06 0 

AAAATCCTGA TGAAATATAT AAAACTGAAA TAATTGCCAA GAATACTACA GACCAAGCAA 3120 

50 ATGGGCTTAA TAATAACTGC ACCCAATCTA GATCGATAAC ATTGTTTCGA ACATTAATAT 3180 

AGCCACCTTC TGTAATAGTT AAAGCAGTAG ATAATGAAGC TGGAATTAAT AATCCACTTA 3240 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

ATTAAAACAA CTTAATATAC CTATTTATGG TGGTCCTTTA GCATTAGGTT TAATCCGTAA 60 

TAAACTTGAA GAACATCATT TATTACGTAC TGCTAAACTA AATGAAATCA ATGAGGACAG 120 

TGTGATTAAA TCTAAGCACT TTACGATTTC TTTCTACTTA ACTACACATA GTATTCCTGA 180 

AACTTATGGC GTCATCGTAG ATACACCTGA AGGAAAAGTA GTTCATACCG GTGACTTTAA 24 0 

ATTTGATTTT ACACCTGTAG GCAAACCAGC AAACATTGCT AAAATGGCTC AATTAGGCGA 3 00 

AGAAGGCGTT CTATGTTTAC TTTCAGACTC AACAAATTCA CTTGTGCCTG ATTTTACTTT 3 60 

AAGCGAACGT GAAGTTGGTC AAAACGTAGA TAAGATCTTC CGTAATTGTA AAGGTCGTAT 4 20 

TATATTTGCT ACCTTCGCTT CTAATATTTA CCGAGTTCAA CAAGCAGTTG AAGCTGCTAT 4 80 

25 CAAAAATAAC CGTAAAATTG TTACGTTCGG TCGTTCGATG GAAAACAATA TTAAAATAGG 54 0 

TATGGAACTT GGTTATATTA AAGCACCACC TGAAACATTT ATTGAACCTA ATAAAATTAA 600 

TACCGTACCG AAGCATGAGT TATTGATACT ATGTACTGGT TCACAAGGTG AACCAATGGC 6 60 

AGCATTATCT AG AATTG CT A ATGGTACTCA TAAGCAAATT AAAATTATAC CTGAAGATAC 720 

CGTTGTATTT AGTTCATCAC CTATCCCAGG TAATACAAAA AGTATTAACA GAACTATTAA 780 

TTCCTTGTAT AAAG CTGGTG CAGATGTTAT CCATAGCAAG ATTTCTAACA TCCATACTTC 84 0 

AGGGCATGGT TCTCAAGGTG ATCAACAATT AATGCTTCGA TTAATCAAGC CGAAATATTT 900 

CTTAeCTATT CATGGTGAAT ACCGTATGTT AAAAGCACAT GGTGAGACTG GTGTTGAATG 96 0 

CGGCGTTGAA GAAGATAATG TCTTCATCTT TGATATTGGA GATGTCTTAG CTTTAACACA 1020 

CGATTCAGCA CGTAAAGCTG GTCGCATTCC ATCTGGTAAT GTACTTGTTG ATGGTAGTGG 10 80 

TATCGGTGAT ATCGGTAATG TTGTAATAAG AGACCGTAAG CTATTATCTG AAGAAGGTTT 114 0 

AGTTATCGTT GTTGTTAGTA TTGaTTTTAA TACAAATAAA TTACTTTCTG GTCCAGACAT 120 0 

TATTTCTCGA GGATTTGTAT ATATGAGGGA ATCAGGTCAA TTAATTTATG ATGCACAACG 12 6 0 

CAAAATCAAA ACTGATGTTA TTAGTAAGTT AAATCAAAAT AAAG AT ATT C AATGGCATCA 13 20 

GATTAAATCT TCTATCATTG AAACATTACA ACCTTATTTA TTTGAAAAAA CAGCTAGAAA 13 3C 
ACCAATGATT TTACCAGTCA TTATGAAGGT AAAPGAA^AA AAAGAAT^AA n^r.r™ 
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GCTTGCATTT CAAGATATTG CTCATTATAT TCGTCAACTT GAGTAGCCAA TAAATGATCT 7020 

TCTTCTTCAA GTTGTGCAGT TGTTTTTTCA CTTAAACTAG AACTTAATTC ATAAGAATAG 7080 

TTTTGGTTCT CAAGATATTT AGTTAAATCA TTAAAACGAC TCAAATTACT AGTATAAGTT 714 0 

TGGTAATCTT CATGATGTTG GTAAAAATCT TCTTCAGTAC CAACATTGAT AAAATCGAAT 7200 

AGTGCTGTAA TTTCTTTATT ATTTTCTTCT AATTGAGCAT TTAAATGATT TAATTCATTT 7260 

GTAACAAGTT TGGTATTTTC AGCATTAATA CGCCATTTTT CATTCGTGTC TTCAGCTGAT 73 20 

TTCAACCATT GTtGCACATC GTGGAATAAA GATAATTTGT TGAAATAAAC AAATTGTGAT 73 80 

TTTGTAACAG CTTCAGCATG ATTGTAGAAT GTATCTAATT CTTGAACCAA TTGCTGGCGT 744 0 

TGTTGATTTA AATCACTGAT ATGTTGATCT AATGCTTTAA TATTCGCCAT TGTAGAAATA 7500 

CTATCAACAA TTAAATCATT TGAAATTTTA GATGATAAGT ATAATTCATC CTTAACGTTC 7560 

TCAACTGTCG ATTGTAATTC ATCATGACGC CCTTTCGCAT CATTTAAACG ACCTTCAATA 7620 

TACTGACGTT TCTCTTCTAA AATATCTTTA TTTTTCAAAG CTTGTTGCCA GTGATCACGA 7680 

ATGCGATATT GCTCATCAAG ATCAAAATCT AAGTCATAAT TTTCATCTAA AATGGCTAGT 7740 

2$ TGTGCTTTAA TTTCTTCGAT TTCATCTGTG ATGGCCTCGC TATAATCTAC TTCTTTTGAT 7800 

TTAGACATGA TGATACCGAT AACAAATACT AAAGTTAATA CTGCGAAAAT AATACCAAAC 78 60 

AACATGTTGT TTGAAATAAA TGAGAAGGCA GTTAAACCAA T AC CTACT AA TGTTAAAAGr 7920 

30 ATAAACGTTG TTCGkAACAA TTTTTGACGT TTTTGttTTT CTT 7963 

(2) INFORMATION FOR SEQ ID NO: 169: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 9: 

ATATTGTCTT TACAATAGTT TGCTATGGAG GTAATTAACC AATAGGAGGA ATTTATAATG 60 

GCAGTAATTT CAATGAAACA ATTACTAGAA GCGGGTGTTC mCttCGGTCA CCAAACACGT 120 

CGTTGGAACC CAAAAATGAA AAAATATATC TTCACTGAGA GAAATGGTAT TTATATCATC 180 

GACTTACAAA AAACAGTGAA AAAAGTAGAC GAGGCATACA ACTTCTTGAA ACAAGTTTCA 24 0 

GAAGaTGGTG GACAAGTCTT ATTCGTAGGA nCTAAAAAAC AAGCACAAGA ATCAGTTAAA 300 

TCTGAAGCAG AACGTGCTGG TCAATTCTAC ATTAACCAAA GATGGTTAGG TGGATTATTA 360 
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AAAAATCCAA AATTATTGAC TGATGCATAC AAAGATCTAT TAAAAGAATA CGATGTTGAC 52 2 0 

TTTAAAGATC GTGATATTAA ATCAGTTGTC GAAGATAAAA TCTTAAACCC TGAAAAACTT 5280 

AAACAAGGTG GCGCACAAGG CGGACAATCC GGCATGAGCC AATAACACAA AACCGAGCGA 534 0 

CCGTGGTTCA AAAAT CAT AC CACGGCCGCT CGGTTTTTTC GCATTAAAAA TCGGACAGAT 54 0 0 

GAGCTCATGT TTCAGTATAC TCATCTGTCC GATATCTTTT AATTCTTAAT CGAGTGATTC 54 6 0 

AGGATTGTAG AATCTACGAT TTTCAAGACC AAATATTTTA TCTGTAAACT GACCCTTGTC 552 0 

AGTTTTTTTA TATGCCTTTT CAAACATATT CATTCTAGCA TCGATATTAT CGATATAGCA 558 0 

TAAAATTTCT GCTTCTTTTA AGTATGGCAG TTTTGGAGAA CCATACTCTA ACTTACCATG 564 0 

ATGAGATAAA ATCATATGTC TTAACAACAT GATTTCTTCT CCTTCAATGT TCAATTCACG 5700 

AGCTGCTTCA ACTACTTCAT CACTCGCAAT CGAGATGTGT CCTAATAAGT TACCTTCGAC 5760 

TGTATACGAC GTCGCAACAG GACCACTCAA TTCTCTAACT TTACCAATAT CATGCAAAAT 5820 

AATACCACTA TATAACAAAC TTTTGTTTAA CAATGGATAA ATGTCaCAAA TTGATTTTGC 5880 

AATACGTAAC ATCGTTAATA CATGATAGCT TAAGCCACTC GCAAAGTTAT GaTGATGAGA 594 0 

25 ACTAGCAGCT GGATATGTGT AAAATCGTTC TTGATATTTT TTCAATAAAT GACGTGTGAT 6 0 00 

ACGTTGTAAA TT AG CATTTT CAATATCTAG CAAATAATGA GAAATCTCTT CTTGTATTTC 6 06 0 

TGCCGGTGAT AAAGGTGCAC CATCTACAAA TTGTTCTGTT TTTAATTGAT CTTCAGTTGT 612 0 

CGCTAGTCTA ATTTGGTTGA CTTTCATCTG TTTATTTCCG CGATAGTTTA TGATGTCACC 6180 

TTTAACATGT ACAATTTCTT CAGGCTTGAT TGTTGCCATA TCATTTTTTG TAGCCGTCCA 624 0 

AAATTTCGCT TCAATTTCAC CACTTTTATC TTGCAAATGT AATGTCATAT AATCTTTACC 6 3 00 

TTGTGCTGTT ACACCCTGTG TAGCTTTATG CACTAAGAAA AAGTGATCAA CTGAATCTCC 6 360 

GGGATTTAGA TTCTCTATAT TTCTCATCGT TTCCCGCCTT CCTCTATTTT GTTTAATGTA 64 2 0 

ATCACTTCTT TTGATGGAAC AATATTATCT TTTACACATG TAAAGTATAG TACTTGATAG 64 8 0 

TGTTCTGATA ATGATCGTAA ATAATTCAAC ATTTTTTCAG TACGTTTTTT ATCAAAATGA 6 54 0 

ACAAATGCAT CATCAACAAT TAATGGGAAC GGATAATATG GTCTTAGTAC CTTAATTAAA 6 600 

^ CTGATACGTA AAGCTACATA AAGTAATTCT TTTGTAGATT GACTTAGTTC AACAGGATCA 6 66 0 

TATAATTGAC CATTAACATG TTTAACCGTA ATTGAATCTT CATTATAGTT AATCATCGTA 6 720 

TATCTGCCAT CTGTTAAATG CTTCAATATT TCTACCGCTT CATTAATAAC TTGAGGCAAA 6 78 0 

50 CGTTTATCTT TAATTTGTTT AATGTGTTCA TCAACTAAAC TTTGTAAATA ACTTAAACTT 6 84 0 

GCCCAATCTT TTGCGATATC ATTAAGTTCIA TTTTTAAHA.r T^THA^A^^ n^v— ^™r~r 
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TTT CAT AAGT GATGCTTTAT TAGCAAGAAT ATGTGTTCGC AGAAATTTGT TCTGCATTCT 34 2 0 

ACTTCTACGC TAGTCAATCA GACAATTTTA CCAATCCCCA CTTTCGCGTT TCAAATCAAA 34 80 

CAATACGTCG CTCCTTTCTT CTTATATAAC AATTCTTCTA ACATGATATG TTACTATTGA 3 54 0 

ATTACTGAAC CTGAGTTAGT TATAATCTAA CTTATATTGA AAAGAGATGA GGCGTAAGAT 3 600 

ATGTTTTTAT GTAAAAGACA AATTGATATC AATGCACGAT TTGGTTTGCC TAGAATTGCA 3 66 0 

TTTATGAGTG CAGTTGCAAC CATCATTATG TTTTTAGTTA GTTATGAAGT AATGTATTTT 3 720 

TTATCTAATA CGCCATTATC AGATAGACAT TTTCTCATCT TTTTATTACT TGTATTTATG 37 BO 

ACGTATCCAT TACATAAAAG TATACATTTA TTATTTTTCT TACCATATAG AAAATCGTTT 384 0 

AAAGTTCATA AGTTAACTAA AAGAAAATGG CTTATATTCT ATAATACCTA CGTCAATCAA 3 900 

CCTGTACACA AATTTTATTT TTGCATTAAC TTAATATTGC CGTTAATTAT CTTATCTGCA 3 96 0 

ATGTTCGTTT ATCTAACAAT TTCATTCCCG CAATATGGAC ATTATTTTAT GTTCTTATTG 4 020 

GCATTGAATT TCGGTATTTC CATTACAGAT TTATTATATT TAAAAATAAT TATATTTTCT 4080 

AATTATGGAC AATATATAGA AGAACATAGT ACAGGTATTA ATATTTTGAA AAAAATTAAA 414 0 

AATCCATATC ATTTATAACA AAATAATTAT AGCAAGGTGT TATTATTTGT TTTTAGGCTA 4200 

TGTAATAgcT tACAATCAAA TGTATATAGA CCTTGTTTTT TTATTTTCAT CAATTTCTAC 4260 

CCCTAAACCT AATGCTCTAG TCTGATGTCA TGGGTTATTG ATTGGTGATA ATATAAAACT 43 20 

30 ATGTTATATT CACGATGATT AACTTACAAA GGAGTTTCAA CTATGAAGAT GATAAACAAA 43 80 

TTAATCGTTC CGGTAACAGC TAGTGCTTTA TTATTAGGCG CTTGTGGCgC TAGTGCCACA 444 0 

GACTCTAAAG AAAATACATT AATTTCTTCT AAAGCTGGAG ACGTAACAGT TGCAGATACA 4500 

ATGAAAAAAA TCGGTAAAGA TCAAATTGCA AATGCATCAT TTACTGAAAT GTTAAATAAA 4 56 0 

ATTTTAGCTG ATAAATATAA AAATAAAGTT AATGATAAGA AGATTGACGA ACAAATTGAA 462 0 

AAAATGCAAA AGCAATACGG CGGTAAAGAT AAATTTGAAA AGGCCCTTCA ACAGCAAGGT 46 80 

TTAACAGCCG ATAAATATAA AGAAAATTTA CGTACTGCTG CTTATCATAA AGAATTACTA 474 0 

TCAGATAAAA TTAAAATCTC TGATTCTGAA ATTAAAGAAG ACAGCArGAA AGCTTCACAC 4800 

ATTTTAATTA AAGTTAAATC TAAGAAAAGC GACmAAGAAG GCTTAGATGA TAAAGAAGCG 4 86 0 

AAACAAAAAG CTGAAGAAAT TCAAAAAGAA GTTTCAAAAG ATCCAAGTAA ATTTGGTGAA 4 920 

ATCGCTAAAA AAGAATCAAT GGATACTGGT TCAGCTAAAA AAGATGGCGA ATTAGGTTAT 4 980 

GTTCTTAAAG GACAAACTGA TAAAGATTTT GAAAAAGCAC TATTTAAGCT TAAAGATGGT 504 0 

GAAGT AT CAG AGGTTGTTAA ATCAAGCTTT GGATATCATA TTATTAAAGC TGATAAACCA 510 0 
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TCCGGAATAT AAGATAACTT TCTTCTATAA GCCTCTATGT CATCATTAAT GTTGATATCT 162 0 

GAAATTGATA GAGATCCTTC CATAGGTGTA AGCAATCCTA GCATATGTTT AATCGTTGTA 16 8 0 

CTCTTACCAG CGCCATTAAG GCCAATAAGT CCAACAATTT CGCCTTTGTT TAATTCAAAA 174 0 

TTTATATCTT TAATTACAGG GCGTTTTCCA TATCCACCTG TAAGCTGTTC TACTTTAACT 180 0 

GTCATAAGGC ACCTCCATGA CTTATATTGT ACCAAAAATT AT AAAATG CT CATATTAAAT 186 0 

ACACATGTCC TAATATCGAA TTTTTAGCGA CAATGTTATA ATGAATGGTA ATACTAGTTG 192 0 

AAAAGGAGTG TAG T CAT CAT GTCAGAAACA ATTTTCGGCA AAATTTTAAC TGGAGAAATT 1980 

CCTAGCTTTA AAGTATATGA AGACGATTAT GTCTATGCCT TTTTAGATAT ATCACAAGTT 204 0 

ACTAAAGGAC ATACGTTATT AATTCCTAAA AAAGCTTCTG CTAATATCTT TGAAACTGAT 2100 

GAAGAAACAA TGAAACATAT CGGTGCAGCA TTACCTAAAG TAGCAAATGC TATTAAGCGT 2160 

GCATTTAATC CTGATGGTTT AAACATTATT CAAAATAATG GTGAGTTTGC AGATCAATCT 2220 

GTATTTCATA TTCATTTCCA CTTAATTCCT CGATACGAAA ATGATATTGA TGGATTTGGT 2 2 80 

TATAAGTGGG AAACACATGA AGACATTTTA GATAACGATG CAAAACAACA AATTGCTGAA 2340 

25 CAAATTCAAG CACAATTTTA AATGTATGCT TAATCTAAGC TCGAACGGGT ATAATATGAT 24 00 

TAATATTATA ACAATTGCGT TTGAAGTGAT AACATCAAGG TTAGCAATTT TAAACAAAAT 24 6 0 

GAGTTATCAA GATAACAGAT GTTAAAAGTG AGGAGAATAT AAATGAAAGC ATCACGCATT 252 0 

CTATTCGGTA TCGGTGTTGG CGTAGCAGCT GGTTTTGTAG TTGCACTTCA AGGACGTGAC 2580 

GACAAAAGTG TCAAGAACAA CACGATCGAT CGTACTGCCC CTACTGGTTC AAAATCAGAA 264 0 

CTACAACGTG AATTTGAAAC GATTAAACAA AGTTTTAATG ACATTTTAAA CTATGGTGTT 27 00 

CAAATTAAAA ACGAAAGTGC GGAATTTGGT AGTTCAATTG GTGGTGAAAT TAAGTCATTA 27 60 

CTTGGAAACT TCAAATCTGA CATTAATCCT AATATTGAAC GTTTACAGTC ACACATCGAA 2 82 0 

AATTTACAAA ATCGTGGCGA GGATATTGGA AACGAAATTT CTAAGTAGCA GGTTACGTTC 2 8 80 

TCGATCACAA CTATTTTTAT TAGTAACAGC ATATTTATTT TTTAAAATTA AATGC CAAAT 2 94 0 

AAACGAGATG ACATTAGAAA TTAGATATTT CTTGTCATCT CTTTTTTAAA ACTCAAATGA 3 COO 

4S ACTTATGTTT ACAAATTATA GGAAGACATT GTTTGTAGTG ATTTTCGCTT AAATCATATT 3 0 60 

TATGAATTGA TTGAAAACAT TGCTTAGGAT TCATTGTGTT ATCCtTGCAC TTTGATTACG 3120 

CTTTACTTAA ATCATTATCG ACAAACAACA TACTTATATT TTCATTGAGC CGAACCTTAT 3180 

50 ATACACATTA CATATACCTT ACTTGCACAA ATTATTAATC TGGTGTTTAT TATAATTACA 3 24 0 

TATCACTATA TTTTTAGCAT TTGTATAA^T T.^^^T^a nsAnr.^^ - 
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(A) LENGTH: 7963 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

TAATCTATTA TAAAAACTGT CCATACCCTT TGATTACCTT CTCTTCAGGT ACAGGCCACA 60 

CTTGAGGCCA TAAGCCATAT GCTTGCTGTG AATAAAATTG TGCCATTTGT AACAATATAA 120 

TATATACAAA TAAACACCCA ATAATTGCTG TCACTAATGG ATATGATAAC CAAACCATTA 180 

15 ATAAAACTGC AATAATTACT AACCTAAAGA TAATATTAAA TGCGTCTCTC CCTCTTATAA 240 

AGCTTCTAAT AAATAAGAAT AAATACATCG CATTAGAGTT AAATTTACTA CCCTTTGGAA 300 

CTGGTAAAAG TATATCTAGA TAACTTCTTC TGACTGCAGA TTCTTTCAAA TGTTTTACAT 360 

CGGTGAACAT ATTAACAAAT TTATAATAAT TCATATGATG TCGATGTTCG ATTGCAATCA 420 

TTTTCTCCCA AGGATACAAA AAGCCTGGTT TATATTTTTT AACTAAAAAT TCTATTAACA 480 

CAGGCAAAGC AACCATCACA AATGCGATGT ACCATTTTGG AGCTAATAGT AAGTAATATG 540 

TTAGAGCAAA GGTGATGAAT GATATTAAAT TAACTTGCCA TGTTTTAAGT CCCGATTGAT 60 0 

ACCATTGCCA TCTTAAGCGT AAACCAACAT ATGGAAAAAT TAATGCACTG ACTCCAAAAC 660 

AAATATAAAA TGCCACATTA TGTTGATTAA TATTGTAAAA CAACGGGAAC ATTACAATAA 720 

CAATAATGAG TTGGATTAAT ATGCGCGCAA AGTAACTATA TAAAATCGCA TGACGCATAA 780 

ATTGAGACAT GTGTTTTTCA AATGGTAATA AAAAGATTTT ATCCgCTTCT TTTAACAGTG 84 0 

35 GTCsCmTTGG AAAAATAGrT GTCAACGCAA CAATCACTGC TGCTATTaAT GAAAAATTGa 900 

TATTCGTTGG AATATGTTTT AACCATTcAC CATATCCArA AATAAATGCA CCCAGCAAAA 960 

TAAGTAAAAA GACCATGAAA TGACCATTAA ATATAAACTT ATTATAATAA TTTTtCTCTT 1020 

40 TACGAAGGGC ATGTAATCTT TTATTAAATA ATGTGGTAGC TTGGTTACGC ATGTACATCT 1080 

CCACCTTGCG TCACATGAAT ATATATATCG TCTAATGTTT GATTATGTAA GCCAGTTTGT 114 0 

TGTCTCAATG CTTCTAAATC TCCAAATGCA ACGACTTCAC CTTCGTCTAG TATGaTAAAA 1200 

CGATCACAGT AACGTTCAGC TGTTGCTAAA ATATGTGTAC TCATTAGAAC GGTTCTACCT 12 60 

TCGTTTTTCT TTTCAACCAT TAAATCTAAC ATGGATTGAA TTCCTAATGG ATCTAGGCCA 132 0 

AGGAATGGTT CGTCTATAAT ATACAATTCG GGATTAACGA TAAACGCACA AATAATCATG 1380 

ACTTTTTGTT TCATCCCCTT AGAAAAATGA CTCGGAAAAA CTTTCAACTC ATTTTCTAAA 144 0 

CGGAATGTCT TTAATAATGG CATTGCTCGA TTCATCGTTT CATCACGATC AATATCATAT 1500 
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AGATGAATTA AAAGCTGTTG TATTTGAATA TCAGTTGTTT CAATACTATG TTGTTGAAGT 94 8 0 

GTCTCTTGTA TAATATGCGA AATCATCCTT TGGTGTGAAT CAGGTAATTC aTTTAAAATT 954 0 

AGGTCTTCAA CATGTACATG CCCTGATGAT AATTGATTTA AATGGATGAT GGCATTAGTG 96 0 0 

ATATCATTAT CTGTTCCATC GAC 96 2 3 
(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 

is (D) TOPOLOGY: linear 



bu 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
ACCGTGGAAA CACGTCTAC-T CAATCACAAA GCGATAAAAA TGTGACTAAA TCATCTCAAG 

AGGAAAATCA AGCAAAAGAA GAATTACAAA GCGTTTTAAA CAAAATTAAC AAACAATCAA 12 0 

GTAAGAATAA TTAAAAAATT TTGATATTGT CTATGTTTAT AGTTCACAAG CCATTCAACG 180 

TATTGTAAAC TAAGGATAGT GTATTTTTTT AATAGTAATT TGTCAGGAGG TG CCTATCT A 24 0 

TGGAAGAACA TTACTACGTA AGTATTGATA TTGGATCATC AAGCGTAAAA ACAATAGTAG 3 00 

GCGAGAAATT TCACAATGGT ATAAATGTGA TAGGTACAGG ACAAACCTAC ACGAGCGGTA 3 60 

TAAAAAATGG TTTAATTGAT GATTTTGATA TTGCGCGACA AGCAATCAAA GACACAATTA 4^0 

AAAAGGCATC AATCGCTTCG GGTGTTGATA TTAAAGAAGT TTTCCTGAAA TTACCTATCA 4 80 

TTGGAACGGA AGTTTATGAT GAATCAAATG AAATCGACTT TTATGAGGAT ACAGAAATCA 54 0 

ACGGTTCACA TAT CGAAAAA GTATTAGAAG GTATTAGAGA AAAAAATGAT GTGCAAGAAA 6 00 

CAGAAGTAAT TAATGTGTTC CCGATTCGTT TTATAGTCGA TAAAGAAAAT GAGGTTTCAG 66 0 

40 ACCCTAAAGA ATTAATTGCC AGACATTCAT TAAAGGTTGA AGCAGGCGTA ATTGCTATTC 720 

AAAAATCGAT TTTAATTAAT ATGATTAAAT GCGTAGAAGC ATGTGGTGTT GATGTATTAG "?8 0 

ATGTTTACTC TGATGCATAT AACTATGGTT CAATCCTAAC AGCTACTGAA AAAGAGTTAG 84 0 

GTGCATGTGT CATTGATATT GGTGAAGACG TTACGCAAGT TGCTTTTTAT GAACGCGGTG 500 

AATTAGTAGA TGCTGATTCT AT CG AAATGG CAGGGCGTGA TATTACaGAC GATaTTGCAC 96 0 

aAGGrTTaAA CACTTCTnAT GAAACTGCTG nAAAAAGTTA AACACCAATn TGGTCATG C A 102 0 

T 1021 
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ATCC1T1TGA TGTCGTTtCA CCGCCTAGAG TCAGCGCGAT GGCGATAAGG AGTCCACCAA 76 3 0 

CTACGATAAA AGGAACCATA AACGATACAC CGTTCATTAA ATGTTGATAC ACCATTTGAA 774 0 

5 

T AC CATTTTT AGACTTACCG CGATCTTTCG AATGATAATT TGTTTCAGAT TGATAAATAG 780 0 

GCGCATCTTG ATTAATGATA CGTTGAATTA GACCTCTCGG ATTATGAATC CCTTCGCGAA 7 86 0 

CATTTTCATT AATCAACCGT TTACCAACAA ATCGGGACAG ATCAACTTGT TTATCAGCTG 792 0 

W 

CAATTATGAC ACCGTCAGCT TCTTCGATGT CTTGCGTAGT TAAAACATTT TCAGCACCAA 7980 

CACCGCCCTG TGTCTCTACT TTAATATCCA CACCCATTTC TTTTGCTACC TGCTCAAGCT 804 0 

15 TTTCTTGAGC CATATATGTA TGTGCAATGC CATTTGGGCA TGAGGTAATA GCTACAATTT 8100 

TCATAAAATC ATCTCCTTTT CTATATTGTA AGCGTATTCT CGATACTAAA AAAAAGAATA 816 0 

ATTACCGTTA CTAGTGGCAA TTATTCTTGT AAGTATTCAA ATAACTGTTG CTTTAAACTA 8220 

20 TGATCATCTA AACTACATAA ATGGTTCACT GAATCATCAT CCAAGTTAGC AATTAATTGC 8280 

ATCATTTGTT TTGTAAAAGC TTTGTCTTTA TGCGAAATCG CTAAGAAAAA GACAAGTTTG 8340 

ACATCGTGTT GTCGCCAAGG AAAAACATCT TTTGTGCGAA AAATAAGCAC ATGTGATTGT 8400 

25 AAAACTTTTT CAGGATCTCC ATGAGGAATC GCCATAAAAT TACCTATGTA TGTAGAAGAT 8460 

GATTTCTCAC GCTCTAAAGC TGATTCGATA TATCCTTCTA CAATCGCATG ATGTGCTTGT 8520 

AATATTTTTT GAGCTTCTTC AAAAATTTGC ACAGTATGCC GTGATTTTTG TTCAGTATTT 85 80 

30 

ACGACAAGGA AATTGACAGT GTCCATATGA TGATGTGCTT GAACCGGATT TTGCTTTTGC 864 0 

TTCACAACGT GTCTGATTTT GTGACGATCA TCTTCAGAAA ATAATGGTGC AACCTTGATA 87 00 

GTCGTCAGGT GCTTAGGAAG TATGTTTAGC GTTTGTTTAG GAATATCATG GGTCGTTATT 87 60 

35 

AATAAATCTA CATTGTCAAA GTGATAGTGT GTTATATTTT CTAGTTTAAT CGTATTTATC 8820 

ACTGACAACT CTTCGGATAA GTTATTTATT TTAGTTTCTA AAAAATTCGA CACACCTAGA 88 80 

40 CCATAATAAC AAGCAATGAC TACATTTAAT TGTGTTTTGG TACGACGCTC GATGGCAGCT 894 0 

TGAAAATGAA TTGTTAAAAA TGCAATTTCA TCTTCGCTCA TCT CTATATC AGTATCAATT 90 0 0 

GCTAATTTAT CAATCGCTTC AAAAAGTGTG TTAAACACAA AGGGATAGAG TTTTTTAATC 9060 

45 TCTATAACTA AAGGATTGTT TAAATAAATG TTTTGAGTGA TACGTAAATA TGCTTTACTA 9120 

AAATGATTAT ATAAATTTTG TTGTAAAATC GAATCTTCAT TGAAAGGTAC ATGAATACGT 9180 

TGCTGCATCA ATTCGATTAA GCGATCAATA TAACTTTGTA TAAATATACG TTCTATGCCA 924 0 

50 

ATATCGAGTT TATTAAAATG ATAAGCAATA AAGAATGAAA ACATATTGAT TACTTTTTCG 93 00 

TTCAAGTCAT AACCTAATCT TTCGTTGATT TGCTTAATGC AAGATTGAGA TATCAATTTT 936 0 
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GTTCGCCTTC GTGTTTTAAA GCGTAGTCAT CATCTGGGTG AACTTGAACA GATAATTTAT 58 8 0 

CATTGGCATC TAATACTTTA GTTAGCAGAG GGAAACTATC TCGTGAATCA TTATCGAATA 594 0 

ATTCACGATG TTGTGACCAA AGTTGATCTA GGGTCATATC CTTGTATGGA CCATTGATAA 6000 

TTGTATTAGG ACCATTTGGA TGTGCAGAAA TTGCCCAGCA TTCACCAGTT GTTTCATTAG 606 0 

GGATATCATA GTTAAATGCT TTTAATGCAT GACCGCCCCA AATTCTGTCT TTAAAAACGG 612 0 

GTTGTAAAAA TAATGCCATA GTTAAAACTC CTCTATATTT TCATTAATAA GTTATAAATT 618 0 

TCTGTAGTAC TGTTTGCATT AATTAGTGAT TGGCGTGTCT CATCATTCAT TAACGCTTTA 6 24 0 

GATAAGCGCT GAAGTATTTT TAAATGTGTA TCCTGACTGT TGTTTGGTAC GGCAATTAAG 63 0 0 

AATATCAATT GAGGTAGACT ACCATCTAGA CTGTCCCATT TAACACCATG ATTATTTTTC 63 60 

ATAACAGCTA CAATCGGTTG TTTTACAACA TCAGACTTTG CATGTGGAAT GGCCACGTTC 64 20 

20 ATG CCAATAG CTGTCGTAGm tCcATTTCAC GTTCTAGTAT TGCATTTTTT AAATGCGATG 64 80 

TGTGCTCTAC ATAACGGCAA ATTTTAAGTT TATGAATCAA CATATCAATT GCTTCGTTTC 654 0 

GAGACATGTC GTGATCAGTA ATTATCATAG TTTGTTGATC AAAAACATGA GAAGGTTTAT 6600 

TGAGATGTGA ATGTTTCGCG GTGTTATCTA CATTGTCAAC CTCTGTATCA TGTTGTGTAA 6 660 

TATCTGTATC ATGAAGTTGC GTGTGTTGCG CTGGTGCATC TACTGCTATA ACTGGTGTAT 672 0 

TGCGTTTTAA TAATAGTACA GTAGTCATTG TGACAAGACT ACCTACTATC ACTGCAAAGA 6 78 0 

TAAACCATAA TACATGATCA ATACCAC CTA ATACAGCCAC GATTGGACCT CCATGTGCGA 6 84 0 

CTCTATCGCC GACACCACCA ATGGCTGCAA TGACTGATGC AATCATTGCA CCAATGATGT 6 90 0 

TTGCAGGTAT AATGCG CAAT GGATCTTGGG CTGCGAAAGG AATAGCACCT TCAGTAATAC 696 0 

CAAATAGTCC CATAGTGAAG GAAGCCTTAC CCATTTCTCT TTCGGAATGA TTGAATTTAT 7020 

ACTTTTGAAC AAACGTTGCT AAACCTAAAC CGATTGGTGG TGTACATACA GCAACTGCGA 7080 

40 CCATACCCAT AACGGCGTAA TTACCTTCAG CAATAAGTGC TGAGCCAAAT AAAAATGCTA 714 0 

CCTTGTTTAC TGGACCGCCC ATATCGAAGG CAATCATCGC ACCTATAATC ATCGCAAGTA 72 0 0 

TAATAATATT AGCACCTTGC ATACTTTTTA ACCAGGTTGT TAATGCCTCA AAAATATTAG 72 6 0 

AAATTGGTGC A C C G ATT AAA AATATAAATA TCAATCCTAC AACGACCGAT GAAATAATGG 7 320 

GAATAATAAT GATAGGCATA ATTGGTGCCA TTGCTTTTGG AACTTTAATA TCTTTAATCC 73 80 

ACTTTGCGAT ATAACCTGCT AAGAAACCAG CAACAATACC ACCTAAAAAT CCTGCGCCTG 74 4 0 

CATCACTGCC ATAAAAACTA CCGTCAGCAG CGATAGCGCC GCCAATCATA CCAGGAACAA 7 500 



25 



30 



35 



45 



50 



EP0 786 519 A2 

AACAGTATTA CCAAGTATTG AACAACAATA CATTAGTGCT GTTAAAAATG CTCAAGCAAA 4 0 80 

CTTCTCGAAA GTGAAAAGTG ATGTAGcTAA AGCTGCTAAC TTTGTGCGCA ATGACTTACC 414 0 

5 

ACAGTTAGAA CAGCGATTAA CTAATGCGAC AGCAAGTGTG AATAAAAATT TACCAACGTT 4200 

ATTAAATGGT TATGATCAAG CGGTAGGATT ACTAAATAAA AATCAGCCAC AAGCGAAAAA 4260 

GGCTTTATCA GATTTAGCTG ATTTTTCTCA AAATAAATTG CCTGATGTTG AAAAAGATTT 4 320 

10 

GAAAAAAGCG AATAAAATTT TCAAGAAATT AGACAAAGAT GATGCAGTCG ACAAATTAAT 4 3 80 

CG ACACACTT AAGAATGATT TGAAAAAGCA AGCGGGTATT ATTGCAAATC CTATTAATAA 4440 

15 GAAGACTGTT GATGTTTTCC CAGTTAAGGA TTATGGTTCA GGTATGACAC CATTCTATAC 4 500 

TGCACTGTCA GTATGGGTAG GTGCACTCTT GATGGTAAGT TTATTAACGG TTGATAATAA 4560 

ACATAAGAGT CTAGAGTCAG TCTTAACGAC AAGACAAGTG TTCTTAGGTA AGGCAGGATT 4620 

20 CTTTATAATG CTTGGTATGT TGCAAGCACT CATTGTATCG GTTGGAGATT TGTTAATCCT 46 80 

AAAAGCAGGA GTTGAGTCAC CTGTATTATT TGTACTTATA ACGATTTTCT GTTCGATTAT 4 740 

TTTCAACTCA AT CGTATATA CGTGCGTATC ATTACTTGGT AACCCAGGTA AAGCCATTGC 4 8 00 

25 

AATCGTATTG CTTGTATTAC AAATTGCAGG TGGTGGGGGA ACATTCCCAA TTCAAACTAC 4 860 

GCCACAATTT TTCCAAAACA TTTCGCCATA CTTACCATTT ACGTATGCAA TTGATTCATT 4 920 

ACGTGAAACA GTAGGCGGTA TTGTTCCGGA AATCCTAATT ACAAAATTAA TTATATTAAC 4 9 80 

30 

GTTATTTGGT ATAGGATTCT TCGTTGTAGG TTTAATTTTA AAACCTGTAA CAGATCCATT 5040 

GATGAAGCGC GTATCTGAAA AAGTTGACCA AAGTAACGTT ACAGAATAAA AATTAAATCC 5100 

35 ACACATTAGG GTTATAGCTC CTTAATGTGT GGATTTTTAT GTTTTTAGAC AGAAGAGATA 5160 

GTAATTTCTG TCTTTTATGG GACGGTTGTT AT CATTGCTA TTATCCAGGA TGACTTACTA 5220 

TAGGACTAAT ATTACCGACA AAGTGAATAT CCTCGTCTTC CGTAGTTAAA ATAAAGCTAG 52 80 

40 AACCTTTTTG GATGTCATAG TGCTTATCGT TTACTGTTAA AGTACCAGTA CCATCGATAA 5340 

TTGTAACTAA GCAATAAGCA TGTGGTTTAT TGAATTTTAA ATCTCCATGA ATATCCCATT 54 00 

TATATACTGC AAAATATTGA TTATCTACAA ATTGAGTTAC AGTGTGTGTG TCGATGTGAG 54 60 

45 

TTGTTATAGG AGTAGTATTT GGTTCATGAT TGCCTAATTC AATCACATCT TTACTTTGCT 5520 

CTAAGTGCAA ATCACGCAAT TGACCATTTT GATCTCGTCT ATCATAGTCA TAAATACGGT 5580 

ATGTCGTATC GGAGGATTGT TGTGTCTCTA AAATTAAAAT ACCCGAACCA ATGGCATGGA 5640 

50 

CAGTGCCAGC AGGAACATAA TAAAAGTCAC CGGGCTTAAC AGGTATACGT TTGAAAAGAC 5700 

TGTCAAATTC ATGATTATCA ATCATGTCTA TTAACGTCTG TTTATTATGT GCATGTACGC 5760 
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TAACTTATGG GCAATGTGGG ATCCATATGG CAACACGGGA CACATCAAGG TCGCAGTCGT 22 8 0 

TAATGAAGAT AAAGGCGACA CAATCAGAGG GAAAAAAGTT AATGTCGGTA ATACGATGGT 234 0 

TAATACACTC AAGAAAAATA AAAGTTTTGA TTGGCAGTTT GTAAGTAGAG AGAAAGCTGA 24 0 0 

TCATGAGATA AAAATGGGTA AATATTTTGC AGGTATTTAC ATCCCATCTA AGTTTACACA 24 6 0 

TGAAATTACA GGGACACTAC GTAAGCAGCC TCAAAAAGCA GATGTAGAAT TTAAGGTGAA 252 0 

TCAGAAGATT AACGCTGTTG CGTCTAAGCT AACAGATACT GGTTCGTCAG TTGTCGTTGA 2 580 

AAAAGCGAAT GAACAATTTA ATAAAACAGT AACTCGAGCA TTATTAGAAG AAGCTAACAA 264 0 

AGCAGGTTTA ACTATTGAAG AAAATGTGCC GACAATTAAC AAGATAAAAA ATGCGGTATA 2 700 

TTCAGCAGAT AAAGCTTTAC CTAAGATTAA TGACTTTGCG AATAAAATTG TATATTTGAA 2760 

TAACCACCAA GCGGATTTAG ATAAATATGC CAATGATTTT AGAAAACTAG GAAATTATAA 2 820 

AGGTGATATT TTAGATGCTC AOAJVAAAATT AAACGAaGTC AATGGTGCTA TTCCCCAACT 2 880 

TAATGAAAAG GCTAAGTTGA TATTAGCTTT AAATAATTAT ATGCCGAAAA TTGAAAAAGC 2 94 0 

GTTAAATTTT GCAGCTGATG ACGTGCCAGC GCAGTTCCCT AAAATTAATC AAGGACTTAA 3 000 

CATTGCGAGT CAAGGTATTG ATCAAGCTAA TGGACAGTTA AATGATGCCA AAGGCTTCGT 3 0 60 

CACACAAGTT AGAAGTAGAG TCGGTGATTA TCAAGATGCA ATTCGACGCG CGCAAGATTT 3120 

AAATCGAAGA AACCAGCAAC AGATTCCTCA AAATAGCGCG GCGAACAACG AAACATCAAA 3180 

TAGTGCACCT GCAGCTGGTA ATGGTGTAGC ATCAACGCCA CCAAGTGCAC CAAGTGGCGA 3 24 0 

TACTGCACCA AATAATAATG TTACGCAAAA TACCGCACCA AATAGTAATA ATGCGCCTGT 33 00 

ATCGACTACA CCACAAAGTA CAAGCGGGAA AAAAGATGGT CAAAGTTTTG TAGATATAAC 3 3 60 

AACAACACAA GTCAGCACAG CTAACGAGAA CACACAAAAC ATTACAGATA AAGATGTTAA 34 20 

ATCAtfTGGAA GCGGCATTAA CGGGCTCTTT ATTATCATTA TCAAATAATT TAG AT AC CCA 34 80 

40 AGCGAAAGCC GCACAAAAAG ATAGTCAGGC ATTACGTAAT ATTTCGTATG GGATTTTAGC 3 54 0 

AT CGG A CAAG CCTTCTGATT TTAGAGAGTC TTTAGATAAT GTTAAGTCCG GTTTAGAATA 3 6 00 

CACAACGCAA TATAATCAAC AATTTATCGA TACATTAAAA GAGATTGAGA AGAATGAAAA 3 6 60 

•*5 

TGTTGATTTA TCAAAAGAAA TTGATAAGGT AAAAGCAGCT AATAATCGAA TTAATGAATC 372 0 

ATTAAGGTTA GTTAATCAAT TAAGCAATGC ATTAAAGAAT GGTAGTTCAG GAACTGCTGA 37 80 

AG CT ACT AAA TTACTAGATC AACTTTCAAA ACT AG ATT CA TCATTATCAT CATTTAGAGA 3 84 0 

50 

TTATGTTAAA AAAGATCTTA ACAGCTCTTT AGTATCAATA TCACAACGTA TTATGGATGA 3 900 
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ATGAATCATC ATAATCCTTG ATAGAACGTT 
CGTCAAACTG AGTTAATTGA TAGTGTTTAA 
5 GATCTGTAGC ATATGTTTTA GATAAGTGTG 

ATTTCCATGT TGGTTTATAA ATTGTTCGAT 
AGTAATCTTT TAGTGATTCT TTCGTGCTTG 

10 

ACAATTGATT ACCATCAGCT TCTAATGTGT 
CTTTGATACC GAATAAATTA TGGTTTGGTG 

15 ATTCTAAGAT TGCTTGGGCA ATCATGACAG 

GATGTGCATC TTTAGCAATT GATTTGACAA 
TAAATTGTCC GCTATCATCA TTGTTAGATA 

20 CACGTGTATC CTTTTGATTA ACATCGTTAT 

TCAATTCATC TTGTGTTGGT AACTGTGGAT 
TTTTAGATTG AGATGCATAA TCTTTTTGTG 

25 AAATAGAGTC TAAAGCCGAA TCTGACATTG 

TTGCTTTATC GTCACTTGCT GGTTGACTAT 
AATTTGGTTG CTTATTAGAT GTACTTGGTT 

30 

TGTCTGCTTT ATCTTGTTTA GATGATTGCG 
TGTTTTTATT CGAATCATTT GTTGACTTTT 

35 TATCCGAATT TAAATTGAAT AAGTTTTGGA 

ATTTATTTTT GGTTAGCAAT TGGTTTATAT 
CAATGATATT GTTAGAGTCT GAAGTGCTGT 

40 TGTCTTGGTT ACTTGTATTA TTTTTGTCTG 

TAGAAGTTTC ATCGTCATTA GATTTTTTTG 
CTTTTTGAGG TGTATCAGCA TAAGCGgTAG 

45 

TTGATAGCAA ATAAATTAAA ATTTTATTTT 

TATAATAATT AAGTGTGATA ATAAACTATG 
AGTTGATAGG TATCAATCGA CTAAATATCT 

50 

GCAAAAATAA ATTAATTTAC AAAAAATATA 
TGTGACAATG AAGAACGCAT TTAAATTATT 
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CATATTTATC TAAATCTGGC ATGCGTTCAT 4 80 

TAATACTGTT TAATTTCTTA GCATAGTTTG 540 

ATGTTGCATC TTTATAAGAA TCGGCTTCCG 6 00 

TGCCATCAAT ACCATTTTTA ATAAGGTCAG 660 

GATATTTTCG GAATCCAGCA TTAATACTAT 720 

TAAAAGGAAC AGAATTCCCT TCaAAAGCAC 780 

ACwTAGCTAA AGCACTACGA CCTGAGTCAG 840 

ACGCATAAAT ATCGTTATCT TGACCAATGC 900 

ATTGACGTGT ATCTTTTGAG TCAACAACGT 960 

TACTAGGATC TGTTTCGAAT AATGATGTTG 1020 

TGAATGATTG AGCAGGTTTA GATTTATGTT 1080 

TCTTTGTATT AGATTTTTCA TTTTTGTCTT 114 0 

TTTTCTTTGC ATCTTCACTG TATTGATCCA 1200 

ATTGATTATC TTTCGATGAA GATTTTTGAT 1260 

TTGATTGATT AGGTTGTGTT GGCTTTGGCG 1320 

TTGTATTGTT TGATTTAGGT GCTTTTTGAT 1380 

TATCAGTGTC ATTTTTGATG CTATTGTCAC 144 0 

CGCCATTACG AGGTTGTTCG TAATCAGAAA 1500 

TTAAAGTTGT TAATGAGTAA TTATCATCGT 1560 

TGGTTTGTGG TAAATTCTTA TAAATAAAAT 1620 

CGTCTATAGT TTTAAATTTT TTGTCGTTAT 1680 

CTTTATCAAT ATCTTTACTT GTAGTATCCT 174 0 

AATCATGAGA TGTTGTCTTA GCTGTAGTAT 1800 

GTGAAaCTAA AGTAGGTAAT ACGAGCGTAG 1860 

TAGGCATATT TCGTATTCTC CCTTGAAAAA 192 0 

ATTTGTTATA ATTTATCGTA TGCTGAAAAT 198 0 

TCCAGTAAAT TGATTATACT AATTCACAAC 204 0 

TAAAAAATAT GAATAATTCC TACATAGGAG 210 0 

TAAAATGGAT CTGAAGAAAG TAGCTAAGAC 216 0 
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AAAAGGTAAA AAATCAATTG CAATACGTTT AAATTATTTA GACACAGAAG AAACATTGAC 4 56 0 

AGATGAGCGC GTTTCAAAAG TACAAGCGGA AATTGAAGCA GCATTAATTG AACAAGGTGC 4 620 

TGTTATTAGA TAATGATTTA AACCCCATGT ATAAGGATAT CTGAAGTAGA TTGATATCCC 4 6 80 

TAACATGGGG TTTTATTTTT GGGTTCACCA ATTTGGTTCC AATGCATTTA AAAAGTCAAA 4 74 0 

GAGGAACAGC GGAATACAGA TGATGcTTCG CACAACTGCA TAAAAGCCTC TAATGATTAA 4 8 00 

AAATCAAAGA GGCTTTAAAA TTTTTTGGGC TTTTTCACGA TTTTTAAAAT GCTTTTTTGA 4 8 60 

AATGGTATCT AAACGTGAAA GACCGTATTT TTTTATAATT TTGGCGGCGA TTACATCGAC 4 920 

TTTAGCACCG GCACCTTTAG GAATCGTCAT ATTAATATTT TTTGATATTT GATCCATATA 4 9 80 

TGTAACAAAT GCGTATCGAG AAATTATGCT TGCCACTGCA ATGGCTAATG ACTTCGATTC 504 0 

TCCTTTTGTT TCAAATTTTG TTTTCTTTGG AAGTGGTATA TCTGATAATG CGTAATGGCT 5100 

20 ATACACTTCG CGTTTTGCGA ACTG AT CAAT GACGATATAG TCTAATTGAG ACGAATCAAT 516 0 

TTTTTCAAGT ACATTTTTGA TGGCTTCATT ATGAAGGGCA GCTTTCATTT TTACTTGAGT 52 2 0 

CCAGCCTTTT GCTTGCTGAA TATTATATTT TTCATTGTGT AGTGTTAATA ATGAATGTGG 52 8 0 

TATGAAAGTA ACCAATTGCT CAGCAAGTTC TACAATTTTG GTATCGGTTA ATTTTTTTGA 534 0 

ATCATCTACA CC CAAAGTTT TTAAAATAGG GACATGCTCT TTGGTAACGA AAGCAGCACA 54 00 

CACAGTCAAC GGACCAAAGT AATCGCCACT TCCAGCCTCA TCACTACCAA TACAGTTAAA 546 0 

TTGrTCATAC ATTAaAGTTg TcCAgAAAAG AATTAGCCAT ATTTnCCTTT 5510 
(2) INFORMATION FOR SEQ ID NO: 166: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 166: 

GnTTATACTT ATAAATTTTA CGGGGGTAAT ATAATACTtA TTTACCTGTA ATATATGATA 6 0 

ATTCTTCAGC GGCAGCTGCG TTGATAGTTC TATGAGAAAT GATAC CTAAT CCTTTAACAT 12 0 

TGGATTCTGA AATAACGATA GAACCATCAC TGTTAACTTT TTCAACAAAT GCTACATGAC 18 0 

CGTAATGTTG ATCTGCACCA AATTGTCCAG CCTCAAATAC AACAGCAGCA TGACGTTTTG 24 0 

GTGTATGACT TACTTGATAA TCACGGTATT GAGCTCGATT ATTCCAATTA TGTGCATCAC 30 0 
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ATATTTAGAT GATCAAGTAA TGGAATTTGA 
TATGATAGGT ACTGCTTATG AAGTTGCAGC 
5 GACAACATCA AATGAGCTTG ATTTATCTGC 

TGAAGATAAA GTACCATATT ATAGTGCACG 
GCCAATTTGG ATGCAAGCAC GCTTAATAAA 

10 

TGACATTTCA AATTATGTGT TATTAGAATA 
TGCGATTGGT TCACAACAAA TTGTTGTTCG 

75 ATTAGATGAT ACAGAACGTG AATTATTAAC 

TCCAATTGCA TTAGCTGGTG TTATGGGTGG 
AAATATAGTG ATTGAAGGTG CTATTTTTGA 

20 TTTAAATTTA CGCAGTGAAT CATCTAGTCG 

AGATGAAGCA GTCGACCGTG CATGTTATTT 
AAAAGATAGA GTGTCTTCAG GAGAACTTGG 

25 TGATAAAATT AATCGCACTA TTGGATTTGA 

TAATCAACTA GGGTTTGATA CAGAAATAAA 
ACGTCGTAAA GATATTACAA TTAAAGAAGA 

30 

atacgacgat attccatcaa cgttacctgt 
tgatcgccaa tataaaacta gaatggttaa 

35 agctattacg tattcgttag tttctaaaga 
tcaaacaatt gatttattga tgccaatgag 
att£ccacat ttaatcgaag cggcatcata 

40 attatttgaa atcggcaatg tcttctttgc 
tgaatattta agtggtattt taactggaga 
agaaacggtt gatttctatt tagcaaaagg 

45 

tcttgaattt agttatcgcc gtgctgatat 
aatcttatta gagaataaag ttgttggttt 
tgataatgat ttaaaacgta cgtatgtttt 

50 

GTCGGTAGGT TACATTAATT ACCAGCCAAT 
TGCATTAGAA GTAGATCAAA ATATTCCAGC 

55 



TTTAACGCCG AATCGTGCAG ATGCTTTAAG 276 0 

ATTATATAAT ACAAAAATGA CTAAGCCAGA 282 0 

AAATGATGAA CTGACTGTGA CAATTGAAAA 28 8 0 

TGTTGTTCAC GACGTGACAA TTGAACCCTC 2 94 0 

AGCGGGTATA CGTCCTATTA ATAATGTTGT 300 0 

CGGTCAACCA TTGCACATGT TTGATCAAGA 306 0 

TCAAGCTAAT GAAGGCGAAA AAATGACAAC 3120 

GAGCGATATT GTCATTACTA ATGGACAAAC 318 0 

CGATTTTTCA GAAGTTAAAG AACAAACATC 324 0 

TCCAGTTTCA ATTCGTCATA CATCAAGACG 3300 

TTTTGAAAAA GGAATAGCTA CTGAATTTGT 3360 

ATTACAAACT TATGCAAACG GAAAAGTGCT 3420 

TGCATTTATT ACACCAATCG ACATCACTGC 3480 

TTTGTCACAA AATGATATTG TTACTATTTT 3 540 

TGATGATGTT ATTACAGTGC TAGTACCATC 3 600 

TTTAATTGAA GAAGTTGCAC GTATATATGG 3 660 

CTTCGATAAA GTTACTAGTG GTCAGCTAAC 3720 

AGAAGTGTTA GAAGGTGCTG GATTAGACCa 3 780 

AGATGCTACT GCaTTTTCGA TGCAACAGCG 3 84 0 

TGAAGCGCAT GCGTCATTAC GTCAAAGTTT 3 900 

TAATGTGGCA CGCAAAAATA AAGATGTAAA 3 960 

TAATGGAGAA GGTGAACTAC CAGATCAAGT 4 020 

TTATGTAGTC AATCAATGGC AAGGTAAGAA 4 080 

TGTCGTGGAT CGAGTATCTG AAAAGTTAAA 414 0 

TGaTGGATTA CATCCAGGTC GTACTGCTGA 4200 

TATTGGTGAA TTACATCCAA TATTAGCAGC 4260 

TGAGTTGAAT TTTGATGCAT TAATGGCTGT 4 320 

TCCGAGATTC CCAGGCATGT CTCGTGACAT 43 8 0 

AGCTGATTTA TTATCAACGA TTCATGCACA 444 0 
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ATAAATAATT GTTTTAGGGA GAATAATCGT GACTGCAAGT TATTCCAATT ATTTAAAGTC 96 0 

TTTTCACCTT TTTGG TT ACT TAAAGAGATT TAAGTCGGAA AGACAATCCG TTATCAATAT 102 0 

TAAACAAGTG TATGCTTAGG CATAAATTTG GGTGGTACCA CGGAAATGAC TTTCGTCCCT 10 BO 

TATTTTTTAA GAGGATGAAA GTCTTTTTTT AGTTAAACAA CAAATATGAT AAATAGAAAA 114 0 

TGAATAGTTC GAATAGGGAG GTCAGTGACA TATGTCTGAA CAACAAACAA TGTCAGAGTT 120 0 

AAAACAACAA GCGCTTGTAG ATATTAATGA AGCAAATGAT GAACGTGCAC TGCAAGAAGT 126 0 

TAAAGTGAAA TACTTAGGTA AAAAAGGGTC AGTTAGCGGA CTAATGAAAT TGATGAAGGA 13 2 0 

TTTGCCGAAT GAAGATAAAC CTGCGTTTGG TCAAAAAGTG AATGAATTGC GTCAAACAAT 13 80 

TCAAAATGAA TTAGATGAAA GACAACAGAT GTTAGTTAAA GAAAAATTAA ATAAGCcAAT 144 0 

TGGcTGAAGA AACAATTGAT GTATCATTAC CAGGTCGTCA TATTGAAATC GG TTCAAAGC 150 0 

ATCCATTAAC ACGTACAATA GAAGAAATTG AAGACTTATT CTTAGGTTTA GGTTATGAAA 156 0 

TTGTGAATGG ATATGAAGTT GAACAAGATC ATTATAACTT CGAAATGCTG AATTTACCTA 162 0 

AATCACACCC TGCACGTGAT ATGCAAGATA GTTTCTATAT TACGGATGAA ATTTTATTAC 168 0 

GTACGCATAC ATCACCAGTG CAGGCACGTa CGATGGAATC ACGTCATGGT CAAGGTCCAG 174 0 

TTAAAATTAT TTGCCCTGGT AAAGTGTATC GTCGTGACTC TGATGATGCG ACACATAGTC 1800 

ATCAATTTAC ACAAATCGAA GGATTAGTTG TTGATAAAAA CGTTAAAATG AGTGATTTGA 186 0 

«mvjvin^iii <-kwrvi. x x \j x x vjuinrturtrtni iniiijuivjv, ivjniv.^iunn AiicuiiiML i?zu 

GTCCAAGTTA CTTCCCATTC ACTGAACCTT CTGTAGAAGT TGATGTGTCA TGTTTTAAAT 198 0 

GTAAAGGAAA AGGTTGTAAT GTGTGTAAAC ACACAGGATG GATTGAAATT TTAGGTGCTG 2 04 0 

GAATGGTACA 7CCTAATGTA TTAGAAATGG CTGGTTTTGA TTCTTCAGAG TACTCTGGAT 2100 

TTGCATTTGG TATGGGACCA GACCGTATTG CAATGTTGAA ATATGGTATA GAAGATATTC 216 0 

40 GTCATTTCTA TACTAATGAT GTGAGATTTT TAGATCAATT TAAAG CGGTA GAAGATAGAG 222 0 

GTGACATGTA ATGTTGATAT CAAATGAATG GTTGAAAGAA TATGTAACAA TCGATGATTC 22 8 0 

TGTAAGTAAT TTGGCAGAAC GTATTACGCG CACAGGTATT GAAGTGGATG ATTTAATTGA 234 0 

4t CTACACAAAA GATATCAAAA ATTTAGTTGT CGGCTTCGTT AAGTCAAAAG AGAAACATCC 24 0 0 

TGATGCTGAT AAATTAAATG TTTGCCAAGT TGATATCGGA GAAGACGAAC CTGTACAAAT 24 6 0 

CGTTTGTGGT GCACCGAACG TTGaTGCAGG ACAATATGTC ATTGTTGCTA AAGTAGGTGG 2 52 0 

50 

CAGATTGCCT GGTGGTATTA AAATTAAGCG TGCCAAATTA CGCGGTGAAC GTTCAGAAGG 2 5 80 
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GATACGTTAT GTCATAGCTA TTTTAGTAGT TGTATTAATG GTGTTGGGTG TTTTCCAATT 2 17 BO 

AGGAATAATA GGTCGTCTAA TTGACAGCTT CTTTAATTAT TTATTTGGGT ACAGTAGATA 21840 

5 TTTAACATAT ATTTTAGTAC TCTTAGCAAC TGGTTTTATT ACATACTCTA AACGTATTCC 21900 

TAmaACTAGA CGAACGGCTG GTTCGATTGT ATTGCAAATT GCATTGCTAT TTGTATCACA 21960 

GTTAGTTTTT CATTTTAATA GTGGTATCAA AGCTGAAAGA GAACCTGTAC TTTCTTATGT 22 020 

w 

GTATCAGTCA TACCAACACA GTCATTTCCC AAATTTTGGT GGCGGTGT AT TAGGCTTTTA 22080 

TTTATTAGAG TTAAGCGTAC CTTTAATTTC ATTATTTGGT GTATGTATTA TTACTATTTT 2214 0 

15 ATTATTATGC TCAAGTGTTA TTTTATTAAC AAACCATCAA CATCGTGAAG TTGCAAAAGT 22200 

TGCACTGGAA AATATAAAAG CTTGGTTTGG TTCATTTAAT GAA 22243 

(2) INFORMATION FOR SEQ ID NO: 165: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5510 base pairs 
£B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

TTATTAATnA TTAATATTTT TATTTTTAAA AATAAAGCGA GGAGCTATCA ATGGAACAAA 6 0 

30 

TTACTTCTGC ACAAAATAAT AGAATTAAAC AAGCGAACAA GCTAAAAmAG AAACGTGAGA 120 

GGGATAAAAC TGGATTAGCT TTAATTGAAG GTGTGCATTT AATTGAAGAA GCTTATCAAA 18 0 

35 GTGGAATTGT AATTACACAA TTATTTGCAA TTGAACCGGC AAGATTAGAT CAGCAAATTA 24 0 

wCGCATACGC GCAAGAAGTT TTTgAAATAA ACATGAAAGT TGCTGAATCT TTATCAGGTA 300 

CAGTfiACACC ACAAGGGTTT TTCGCAATCA TTGAGAAGCC GCATTATGAT ATTTCTAAAG 360 

40 CACAACAAGT ATTGCTCATC GATCGTGTTC AAGATCCTGG AAATTTAGGC ACATTAATTA 42 0 

GAACTGCGGA TGCTGCTGGA ATGGATGCTG TAATAATGGA GAAGGGTACG ACAGATCCTT 4 80 

ATCAAGATAA AGTGTTGCGA GCGAGTCAAG GTAGTGTTTT CCATTTGCCA GTTATGACAC 540 

45 

AAGATCTCGA TACGTTTATT ACTCAATTTA ATGGTCCTGT TTATGGTACA GCACTTGAAA 600 

ACGCAGTGgC ATACAAAGAA GTTACTTCAA GTGATTCTTT TGCATTACTA TTAGGTAATG 66 0 

AGGGAGAAGG TGTTAATCCT GAATTATTAG CACATACTAC ACAAAATTTA ATCATACCTA 720 

50 

TTTATGGTAA AGCTGAAAGT TTAAATGTAG CGATTGCAGG TAGTATTTTA CTTTATCATT 780 

TGAAAGGTTG ACCGTGTTGA AAGTTTTCCG ATATAATTAT AATTAATTGT TTAACAGAAC 84 0 

55 
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CCTTACACAC GG A CATGAGC ACGCGATTGG TGCAGTGAGT TATGTTTTAG AACAATTAGA 19980 

TGCACCAGTA TATGGATCTA AATTGACAAT AGCGTTAATT AAAGAAAATA TGAAAGCCCG 2 0040 

TAATATTGAT AAAAAAGTTC GCTACTATAC AGTTAATAAT GATTCAATTA TGAGATTCAA 2 010 0 

AAACGTGAAT ATTAGTTTCT TTAATACGAC ACACAGTATT CCTGATAGTT TAGGTGTTTG 20160 

TATTCACACT TCATATGGTG CCATTGTGTA TACAGGTGAA TTTAAGTTTG ACCAAAGTTT 2 0220 

ACATGGACAT TATGCACCAG ATATTAAACG TATGGCAGAG ATTGGTGAAG AAGGCGTATT 2 0280 

TGTCTTAATC AGTGATTCTA CTGAGGCAGA GAAACCTGGA TATAATACTC CGGAAAATGT 2 0340 

GATTGAACAT CATATGTATG ATGCTTTTGC AAAAGTGCGA GGTCGCTTGA TAGTTTCATG 2 0400 

TTATGCTTCG AACTTTATAC GTATTCAGCA AGTTTTAAAT ATTGCTAGCA AGCTAAATCG 20460 

TAAAGTGTCA TTTTTAGGAA GATCACTTGA AAGTTCATTT AATATTGCTC GTAAAATGGG 2 0520 

GTATTTCGAC ATTCCTAAAG ATTTGCTAAT TCCTATAACA GAACTTCATA ATTATCCTAA 2 0530 

AAATGAAGTG ATAATTATAG CTACTGGTAT GCAAGGAGAA CCTGTAGAAG CCTTAAGTCA 2 0640 

AATGGCGCAA CATAAGCATA AAATTATGAA TATCGAAGAA GGCGATTCTG TATTTTTAGC 2070 0 

AATTACGGCT TCTGCTAATA TGGAAGTTAT CATTGCGAAT AcATTAAATG AGCtTgTtAC 2 0760 

GnCTGGCGCA CATATTATTC CAAATAACAA AAAGATTCAT GCTTCAAGTC ATGGTTGCAT 20820 

GGAAGAATTA AAAATGATGA TTAATATTAT GAAACCTGAA TACTTTATTC CTGTACAAGG 20 880 

TGAATTTAAA ATGCAGATAG CACATGCGAA GCTAGCAGCT GAAGCAGGTG TTGCACCAGA 2 0 940 

AAAGATTTTC CTTGTGGAAA AAGGAGATGT CATTAATTAC AACGGTAAAG ATATGATATT 21000 

35 AAATGAAAAG GTAAATTCAG GAAATATTTT AATAGATGGC ATTGGTATTG GGGATGTAGG 21060 

AAATATCGTG TTGAGAGACC GTCATCTTTT AGCAGAAGAT GGTATCTTTA TTGCTGTTGT 2112 0 

AACGTTAGAT CCTAAAAATA GACGTATAGC TGCGGGACCT GAAATTCAAT CTCGTGGGTT 21180 

TGTATATGTA CGTGAAAGTG AAGACTTATT ACGTGAAGCA GAAGAGAAAG TACGTGAAAT 21240 

AGTAGAGGCT GGTTTACAAG AAAAACGCAT AGAATGGTCT GAAATTAAAC AAAATATGCG 213 0 0 

TGATCAAATT AGTAAACTAT TATTCGAAAG TACAAAACGT CGTCCTATGA TTATTCCAGT 213 6 0 

AATTTCTGAA ATTTAATCAA AAAGTCATTA ACATAAAAGA GGTCAGAACA AGTCACTGAA 214 2 0 

ATATAATGGT TGTCATGGAC AATTTACTTA TATTTTATGA TAGTCAATTG AAGGGGTAAC 214 8 0 

GATTAATCTG TTATCTTAAG TAAATTGATA CATAGATGAT ATTGTTCTAA CCTCTTTCAT 21540 

CGTCTGTTTG GACTACATAT TCTAAACATC AAATAGGAAA TTATATATAA TAACGTCGTT ? 7 * n n 
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GAAGAAAAAG GACTTAAAGA AACAGTTTTA ACATTTGATA AACAACAACG AGATGaAAAT 18180 

CTTGATAACT TAAAAGAAGA AATCGTCAAT GAATTTATCG ATGAAGAAGA TCCAGAGAAT 18240 

5 GAaTTACTTA TTAAAGAAGT TTATGCAATT TTAAATGAAT TAGTGAAAGA AGAAGTTCGA 183 00 

CGTTTAATTG CAGATGAAAA AATTAGACCA GACGGCCGTA AACCTGATGA AATCCGTCCA 18360 

TTAGATTCTG AAGTTGGTAT TTTACCTAGA ACGCATGGTT CAGGTCTATT TACACGTGGT 18420 

w 

CAGACTCAAG CACTTTCAGT TTTAACATTA GGTGCTTTAG GCGATTATCA ATTAATTGAT 18480 

GGTTTAGGAC CTGAAGAAGA AAAAAGATTC ATGCATCATT ACAACTTCCC GAATTTTTCA 18540 

75 GTAGGTGAAA CTGGTCCAGT ACGTGCGCCA GGTCGTCGTG AAATTGGACA TGGTGCGTTA 18600 

GGTGAAAGAG CATTAAAATA TATTATTCCT GATACTGCTG ATTTCCCATA TACAATTCGT 18660 

ATTGTAAGTG AGGTACTTGA ATCAAATGGT TCATCATCTC AAGCGTCAAT TTGTGGATCA 18720 

20 ACATTAGCAT TAATGGATGC GGGCGTACCG ATTAAAGCAC CAGTTGCTGG TATTGCTATG 18780 

GGCCTTGTTA CACGTGAAGA TAGCTATACG ATTTTAACTG ATATCCAAGG TATGGAAGAT 18840 

GCATTAGGTG ATATGGACTT TAAAGTCGCT GGTACTAAAG AAGGTATTAC AGCAATCCAA 18900 

25 

ATGGATATTA AAATTGACGG TTTAACGCGT GAAATTATCG AAGAGGCTCT AGAACAAGCG 18960 

AGACGTGGTC GTTTAGAAAT AATGAATCAT ATGTTACAAA CAATTGATCA ACCACGTACT 19020 

GAATTAAGTG CTTACGCGCC AAAAGTTGTA ACTATGACAA TTAAACCAGA TAAGATTAGA 19080 

30 

GATGTTATCG GACCTGGTGG TAAAAAAATT AACGAAATTA TTGATGAAAC AGGTGTTAAA 19140 

TTAGATATTG AACAAGATGG TACTATCTTT ATTGGTGCTG TTGATCAAGC TATGATAAAT 19200 

35 CGTGCTCGTG AAATCATTGA GGAAATTACA CGTGAAGCGG AAGTAGGTCA AACTTATCAA 19260 

GCCACTGTTA AACGTATTGA AAAATACGGT GCGTTTGTAG GCCTATTCCC AGGTAAAGAT 1932 0 

GCGTTGCTTC ACATTTCACA AATTTCAAAA AATAGAATTG AAAAAGTGGA AGATGTATTA 19380 

40 

AAAATCGGTG ACACAATTGA AGTTAAGATT ACTGAAATTG ATAAACAAGG TCGAGTAAAT 19440 

GCTTCACATA GAGCATTAGA AGAATAATAT TTAAAGTCAT ATGACGACAA TGTATCGTCA 19500 

TGTGATTTTT TTATGCCACT TTTTACGAAG TGACCCGTTT TGAATTTGTT GTATTGAACA 19560 

45 

TTTTAAAACG CTTTATTATT TTGTGTGCAA CTGTTAATTA TCCTGTATGT ATAGTGATTA 19620 

ATAGTGTACA TCAAGTGTTT TTTAACTTAT AATGAATAGT GAGTTTATAT ATGGACGGGT 19680 

SO AACAAATTTA GGAGGTAAGA TTTTGAGTTT AATAAAGAAA AAGAATAAAG ATATTCGCAT 19740 

TATACCATTA GGCGGTGTTG GCGAAATTGC TAAAAATATG TATATCGTTG AAGTAGACGA 19 800 

TGAAATGTTT ATGTTAGATG CTGGACTTAT GTTTCCAGAA GACGAAATGC TAGGTATTGA 19860 
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TGCTGTTAGT ATTGAAATCG GCACTGAAAA TAAATTATAT CGAGGGGTAG CTAACATAGG 16 3 80 

TGTAAAGCCA ACATTTCATG ATCCTAACAA AGCAGAAGTT GTCATCGAAG TGAATATCTT 16440 

5 

TGACTTTGAG GATAATATTT ATGGTGAACG AGTGACCGTG AATTGGCATC ATTTCTTACG 16 500 

TCCTGAGATT AAATTTGATG GTATCGACCC ATTAGTTAAA CAAATGAACG ATGATAAATC 16 560 

GCGTGCTAAA TATTTATTAG CAGTTGATTT TGGTGATGAA GTAGCTTATA ATATCTAGAG 16 620 

w 

TTGCGTATAG tTATATAAAC AATCTATACC ACACCTTTTT CTTAGTAGGT CGAATCTCCA 16680 

ACGCCTAACT CGGATTAAGG AGTATTCAAA CATTTTAAGG AGGAAATTGA TTATGGCAAT 16 740 

15 TTCACAAGAA CGTAAAAACG AAATCATTAA AGAATACCGT GTACACGAAA CTGATACTGG 16 8 00 

TTCACCAGAA GTACAAATCG CTGTACTTAC TGCAGAAATC AACGCaGTAA ACGAACACTT 16 860 

ACGTACACAC AAAAAAGACC ACCATTCACG TCGTGGATTA TTAAAAATGG TAGGTCGTCG 16 920 

20 TAGcATTTaT TAAACTACTT ACGTaGTAAA GATATTCAAC GTTACCGTGA ATTAATTAAA 16 98 0 

TCACTTGGTA TCCGTCGTTA ATCTTAATAT AACGTCTTTG AGGTTGGGGC ATATTTATGT 17 040 

TCCAACCTTA ATTTATATTA AAAAAGCTTT TTACAAATAT TAACATTTAT TATATGTTAA 17100 

25 GCTAATATTG AGTGAATAAT AAGGTTACAA TGAGATAAAG ATGATATAAG TACACCTAGA 17160 

GTAATAATCA AGATATTAAA AATAAAGTAT GTTTTTTTAA AAAATATAAC TTATATTTAT 17220 

ACTGATAAGG GTGGGACGAT AAGTCTATTT TGTAAATAAT AGATGGATAT CCCGCTCTCT 17280 

30 

TTTTTTCCAA TTCAATATTT TATAACTAAT ATTAAAATAC GATAATAAAT GATATGATAT 17340 

AACTATTAGA TTCAAGAGAG GAGATTTATA ATGTCTCAAG AAAAGAAAGT TTTTAAAACT 17400 

GAATGGGCAG GAAGATCTTT AACGATTGAA ACAGGGCAAT TAGCTAAACA AGCAAATGGC 17460 

35 

GCTGTATTGG TTCGTTATGG AGATACAGTC GTGTTATCGA CGGCAACTGC ATCAAAAGAA 17520 

CCTCGTGATG GAGATTTCTT CCCATTAACA GTGAACTATG AAGAAAAAAT GTACGCTGCG 17580 

GGTAAAATTC CTGGTGGATT TAAAAAGAGA GAAGGACGTC CTGGTGACGA TGCAACATTA 17640 

40 

ACTGCGCGAT TAATTGATAG ACCAATTAGA CCTTTATTCC CTAAAGGATA TAAGCATGAT 17700 

GTTCAAATTA TGAACATGGT ATTAAGTGCA GATCCTGATT GTTCACCACA AATGGCTGCA 17760 

45 ATGATTGGTT CATCTATGGC GCTTAGTGTG TCGGATATTC CATTCCAAGG GCCAATCGCC 17820 

GGTGTAAATG TGGGTTATAT TGACGGTAAA TATATCATTA ACCCAACAGT AGAAGAAAAA 17880 

GAAGTTTCTC GTTTAGACCT TGAAGTAGCT GGTCATAAAG ATGCGGTAAA CATGGTAGAG 17 940 

50 GCAGGCGCTA GTGAGATTAC TGAACAAGAA ATGTTAGAGG CGATTTTCTT TGGTCATGAA 18000 

GAGATTCAAC GTTTAGTTGA TTTCCAACAA ^AAATrrvrrG Ar^A<~ATT^A A^^^TYV^AA * - q n p ^ 
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AAGATTTACA CAAACAAGAT AGATAATTTA 
TTCTTAATAT CGGTATATTA ACATTAAACA 
5 CATTTTCCAG TTTTTTTATG AATAAATTTA 

AAGAAGGTGA CTATATGTAT AATGGGATAT 
GTCATGACGT TGTATTCAAA TTGCGTAAAA 

10 

GTACGCTTGA TCCCGAAGTT GCAGGCGTGT 
TTAGTGATTA TGTTATGGAT ATGGGCAAAG 
GTACAACGAC TGAAGATCAA ACGGGTGATA 

15 

ATTTTAATAA GGACGATATT GACCGATTGT 
TTCCGCCGAT GTACTCATCC GTCAAAGTAA 
ATAATGAAAC AGTTGAAAGA CCAAAGCGTA 

20 

CTGAATTAGA TTTTAAAGAA AATGAGTGTC 
GTACATATAT TAGAACGCTA GCAACTGATA 

25 TGTCGAAATT AACACGAATC GAGTCTGGTG 

AACAAATAAA AGAACTTCAT GAGCAGGATT 
ATGGATTAAA GGGTTTGCCA AGCATTAAAA 

30 TAAATGGGCA GAAATTTAAT AAAAATGAAT 

TTATTGATGA TGATTCAGAA AAAGTATTAG 
CAGAAATTAA ACCTAAAAAA GTCTTTAATT 

35 AGtGACACAT CCTATACAAT CTAAACAGTA 

ATTTTTCGAT GGCATGCATA AAGGTCATGA 
TGAGGCACGC AGTTTAAAAA AAGCGGTGAT 

40 

GAATCCTAAA AGAAAACGAA CAACGTATTT 
TAGCCAACAT GATATTGATT ATTGTATAGT 
GAGCGTAGAA GATTTTGTTG AAAATTATAT 

45 

TGGTTTTGAT TTTACTTTTG GTAAATTTGG 
TGATGCGTTT AATACGACAA TTGTGAGTAA 
5Q AACTTCTATT CGTCAAGATT TAATCAATGG 

CTATATATAT TCTATTAAAG kCACTGTAGT 



GTGTTAGGTA TCTGGAAAAT GTTTGATAAT 14 580 

GTTAATACAT AGATGTGTAG AAATAGTTAA 14 640 

GTTGATACGC TATTAAAATA TATTTTAAAA 14 70 0 

TACCAGTATA TAAAGAGCGC GGTTTAACAA 14 76 0 

TATTAAAAAC TAAAAAAATA GGTCACACGG 14 82 0 

TACCGGTATG TATAGGTAAT GCAACGAGAG 14 880 

CTTATGAAGC AACTGTATCG ATAGGAAGAA 14 940 

CATTGGAAAC AAAAGGTGTA CACTCAGCAG 15000 

TAGAAAGTTT TAAAGGTATC ATTGAACAAA 15060 

ATGGTAAAAA ATTATATGAA TATGCGCGTA 15120 

AAGTtAATAT TAAAGACATT GGGCGTATAT 15180 

ATTTTAAAAT ACGCGTCATC TGTGGTAAAG 15240 

TTGGTGTGAA ATTAGGCTTT CCGGCACATA 15300 

GATTTGTGTT GAAAGATAGC CTTACATTAG 153 60 

CATTGCAAAA TAAATTGTTT CCTTTAGAAT 154 20 

TTAAAGATTC GCACATAAAA AAACGTATTT 15480 

TTGATAACAA AATTAAAGAC CAAATTGTAT 15540 

CAATTTATAT GGTACACCCT ACAAAAGAAT 156 0 0 

AAAGGAGATA GAATTTATGA AAGTCATAGA 1566 0 

TATTACAGAG GATGTTGCAA TGGCATTCGG 15720 

CAAAGTCTTT GATATATTAA ACGAAATAGC 1578 0 

GACATTTGAT CCGCATCCGT CTGTCGTGTT 15840 

AACGCCACTT TCAGATAAAA TCGAAAAAAT 15900 

GGTTAATTTT TCATCTAGGT TTGCTAATGT 1596 0 

AATTAAAAAT AATGTAAAAG AAGTCATTGC 16020 

AAAAGGTAAT ATGACTGTAC TTCAAGAATA 16080 

ACAAGAAATT GAAAATGAAA AAATTTCTAC 1614 0 

TGAGTTGCAA AAAGCGAATG ATGCTTTAGG 16200 

GCAAGGTGAA AAAAGGGGAA GAACTATTGG 1626 0 
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GCGGCGAAAC AATTtTCGTc CACTTTCTGC ATTAAGTGGT GATGGTATCG ACGATTTATT 12 7 80 

AGAAATGATA GGATTAGTTG CAGAAGTTCA AGAACTTAAA GCAAATCCTA AAAACCGTGC 12 840 

TGTTGGTACA GTTATCGAAG CTGAATTAGA TAAATCACGT GGTCCTTCTG CATCATTATT 12 900 

AGTACAAAAC GGTACATTAA ATGTTGGTGA TGCGATTGTA GTTGGTAATA CTTACGGCCG 12 960 

TATCCGTGCA ATGGTTAATG ACTTAGGTCA AAGAATCAAA ACGGCTGGTC CATCAACGCC 13 020 

TGTTGAAATT ACAGGTATTA ATGATGTGCC ACAAGCTGGG GATCGCTTTG TTGTATTTAG 13 080 

TGATGAAAAA CAAGCTCGTC GTATTGGTGA ATCAAGACAC GAAGCTAGCA TTATACAACA 1314 0 

ACGTCAAGAA AGTAAAAATG TTTCATTAGA TAACCTGTTT GAACAAATGA AACAAGGTGA 13 20 0 

AATGAAAGAT TTAAACGTTA TTATTAAAGG TGATGTTCAA GGTTCTGTTG AAGCTTTAGC 1326 0 

TGCATCATTA ATGAAAATTG ATGTTGAAGG CGTAAATGTT CGTATCATTC ATACAGCGGT 133 2 0 

TGGTGCAATT AATGAGTCAG ACGTGACACT TGCTAATGCC TCAAATGGTA TTATCATTGG 133 8 0 

TTTCAATGTT CGTCCAGACA GTGGTGCAAA ACGTGCTGCA GAAGCTGAAA ATGTTGATAT 13440 

GCGTTTACAC AGAGTTATTT ATAATGTTAT CGAAGAAATT GAATCAGCGA TGAAAGGTTT 13 50 0 

25 ACTTGATCCA GAATTTGAAG AACAAGTTAT CGGACAAGCT GAAGTTCGTC AAACATTCAA 13 56 0 

AGTTTCTAAA GTTGGTACTA TTGCTGGATG TTATGTTACT GAAGGTAAAA TTACGCGAAA 13 620 

TGCTGGTGTA CGTATTATTC GTGATGGTAT TGTTCAATAT GAAGGCGAAT TAGATACACT 13 6 80 

30 TAAACGTTTC AAAGATGATG CTAAGGAAGT TGCAAAAGGT TATGAATGTG GTATTACAAy 13 740 

TGAAAACTAC AATGACCTTA AAGAAGGCGA TGTTATCGAA GCATTTGAAA TGGTTGAAAT 13 800 

TAAGCGTTAA TTAAATAAAT TACAAGCTAA AAGTATAGTT AAGATTGATA TGCTCCCTAT 13 86 0 

AAATATTGCA CTTTTTAAGT GTCTACTTTA TAGGGAGCAT ATTTGATACT AGCTTTTGGT 13 920 

TTTFTATTAG AATAGATTAC CTATTAAAAG TTACGTTATA TGGACATGAT TTTGTATAAA 13 98 0 

ATTTTGTGGT GGCCTAGAAT GATTTTTAAT GACAAAATAT AATGTCGACT ATTATTGGAA 14 04 0 

AATTTTCTGT TGaAATGCCT ATCTTACGGC AAACTTTATT TGATTTTATA GGCTTAATTT 14 100 

ATTAAAATAA CGTGTGAGCT AAAATAATTG TTTAAGCATT GTTACACTAA AAAATGCAAA 14160 

TAACAATTGA ACTTAAAGAT AAAGAGGTGA CAAGAATGAG CAGTATGAGA GCAGAGCGTG 142 20 

TTGGTGAACA AATGAAGAAG GAATTAATGG ATATCATCAA CAATAAAGTC AAAGATCCTC 14 2 80 

GAGTTGGTTT TATTACAATT ACAGATGTTG TTTTAACAAA TGATTTATCG CAGGCTAAAG 14 34 0 

TATTTTTAAC TGTATTAGGT AACGATAAAG AAGTAGAAAA TACATTTAAA GCACTTGATA 144 00 

AAGCAAAAGG CTTCATTAAG TCTGAATTAG GTTCTAGAAT GCGATTACGT ATTATGCCGG : 4 4 * n 



35 



40 



50 
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GATGCGCGTG AAGCGGGTAT CTATCCAGTA 
GTTGCTTTAG AAGATGCTGA CACAACAGAA 
5 GAAACAAATG TAGAGAAAGA ATCTGAATAA 

AAAAAATTCC GATGCGAAAA TGTATTCTTT 
TTCGTGTTGT TGTTAATAAA GAAGGCGAAA 

10 

GCCGTGGCGC ATATGTTTCT AAAGATGTTG 
TTTTAGAAAA ATATTTTAAA GCATCTAAAG 
TTAGATTAAT TTATAGAGAA GAGATCCCAA 

15 

AGGATTAGCA ATGAGAGCTG GTAAAGTAAA 
TAAAAAAGGA AATTTGAAGC TCGTTATTGT 
ATTAATTACA GATAAATGTA AGAGTTACAA 

20 

TGAATTGGGA ATAGCACTTG GAAAAGGTGA 
CTTTGCTAAA AAGTTGCTAT CAATGATAGA 

25 AACAAAGAAT TTACGAATAT GCGAAAGAAT 

AGTTAAAAAG CATGAATATT GAGGTTTCAA 
TTAAAGCATT AGATAAAAAG TTCAAAAAAG 

30 AAAATAATCA CCAAAAATCA AACAATCAAA 

AAAAGAATCm ACAACAAAAT AATAAAGGCA 
ATAAGAAAAA TAACAAGAAT AATAAACCAC 

35 CATCAAAAGT GACATATCAA GAAGGTATTA 

TTGAATCATC AGAAATTATC AAAAAATTAT 
AATCATTAAA TCAAGAAACA ATCGAATTAA 

40 

AAGAAGTTGT GATTAATGAA GAAGACTTAT 
CAGAGGCAAT TGAGAGACCA GCAGTTGTAA 
CGACTTTATT AGATTCAATT CGTCATACAA 

45 

CTCAACATAT TGGTGCATAT CAAATTGAAA 
CACCGGGACA TGCTGCATTT ACAACGATGC 
5Q CTATTTTAGT AGTAGCAGCT GACGATGGTG 

ATGCTAAAGA AGCAgAAGTA CCAATTATTG 



GTTGAAGCTG AAAAAGTAAC TGAAGAAGAT 10980 

TCAACCGAAG AGGTAAATGA TGTTTCAGTT 11040 

TAGGTTGGAG TGAAGTATCT ATGAAAAAGA 1110 0 

CAAATGAAAT GCATCCCAAA AAAGATATGA 1116 0 

TCTTTGCGGA TGTTACTGGA AAGAAACAAG 1122 0 

CTATGGTTGA AAAAGCACAA CAAAAAGAAA 1128 0 

AGCAATTGGA TCCTGTTTAC AAAGAAATTA 1134 0 

AATGAGTATA GATCAAATAT TAAACTTTTT 114 00 

AACAGGTGAA TCAGTCATTG TTAATGAGAT 114 60 

TGCAAATGAT GCGTCTGATA ATACAGCTAA 11520 

AGTTCCATTC AGAAAGTTTG GAAATCGAAA 11580 

GCGTGTTAAT GTAGGGATTA CTGACCCAGG 11640 

TGAATATCAT AAGGAGTGAT TATATGAGTA 11700 

TAAATCTAAA GAGTAAAGAG ATTATAGATG 11760 

ATCATATGCA AGCTTTGGAA GATGACCAAA 11820 

AACAAAAGAA CGACAATAAA CAAAGCACTC 1188 0 

ACCAAAATAA AGGGCmACAA AAAGATAACA 11940 

ACAAAGGCAA TAAAAAGAAT AATAGAAATa 120 00 

AAAATCAACC AGCTGCTCCA AAAGAAATAC 1206 0 

CAGTAGGCGA ATTTGCGGAT AAATTAAATG 12120 

TCTTACTTGG TATTGTTGCT AATATCAATC 12180 

TTGCCGATGA TTATGGCGTT GAGGTTGAAG 12240 

CAATCTATTT CGAAGACGAA AAAGATGATC 123 00 

CAATTATGGG ACATGTTGAC CATGGTAAAA 12360 

AAGTTACAGC AGGTGAAGCA GGCGGAATCA 12420 

ACGATGGCAA AAAAATCACT TTCTTAGATA 12480 

GTGCGCGTGG TGCaCAAGTA ACAGATATTA 12540 

TTATGCCACA AACAATTGAA GCAATTAACC 126 00 

TTGCAGTAAA TAAAATTGAT AAACCAACTT 126 60 
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GAAATAATCA AGGTATTTAT TTAATGCGTA 
CTGGACACAA AATTATGCCC GTATTTCTTT 
5 TGGTGAAATA AAGGAACAAA CTTTTACAAG 

TCAAGCATAA ACTTATGCTA TAATTAAGTT 
GTGGGAGATT CCCGCTCTTT TCTATTTGCC 

10 

AGAACAAGTA GAAGTGATTG TTAAACCAAT 
CGTTGAATAT GTCAAAGAGG GTAGAGATCA 
TGGCGTAGAT TTAAATGATT GTACGCTAGC 

15 

AAATGATCCT ATTCCTGAAA TGTATTATTT 
AATTAAAAAA GAACAAGATT TCCAAAATGC 

2Q TGTACCAATT GAAGGTGAAA AGGAATGGTT 
AATTGTAGTA CAAGTTAAAA TCAAAGCAAG 
AATAGCAAAA GCACGTCACG CAGTTATGAT 

25 AGTAATGAAT TATTATTAGC TACTGAGTAT 
GTATTAATTG ATGCTATTGA AGCAGCTTTA 
GCAAGAAATG TCCGTGTGGA ATTAAATATG 

30 AAAGATGTTG TTGAAGAAGT ATTTGACGAC 
GTTAAAAACC CTGCATATGA AATTGGTGAT 

TTTGGTCGTG TAGGTGCTCA AGCAGCGAAA 

35 

GAACGTGAAA TTTTATTTGA AGAATTTATA 

ATTGACCGTG TTGACCATCG TTATGTATAT 

TCTGAAGCAG AAAGAAGTCC TAACGAAAAA 

40 

GTTAACAAAG TGGAACAAAC GACAAAAGGT 
GGTTTATTAA AACGTTTATT TGAACAAGAA 
GTTAAATCAG TAGCACGTGA AGCTGGCGAT 

45 

AATGATATAG ATGCTGTTGG TGCATGTGTT 
GTTGAAGAGC TAGGTGGTGA AAAAATCGAC 
50 TTTGTAAAAA ATGCTTTAAG CCCTTCTCAA 

CAATCTACAG TAGTTGTTGT TCCTGATTAT 



TGGCGTAGTC AAAGAAATAC AAAATTGTTG 918 0 

TCAATGTCTT ACGAGTCTAT TCAAATGTAA 924 0 

AATCTCTGAT TAATAGTGAA GTCATTTGTT 93 0 0 

GCTTAAAAAT TAGTGAACTC AGGCAGAAGA 936 0 

AAAAAGGGAG GCCTGTATGA GTAAAATTAC 9420 

TATGGAAGAC TTGAATTTTG AACTTGTAGA 94 8 0 

TTTTCTTAGA ATCT CTATTG ATAAAGAAGG 954 0 

TTCTGAAAAA ATAAGTGAAG CTATGGATGC 96 0 0 

AGACGTAGCG TCACCTGGTG CAGAACGTCC 96 60 

AATAACTAAA CCTGTATTTG TTTCTTTATA 972 0 

AGGCATTTTA CAAGAAGTCA ATAATGAAAC 97 80 

AACGAAAGAT ATAGAGATAC CGAGAGACAA 984 0 

TTAACGTGAT GAGGAGGAAA AAACGTGTCA 9900 

TTAGAAAAAG AAAAGAAGAT TCCTAGAGCA 996 0 

ATTACTGCAT ACAAAAAGAA TTATGATAGT 10020 

GATCAAGGTA CTTTCAAAGT TATCGCTCGT 10080 

AGAGATGAAG TGGATTTAAG TACAGCGCTT 1014 0 

ATATACGAAG AAGATGTAAC ACCTAAAGAT 10200 

CAAGCAGTAA TGCAACGTCT TCGTGATGCT 10260 

GACAAAGAAG AAGACATACT TACTGGAATT 10320 

GTGAATTTAG GTCGTATCGA AGCTGTTTTA 10380 

TATATTCCTA ACGAACGTAT CAAAGTATAT 10440 

CCTCAAATCT ATGTTTCTCG TAGCCATCCA 10500 

GTTCCAGAAA TTTACGATGG TACTGTAATT 10 560 

CGCTCTAAAA TTAGTGTCTT CTCTGAAAAC 10 620 

GGTGCTAAAG GCGCACGTGT TGAAGCTGTT 106 8 0 

ATCGTTCAAT GGAATGAAGA TCCAAAAGTA 10740 

GTTTTAGAAG TTATTGTTGA TGAAACAAAT 108 00 

CAATTGTCAT TAGCG ATTGG TAAAAGAGGA 10 8 6 0 
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CTGCCTCAAA TCGTAATTGA TCGATTAGAA AAAGAATTAA AAAGTATTAT CGGTAATGGA 73 80 

TTTGCGGTAA TTTACTTAAT TTCGCAACGT TTAGTTAAAA AATCATTAGA TGATGGATAC 744 0 

TTAGTTGGTT CCCGTGGTTC AGTAGGTTCT AGTTTTGTAG CGACAATGAC TGAGATTACT 7500 

GAAGTAAACC CGTTACCGCC ACACTATATT TGTCCGAACT GTAAAACGAG TGAATTTTTC 75 6 0 

AATGATGGTT CAGTAGGATC AGGATTTGAT TTACCTGATA AGACGTGTGA AACTTGTGGA 7620 

GCG CCACTTA TTAAAGAAGG ACAAGATATT CCGTTTGAAA CATTTTTAGG ATTTAAGGGA 768 0 

GATAAAGTTC CTGATATCGA CTTAAACTTT AGTGGTGAAT ATCAACCGAA TGCCCATAAC 774 0 

TACACAAAAG TATTATTTGG TGAGGATAAA GTATTCCGTG CAGGTACAAT TGGTACTGTT 7800 

GCTGAAAAGA CTGCTTTTGG TTATGTTAAA GGTTATTTGA ATGATCAAGG TATCCACAAA 7860 

AGAGGTGCTG AAATAGATCG ACTCGTTAAA GGATGTACAG GTGTTAAACG TACAACTGGA 792 0 

CAGCATCCAG GGGGTATTAT TGTAGTACCT GATTACATGG ATATTTATGA TTTTACGCCG 79 80 

ATACAATAT C CTGCCGATGA TCAAAATTCA GCATGGATGA CGACACATTT TGATTTCCAT 8 04O 

TCTATTCATG ATAATGTATT AAAACTTGAT ATACTTGGAC ACGATGATCC AACAATGATT B100 

25 CGTATGCTTC AAGATTTATC AGGAATTGAT CCAAAAACAA TACCTGTAGA TGATAAAGAA 8160 

GTTATGCAGA TATTTAGTAC ACCTGAAAGT TTGGGTGTTA CTGAAGATGA AATTTTATGT 8220 

AAAACAGGTA CATTTGGGGT ACCAGAATTC GGTACAGGAT TCGTGCGTCA AATGTTAGAA 82 80 

30 GATACAAAGC CAACAACATT TTCTGAATTA GTTCAAATCT CAGGATTATC TCATGGTACA 834 0 

GATGTGTGGT TAGGCAATGC TCAAGAATTA ATTAAAACCG GTATATGTGA TTTATCAAGT 84 0 0 

GTAATTGGTT GTCGTGATGA TATCATGGTT TATTTAATGT ATGCTGGTTT AGAACCATCA 8460 

ATGGCTTTTA AAATAATGGA GTCAGTACGT AAAGGTAAAG GTTTAACTGA AGAAATGATT 8520 

GAAA6GATGA AAGAAAATGA AGTGCCAGAT TGGTATTTAG ATTCATGTCT TAAAATTAAG 8580 

TACATGTTCC CTAAAGCCCA TGCAGCAGCA TACGTTTTAA TGGCAGTACG TATCGCATAT 864 0 

TTCAAAGTAC ATCATCCACT TTATTACTAT GCATCTTACT TTACAATTCG TGCGTCAGAC 8700 

TTTGATTTAA TCACGATGAT TAAAGATAAA ACAAGCATTC GAAATACTGT AAAAGACATG 8760 

TATTCTCGCT ATATGGATCT AGGTAAAAAA GAAAAAGACG TATTAACAGT CTTGGAAATT 8820 

ATGAATGAAA TGGCGCATCG AGGTTATCGA ATGCAACCGA TTAGTTTAGA AAAGAGTCAG 888 0 

GCGTTCGAAT TTATCATTGA AGGCGATACA CTTATTCCGC CGTTCATATC AGTGCCTGGG 894 0 

CTTGGCGAAA ACGTTGCGAA ACGAATTGTT GAAGCTCGTG ACGATGGCCC ATTTTTATCA 9000 

AAAGAAGATT TAAACAAAAA AGCTGGATTA TCTCAGAAAA TTATTGAGTA TTTAGATGAG 9060 
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GCAATAGAGG GTGTCATTTT TGATATAAAC 
GTAGAAATTA AAGTGACTGA CTATACGGAC 
5 AACAAAGATG ATTTAGAACA TTTTAAAGCG 

GGTCGTATTG AAGAAGATAC ATTTATTAGA 
GAGATTAAAA AAGCGACAAA AAAAGATAAG 

10 

CATACTGCAA TGAGCCAAAT GGATGGTATA 
GCAGACTGGG GACATCCAGC CATTGCGGTT 
GATGCTCACG CAGCAGCGGA AAAACATGGC 

15 

TTAGTTGATG ATGGTGTTCC GATTGCATAC 
ACTTATGTTG TGTTCGACGT TGAGACAACT 

2Q GAGCTTGCAG CTGTGAAAGT TCATAACGGT 

AATCCGCATG AACGATTATC GGAAACGATT 
TTAGTAGATG CCCCTGAGAT TGAAGAAGTA 

25 GCGATATTCG TAGCGCATAA TGCTTCGTTT 

CGTCTTGGGT TTGGACCATC AACGAATGGT 
ATTAATACTG AATATGGTAA ACATGGTTTG 

30 TTAACGCAAC ATCACCGTGC CATTTATGAT 

ATGGTTCAAC AAATGAAAGA ATTAGGCGTA 
AGTAATGAAG ATGCATATAA ACGTGCAAGA 

35 

CAACAAGGTC TTAAAAATCT ATTTAAAATT 
CGTAGACCTC GAATTCCACG TTCATTGTTA 
ACAGCGTGTG ATGAAGGTGA ATTATTTACG 

40 

GAAAAAATTG CCAAATATTA TGATTTTATT 
TTAATTGATA GAGAGCTTAT TAGAGATACT 

ATACATGCAG GTGACACAGC GGGTATACCT 

45 

TTTGAACATG ATGGTATCGC ACGTAAAATT 
AATCGCTCAA CTTTACCGGA AGCACATTTT 
5Q CATTTTTTAG GTGAAGAAAA AGCGCATGAA 

GATCGAATTG AACGTGTTGT TCCTATTAAA 



TTAAAAGAAC TTAAAAGTGG T CG C CAT AT C 5 5 80 

TCTTTAGTTT TAAAAATGTT TACTCGTAAA 564 0 

CTAAGTGTTG GTAAATGGGT TAGGGCTCAA 5700 

GATTTAGTTA TGATGATGTC TGATATTGAA 57 6 0 

GCTGAAGAAA AGCGTGTAGA ATTCCACTTG 5 82 0 

CCCAATATTG GTGCGTATGT TAAACAGGCA 5 8 80 

ACAGACCATA ATGTTGTGCA AGCATTTCCA 5 94 0 

ATTAAAATGA TATACGGTAT GGAAGGTATG 600 0 

AAACCACAAG ATGTCGTATT AAAAGATGCT 6 060 

GGTTTATCAA ATCAGTATGA TAAAATCATC 612 0 

GAAATCATCG ATAAGTTTGA AAGGTTTAGT 618 0 

ATCAATTTGA CGCATATTAC TGATGATATG 624 0 

CTTACAGAGT TTAAAGAATG GGTTGGCGAT 6 300 

GATATGGGCT TCATCGATAC GGGATATGAA 63 60 

GTTATCGATA CTTTAGAATT ATCTCGTACG 6420 

AATTTCTTGG CTAAAAAATA TGGCGTAGAA 64 8 0 

ACAGAAGCAA CAGCTTACAT TTTCATAAAA 654 0 

TTAAATCATA ACGAAATCAA CAAAAAACTC 660 0 

CCTAGTCATG TCACATTAAT TGTACAAAAC 666 0 

GTAAGTGCAT CATTGGTGAA GTATTTCTAC 6720 

GATGAATATC GTGAGGGATT ATTGGTAGGT 678 0 

GCAGTTATGC AGAAGGACCA GAGTCAAGTT 6 84 0 

GAAATTCAAC CACCGGCACT TTATCAAGAT 6 90 0 

GAAACATTAC ATGAAATTTA TCAACGTTTA 6 96 0 

GTTATTGCGA CAGGAAATGC ACACTATTTG 7 02 0 

TTAATAGCAT CACAACCCGG CAATCCACTT 7 08 0 

AGAACTACAG ATGAAATGTT AAACGAGTTT 714 0 

ATTGTTGTGA AAAATACAAA CGAATTAGCA 72 0 0 

GATGAATTAT ACACACCGCG TATGGAAGGT 7 26 0 
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TTTAGTTGTC 


GGTGCTAACG 


AAGATGGTTA 


TCACTTAATT 


AATGTAAATG 


TAGGTAGAGA 


3960 




CTTCAACGTT 


GATGAATATG 


GCGATTTCCG 


TTTTATTTTA 


GAAGGCGAAA 


AGTTAAGTGA 


4020 


10 


TGGTTCAGGC 


GTTGCACATT 


TTGCTGAAGG 


TATTGAAGTT 


GGTCAAGTAT 


TCAAATTGGG 


4080 




TACTAAGTAT 


TCAGAATCAA 


TGAATGCTAC 


ATTCTTAGAT 


AACCAAGGAA 


AAGCTCAATC 


4140 


15 


TTTAATTATG 


GGTTGTTACG 


GAATTGGAAT 


TTCTAGAACG 


CTAAGTGCGA 


TTGTTGAACA 


4200 


AAATCACGAT 


GATAATGGAA 


TTGTTTGGCC 


TAAATCAGTT 


ACTCCGTTTG 


ATTTACATTT 


4260 




AATTTCTATT 


AATCCTAAGA 


AAGATGATCA 


ACGAGAACTA 


GCAGATGCAC 


TATATGCTGA 


4320 


20 


ATTTAATACT 


AAATTTGATG 


TGTTGTACGA 


TGATCGTCAG 


GAACGTGCAG 


GTGTTAAATT 


4380 


TAATGATGCC 


GATTTAATTG 


GTTTACCACT 


GCGAATTGTT 


GTTGGTAAAC 


GTGCATCGGA 


4440 




AGGTATTGTA 


GAAGTTAAAG 


AACGTTTAAC 


AGGTGATAGC 


GAAGAAGTTC 


ACATTGATGA 


4500 


25 


CTTAATGACT 


GTCATTACAA 


ATAAATATGA 


TAACTTAAAA 


TAATTAAGAT 


CGAATGAATT 


4560 




ATAAGAGTAG 


GAAAAAGCTG 


AAAGAAATCT 


GATGCTTATG 


TCCTGCTCTT 


ATTATTTTTG 


4620 




ATATAATGAT 


TATTCGATGA 


AAAATGACTG 


AAGACATAGT 


ATAATTAAAG 


ATAAATTTGT 


4680 


30 


TTTAACAATA 


TAATGATTAG 


CCAAATATAA 


AGCATTTAAT 


TTTCTATCAT 


TACTATGCTC 


4740 




ACATAATCTA 


AATATTGTTC 


GAACACGTAA 


AAGTAATTTC 


TATTTAAGGT 


GGTAATTGTC 


4800 




TTGGCAATGA 


CAGAGCAACA 


AAAATTTAAA 


GTGCTTGCTG 


ATCAAATTAA 


AATTTCAAAT 


4860 


35 


CAATTAGATG 


CTGAAATTTT 


AAATTCAGGT 


GAACTGACAC 


GTATAGATGT 


TTCTAACAAA 


4920 




AACAGAACAT 


GGGAATTTCA 


TATTACATTA 


CCACAATTCT 


TAGCTCATGA 


AGATTATTTA 


49B0 




TTATTTATAA 


ATGCAATAGA 


GCAAGAGTTT 


AAAGATATCG 


CCAACGTTAC 


ATGTCGTTTT 


5040 


40 


ACGGTAACAA 


ATGGCACGAA 


TCAAGATGAA 


CATGCAATTA 


AATACTTTGG 


GCACTGTATT 


5100 




GACCAAACAG 


CTTTATCTCC 


AAAAGTTAAA 


GGTCAATTGA 


AACAGAAAAA 


GCTTATTATG 


5160 


45 


TCTGGAAAAG 


TATTAAAAGT 


AATGGTATCA 


AATGACATTG 


AACGTAATCA 


TTTTGATAAG 


5220 


GCATGTAATG 


GAAGTCTTAT 


CAAAGCGTTT 


AGAAATTGTG 


GTTTTGATAT 


CGATAAAATC 


5280 




ATATTCGAAA 


CAAATGATAA 


TGATCAAGAA 


CAAAACTTAG 


CTTCTTTAGA AgCACaTATT 


5340 


50 


CAAGAAGAAG 


ACGAACAAAG 


TGCACGATTG 


GCAACAGAGA 


AACTTGAAAA 


AATGAAAGCT 


5400 




GAAAAAGCGA 


AACAACAAGA 


TAACAACGAA AGTGCTGTCG 


ATAAGTGTCA 


AATTGGTAAG 


5460 
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CACTGCTTAT GATAATGAAA GACATCATTT TAAAATTGCT AGAAAGTCTT TCTTTGTTGA 198 0 

AAATGGTAGC TTAGTTCAAA TTGCTCCGAG AGACAGACAA TTTG CACAT A AAAAGCCATG 2 04 0 

GCCGAAATTT TTAACATTAT TTGCGGGACC GTTATTTAAC TTTATATTAG CTTTAGTCCT 2100 

ATTTATTGGT CTTGCATATT ATCaAGGcAC GCcTACGTCT ACTGTAGAAC AAGTCGCAGA 2160 

TAAGTATCCA GCTCAACAAG CAGGATTACA AAAAGGTGAT AAGATCGTCC AAATTGGCAA 222 0 

ATATAAAATA TCTGAATTTG ATGATGTTGA TAAGGCGTTA GATAAAGTTA AAGATAATAA 2280 

GACGACTGTT AAATTTGAAC GTGATGGTAA AACAAAGTCA GTTGAATTAA CACCTAAAAA 234 0 

GACTGAAAAA AAACTGACTA AAGTAAGTTC AGAGACGAAG TATGTTCTCG GATTCCAACC 24 00 

AGCGAGTGAA CATACACTTT TTAAACCAAT TGTATTCGGA TTTAAAAGCT TTTTAATCGG 24 6 0 

TAGTACTTAT ATTTTTACAG CTGTAGTAGG TATGTTGGCT AGTATATTTA CGGGCGGATT 2 520 

CTCATTTGAT ATGTTAAATG GTCCGGTTGG TATTTATCAT AACGTCGACT CAGTTGTTAA 2 580 

AGCGGGTATC ATTAGCTTAA TTGGTtnCAC TGCGTTATTA AGTGTAAACT TAGGTATTAT 2640 

GAATTTAATT CCTATTCCTG CACTAGACGG TGGTCGTATT TTATTTGTTA TATATGAAGC 2 7 00 

25 GATTTTCAGA AAACCAGTTA ATAAAAAAGC GGAAACAACG ATTATTGCTA TTGGTGCCAT 276 0 

TTTCATGGTC GTTATAATGA TATTAGTAAC GTGGAATGAT ATTCGACGAT ATTTCTTATA 282 0 

ATTTAGGAGG ATAAATAATT ATGAAGCAAT CCAAAGTTTT TATACCAACG ATGCGTGACG 288 0 

TGCCATCAGA AGCAGAAGCA CAAAGTCATC GTTTATTATT GAAATCGGGT TTGATAAAAC 2 940 

AAAGTACAAG TGGGATTTAT AGTTATTTAC CGCTAGCAAC ACGTGTGTTA AATAATATTA 3 0 00 

CTGCAATTGT GCGACAAGAA ATGGAACGTA TCGATTCTGT TGAAATTTTA ATGCCAGCGT 3 06 0 

TACAACAAGC TGAATTATGG GAAGAATCAG GACGTTGGGG TGCATATGGC CCAGAATTAA 312 0 

TGCGTTTACA AGATAGaCAT GGAAgACAAT TTgCATTAGG TCCaACACAT GAAGAATTAG 3180 

TTACATCAAT AGTAAGAAAT GAATTGAAAT CATACAAACA ATTACCGATG ACATTATTCC 324 0 

aAATTCAATC TAAATTCCGT GATGAAAAGA GACCACGTTT TGGTTTAyTC GTGGGCGTGA 3 30 0 

ATTTATTATG AAAG ATG CAT ATTCATTCCA TGCTGACGAG GCATCATTAG ATCAAACGTA 3 36 3 

45 TCAAGATATG TATCAAGCGT ATAGCCGTAT TTTTGAGAGA GTTGGCATTA ACGCAAGACC 34 2 0 

AGTAGTTGCA GATTCAGGTG CTATAGGCGG TAGCCATaCA CATGAATTTA TGGCATTAAG 3 480 

TGCTATCGGT GAGGATACAA TCGTTTACAG TAAAGAAAGT GATTATGCTG CTAACATCGA 3 54 0 

50 AAAAGCAGAA GTCGTTTACG ArcCAaATcA TaAGCATACT ACTGTGCAAC CTTTAGAAAA 3 600 

&A~^AAA^A ^<~AAAT'T T, TA A^ArTGrOPA A^AATT^JCrA ^A^TT^TTA^ ^A^A^A^ 1 - ^ " 
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TAGAAGCAAT TAATAATGCy mAAGAAAAGA CAGCTAATAA TACCGGCTTA AAATTAATAT 180 

TTGCAATTAA TTATGGTGGC AGAGCAGAAC TTGTTCATAG TATTAAAAAT ATGTTTGACG 24 0 

AGCTTCATCA ACAAGGTTTA AATAGTGATA TCATAGATGA AACATATATA AACAATCATT 3 00 

TAATGACAAA AGACTATCCT GATCCAGAGT TGTTAATTCG TACTTCAGGA GAACAAAGAA 3 60 

TAAGTAATTT CTTGATTTGG CAAGTTTCGT ATAGTGAATT TATCTTTAAT CAAAAATTAT 420 

GGCCTGACTT TGACGAAGAT GAATTAATTA AATGTATAAA AATTTATCAG TCACGTCAAA 4 80 

GACGCTTTGG CGGATTGAGT GAGGAGTAGT ATAGTATGAA AGTTAGAACG CTGACAGCTA 54 0 

TTATTGCCTT AATCGTATTC TTGCCTATCT TGTTAAAAGG CGGCCTTGTG TTAATGATAT 600 

TTGCTAATAT ATTAGCATTG ATTGCATTAA AAGAATTGTT GAATATGAAT ATGATTAAAT 660 

TTGTTTCAGT TCCTGGTTTA ATTAGTG CAG TTGGTCTTAT CATCATTATG TTGCCACAAC 720 

ATGCAGGGCC ATGGGTACAA GTAATTCAAT TAAAAAGTTT AATTGCAATG AGCTTTATTG 780 

TATTAAGTTA TACTGTCTTA TCTAAAAACA GATTTAGTTT TATGGATGCT GCATTTTGCT 84 0 

TAATGTCTGT GG CTTATGTA GG CATTGGTT TTATGTTCTT TTATGAAACG AGATCAGAAG 900 

25 GATTACATTA CATATTATAT GCCTTTTTAA TTGTTTGGCT TACAGATACA GGGGCTTACT 960 

TGTTTGGTAA AATGATGGGT AAACATAAGC TTTGGCCAGT AATAAGTCCG AATAAAACAA 1020 

TCGAAGGATT CATAGGTGGC TTGTTCTGTA GTTTGATAGT ACCACTTGCA ATGTTATATT 1080 

TTGTAGATTT CAATATGAAT GTATGGATAT TACTTGGAGT GACATTGATT TTAAGTTTAT 114 0 

TTGGTCAATT AGGTGATTTA GTGGAATCAG GATTTAAGCG TCATTTCGGC GTTAAAGACT 1200 

CAGGTCGAAT ACTACCTGGA CACGGTGGTA TTTTAGACCG ATTTGACAGC TTTATGTTTG 126 0 

TGTTACCATT ATTAAATATT TTATTAATAC AATCTTAATG CTGAGAACAA ATCAATAAAC 1320 

GTAAAGAGGA GTTGCTGAGA TAATTTAATG AATCTCAGAA CTCCTTTTGA AAATTATACG 13 80 

CAATATTAAC TTTGAAAATT ATACGCAATA TTAACTTTGA AAATTAGACG TTATATTTTG 144 0 

TGATTTGTCA GTATCATATT ATAATGACTT ATGTTACGTA TACAGCAATC ATTTTTAAAA 1500 

TAAAAGAAAT TTATAAACAA TCGAGGTGTA GCGAGTGAGC TATTTAGTTA CAATAATTGC 1560 

ATTTATTATT GTTTTTGGTG TACTAGTAAC TGTTCATGAA TATGGCCATA TGTTTTTTGC 1620 

GAAAAGAGCA GGCATTATGT GTCCAGAATT TGCGATCGGT ATGGGGCCAA AAATTTTTAG 1680 

TTTTAGAAAA AATGAAACAC TTTACACTAT TAGGTTATTG CCTGTTGGTG GATATGTTCG 1740 

50 TATGGCAGGA GATGGCTTAG AAGAGCCACC AGTCGAGCCC GGTATGAACG TTAAAATTAA 1B00 

ACTTAATGAA GAAAATGAAA TAACACATAT CATATTAGAT GATCATCATA AGTTTCAACA 1860 
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TTATAGGGTT TTTGCGACCG GATGTTTCTT CAATTTAATG TATTGAGAAA GACTATATAA 3 900 

CACAATACCT GTCCAAATAA ATATAAACGT AATTAATTGA TCTATACTAA AAGGCTCTTT 396 0 

5 GAAAACAAAT ATGCCGAGTA CAAACATTAT TGTTGGTCCA ACGTATTGAA TAAATCCTAT 4 02 0 

TAGCGAAAGT GGAATACGTT TTGCCCCGGC TGAGAATAGG ATTAGTGGTA TTGCCGTAAT 4080 

AGCACCAGAA AATAACAACC AAAATGATGA CATGTTCAAT CCAAATGACA TCTGATGTTG 414 0 

70 

CTGCCATAAA TAAATAACGT ATATTAGTCC AGCAGGTGCG GTAACAATAC ATTCAATCGT 420 0 

AATACTGCTG ATGGCATCAA TATGTACTAC TTTTTTCAAT AATCCGTATG TACCAAAGGA 4260 

TAACGCTAAT ATAATAGAGA CGATTGGGAA TTCTCCAATC TTGAGCGTCA TATATAATAC 4320 

15 

ACCGATGAAT GCGAATAAAA TGGCTAGCCA TTCAAATTTA TTGAATCTTT CTTTTAAAAA 43 8 0 

GATAAGTGCG AGCAAAATGC TAACAAGTGG ATTTATATAA TAACCTAAAC TTGTTTGTAG 444 0 

20 GACGTGACCG TTCGTTACAG CCCAAATAAA TGTACCCCAA TTTAATGTAA TGACATAGCC 4 500 

TGCTACGACA ATCGCTAATA GCTGAATGGG CTTGCCTAAC AATTGATTCA TATCTCGTTG 4560 

AAATGCATTG CGTTGTTTTT GTCCAACCGC GAGTATGAAA ATCATGAATA TTGCTGAAAA 4 62 0 

25 TATAATACGA AAGGCTAAAA TTTCAAATGC GCCTATTGCA TCAACGAACT GCCAATATAT 46 80 

AGGTAGTATT CCCCACAGAA TGTATGCACT GAGTGCTAAA AATATGCCTT TTTTATACTC 4740 

TGAATTCACC TTCAAACCTC CTTACTTTCC TAATTTTTAA TTTACTG CAT ACGCTCACTT 4 8 00 

30 GGTTATGCTA ATATAACGAT TTTACTAATA ATATTTCGAT AAAGATATCA TTTTGTTTAT 4 860 

ATTTCCCACA TTTATTCACC AACCACTAAA CAATATTAAT TTTATAAATA ATTCTGTACA 4 92 0 

AATCAGGGTA TATTGCCAGA AAGACTACCA TACAACATAA AGGATGGATA CAAATGACTT 4 980 

35 

TACCTAAAAT TGGAAAGCCT GCAACACGCG CGCTAAATTC ACAAGGTATA TACACATTAG 504 0 

AAGCAGTATC ACAATATACG AAGTCATCTC TAATGGAGAT GCATGGCGTT GGTCCTAAAG 5100 

CTATATCAAT ATTGGAACAA GCTTTATTTC AG 513 2 

40 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 22243 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

AA<"-TAAATTA TATTAT^AAT TTnr^TOT^A AT^T^TAAA nA^AT^^^A ^^^aa 0 '"^ c 
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TACTCCCTAT CGTTGTAGGT CTCCTTATTT GGGCACTTAC ACCTTTTAAA CCGGATGCTG 2100 

TGGATCCAAC AGCATGGTAT ATGTTCGCAA TATTCGTCGC GACAATCATT GCTTGTATTA 216 0 

CACAACCGAT GCCAATTGGG GCCGTCTCTA TAATTGGATT TACAATCATG GTACTCGTTG 22 2 0 

GCATTGTTGA CATGAAAACG GCTGTCGCTG GTTTTGGTAA TAATAGCATT TGGTTAATTG 2280 

CTATGGCATT TTTCATTTCG AGAGGATTTG TGAAAACAGG TCTTGGTAGA CGTATCGCAC 2340 

TTCATTTCGT CAAATTATTT GGTAAAAAAA CATTAGGATT AGCATATTCT ATCGTCGGTG 24 00 

TAGATTTAAT TCTAGCGCCT GCTACACCAA GTAATACCGC GCGTGCTGGT GGAATCATGT 2460 

TCCCAATTAT CAAATCACTT TCTGAATCAT TTGGTTCGAA ACCGAAAGAC GGATCAGCAC 2520 

GCAAAATGGG TGCATTTCTT GTTTTCACAG AATTCCAAGG TAATTTAATT ACTGCGGCTA 2 580 

TGTTTTTAAC TGCAATGGCC GGTAACCCCC TTGCACAAAA TTTAGCATCT AGCACATCTA 2 64 0 

ATGTTCACAT TACATGGATG AATTGGTTTC TAGCTGCTTT AGTTCCTGGA CTTGTTTCCT 2 700 

TAATTGTTGT ACCTTTTATT ATTTATAAAA TTTATCCACC AACTGTTAAA GAAACACCAA 2760 

ATG CTAAGAG TTGGGCTGAA AATGAATTAG CGACTATGGG TAAAATCGCT TTAGCTGAAA 2 820 

25 AATTTATGAT TGGTATTTTT GTCGTTGCGT TAACACTATG GATTGTCGGA AGTTTCATTC 2 880 

ATATTGATGC AACTTTAACG GCCTTTATTG CGCTAgcATT gTTATTATTG ACAGGCGTCT 2 94 0 

TAACATGGCA AGACATTTTA AACGAAACAG GTGCTTGGAA CACATTAGTA TGGTTCTCAG 3 000 

30 TATTAGTGTT AATGGCCGAC CAATTAAACA AGCTTGGATT TATTCCTTGG TTAAGTAAAT 3 06 0 

CCATTGCTAC AAGTCTTGGT GGCTTAAGCT GGCCTATAGT CCTGGTCATT TTAATATTGT 3120 

TCTACTTCTA TTCACATTAC TTATTTGCAA GTTCTACAGC ACATATCAGT GCGATGTATG 3180 

CAGCATTACT AGgCGTTGCC ATCGCAGCCG GTGCACCACC ATTATTCAGT GCATTAATGT 324 0 

TAGGTTTCTT CGGTAACCTA TTAGCTTCAA CAACACACTA TAGTAGTGGT CCAGCGCCGA 3 300 

TTCTATTCTC TTCAGGTTAC GTGACTCAAA AACGTTGGTG GACAATGAAC TTAATATTAG 3 360 

GTTTCGTCTA CTTTATTATC TGGATTGGTT TAGGATCACT TTGGATGAAA GTAATTGGTA 34 20 

TATTTTAAAA TATTTAAATT AGCGCTCGAA TCTCATTGAT TTGGGCGCTT TTTAATTTGT 34 80 

ATTTAAAATC AACCTTTGCT AAATCAAGAC TCCCTTTTTA AAATACGTTT ATCCTTTAAA 3 540 

TCATTGCGTG CTTCACTGAA AATTTGTATA AAGATTTAAG TCATTACGTA ACATCACATA 3 600 

AAATACATTT CTATACTATT CCGCTTCATT GATTAACATT ACGTATGCCC TCATAAATCA 3 6 60 

50 TCATACAAAA AACACCTTCG TTTAAATTCA TTTTAATTGC GAATTCAACG AAAGTGCCTT 3720 

ATTTCATATT TAATGTTTCA AATTTATACG TCTGTCACTG TTACTGCACA CATACCTCAG 3 7 80 
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AATACAAGTT CCAATGAGCG CAATTAAAAG TACTAACCCA ACGATGAAAC TCTGTTTGTC 3 00 

ACTTAACTCA AAGAAACTAT AGATAGGATA TTTTTTAATA ATCAAGCCAC CTAAAATCAT 3 60 

5 CCATAAAAAT ACGATAATTC CATAAGTCAC ATTTATAACA TACGTTATTT TTTGGTCACC 4 20 

AAATCGGACT AATGTATTTC GTAGAATCAG CATACCAATG ACAACACCTA AAATAACGAT 4 80 

ACT AG CT AT A TAAAGTAAAA ATGCAATTGT CACATCAAAT GTACCCAAAT CTAAAAACCT 54 0 

W AGGAATTAyA AyGACTGCTA AAATAAAAGC GAAGyACAAA GTAATATAkT TATACAAACC 6 00 

GGTAGTAAGA CTTATCTCAG GTGATAATTG ATCAGCCATT GACTTAATCG GTGTATTAAT 66 0 

AATTGAACTT GTATCTTCGT TATTTTTTTC AGCCATAGTT AAATGATCTT CGAGCTCTTC 720 

15 

CAATAACTCT TCTACTTCTG CTTCAGTCTT ACCTCTAAAT AACAATTCAA CACGTAATTT 78 0 

TTCTAAAAAA TCTTGAGATT GTTTACTTAA CATCGTTTTC CCCTCCAAAC AAGTTAATCA 84 0 

TCCCTTTATT CAAAACTTGC CATTTCGATT TAAATACTTT TAGTTCCTTT AAACCTGAAT 90 0 

20 

CGGTAATCGT ATAGTATTTC CGCCTCGGGC CGCCATTACT AGATTTTTTT ATTGTCGTAT 960 

CAACGTATCC TTTTTTGTTT AAACGCATTA AAACTGGATA AATACTACCC TCACTTATCT 102 0 

25 CTGGAAACTC TTGATTCTTA AGTTTCGTCA TAATTTCATA TCCATACGTT TCGCCTTGGG 10 80 

CAATGAGACC TAATATCGCC CCATCTAAGA GACCTTTCAT AATCTGATCT GACACTGACA 114 0 

TTTTAATCAC CTACTATCTT ACATAATAAG ATAGTACATT GAGAACTTTT CGTCAACTAT 12 00 

30 CTTTTATTGT AAGGTAGTTG TTGTACACAT TCCTTAAATG ACTAACAACT TTGTTAATAG 12 60 

GGTAATACTT ACGGAAGTAT ATTTTATTTA TGGGGGAGGA ATTAATAATG ACTACAAAAA 1320 

CAGTATTTGA TGTCATTGAT ATGGGGTTAG GATATTTAGT AAATGTGTAT GATGCTTGGA 13 80 

35 AAGTTGAAAA GGTACTTGAT GATTATCATA AGCCTTTTTC TAATACCATT CATTGGCAAT 144 0 

TTGGtCATGT ATTAACAATT TTTGAATCGG CCTTAGCTGT TGCTGGTAAA GAGAATATTG 1500 

ATTTAAATAT CTATAGACCT TTATTCGGAA ATGGTTCGTC TCCAGATGAA TGGAAGGATG 156 0 

40 

AAGTACCGAG TATTGAAAGG ATTTTAGAAG GTCTCCAAAC TTTACCTGAA CGTGCACGAA 16 2 0 

ATCTAACTGA AGATGATTTA GCAATTGAAT TGAAACAGCC AATTGTCGGT TGTAATAACT 168 0 

TAGAAGAGTT ATTAGTATTA AATGCCATTC ACATCCCACT TCATGCTGGT AAAATTGAAG 174 0 

-J 5 

AGATGTCTCG TATATTAAAA AATTTAAAAT AAATATGTGC TTATTAACCG TTAACAACAC 1800 

GTTAACGGgT TTTTTATTTG TTTAAAAGGT CACTTTTTTG AATTTAATAA ACACCATCTA 186 0 

50 TACCAGTTCT TCACCGATTC TCGAAAAATA ATTATATTAA TGATTTCGTT AATTTAATTT 192 0 

t^^TA?" TnT-Ta^jp ^^^-tt^t A C TT AO f*TTT ATTCTTAAAT TGAAATATOT *! n R 0 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 





TTTTTTCTTT 


TCTTCATTTG 


AAAATTOATC 


ATTCAGCAAT 


ATAAGCGTAT 


TTGTTAATGA 


60 


15 


TTTAGGTGTT 


CCAATTTCAT 


AATCCCACCA 


ATTTAAGTTG 


GTATTCTTGC 


CAGTTGTTTT 


120 




AGTAAAATTC 


TCACTTAATT 


CTTTTACTTT 


TTTATCTGGT 


TCTTTTCCAT 


ATGCA'ITITf 


180 




ATGCAGCCAC 


TCAAGGGCAT 


CTTTCACTTT 


CTTCTTATTT 


TCGTCAGTAT 


TTAAAGTGGT 


240 


20 


TTTAGGATTC 


CTCATCGCTT 


CTGCGATTTT 


CTCAATATTA 


CGATAGGTAC 


GAGTCATATG 


300 




AGAAGAATTA 


GTTTCAAGGG 


TTTCCGCTCC 


TGACCACAAG 


TATTTCCTAC 


CACTTTCAGT 


360 




TTTCATTTCC 


TTGAGTAAAT 


TCGTCGCCTC 


TTTCTCTGTA 


GCATCAAACT 


TCTTCTTCAT 


420 


25 


ATCTGGATTA 


TTCTCATCAT 


ACTTATCATA 


ACCATAGTTA 


ACGTCCAGCC 


ATGTGTTCCT 


480 




CAATTTTTCA 


TAATCTGGCG 


TTTGAACATT 


CGTATCAGCC 


ACAGCGATTT 


GATGTTTATC 


540 




AACACTTCTG 


AATTCACCAC 


CATTCAAAGT 


AATCACACCA 


GCCATTAATA 


ACGTAATGGT 


600 


30 


GGATAATTTT 


TGCCATTTCT 


TTATTCTATA 


TGTCATTGaC 


ATGTCTCCTT 


TTTGTGTTGC 


660 




GCGTGCGCAA 


TGAATATTAT 


GATTAAATAA 


TGATTCAATT 


TTTCAAAATT 


CGTTAACGTA 


720 


35 


TACAAATGAC 


TGTCTACTGT 


CAAACAATCC 


ACAAAGAATG 


TTGATGtCAT 


ATaAACAATC 


780 


GATCACCCAA 


ATTTTCCG 










798 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5132 base pairs 
(B5 TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

TACAGGTTTT ACTATAATGG ATGGTATTTT GGCTAAACGA CATTGGTTTA GTCTTCTTTT 6 0 

50 TTTnACTTCC TAnATTTACA ATGGTATAAA TAATAATGCT ATATTTAGAA TGATGAGTAT 120 

ACTTACTGAA ACTAAATTAA AAGTGTCTGG TTCTTTACTA AAGATAGCTG CTATCCTTGC 180 
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AAAACAGCTG GTAAAATATT AT AT CG AG AT CAAGACATTT TTGATCAAAA ATATTCTAAA 6180 

GAACAATTAC GTACAAATGT GGGCATGGTC TTTCAACAAC CTAATCCATT TCCAAAATCA 624 0 

ATATACGATA ATATTACTTA CGGTCCAAAG ATTCACGGTA TTAAAAATAA AAAAGTTCTT 6 3 00 

GATGAAATCG TTGAGAAATC ATTACGTGGC GCTGCAATTT GGGATGAATT AAAGGATAGG 6 3 60 

TtGCACACAA ATGCATATAG TTTATCCGGT GGGCAACAAC AACGTGTTTG TATCGCGCGT 642 0 

TGTTTAGCAA TTGAACCTGA AGTCATTTTA ATGGATGAAC CGACATCAGC ATTAGATCCA 64 80 

ATCTCAACAT TAAGAGTAGA AGAGTTGGTT CAAGAACTAA AAGAAAAGTA TACAATTATT 6 540 

ATGGTtACAC ATAATATGCA ACAAGCAGCT CGTGTATCAG ATAAAACTGC ATTTTTCTTA 6 6 00 

AATGGTTATG TCAATGAATA TGATGATACT GATAAAATTT TCTCTAACCC ATCAAACAAG 6 6 60 

AAAACAGAAG ATTATATTTC AGGAAGGTTT GGTTGATATA TAATGGCAAT AATTAGACAA 6720 

CGATATCAGG AGCAACTTGA TGATTTAATA AAAGAATTAC GTCGGTTAGG TGCaAATGTC 67 80 

TATGTGAGTA TTGaAAATGG TATAAAAtCA TTAAGTATTG aCGATAGAGG cTTTGCACGA 684 0 

CAAACAGTTA AAAACGATAA ACATATCAAT CAATTAAATT ATGATATTAA TGAGCGAGTT 6 90 0 

25 ATCATGTTAA TTACAAAGCA ACAGCCCATT GCGAGTGATT TGCGTATGAT GATTTCTTCA 6 960 

TTAAAAATCG CCTCCGATTT AGAAAGAATA GGAGATAATG CCTCGAGTAT TGCCAATATT 7020 

CGATTGCGTA CAAAGATTAC AGATGATTAT GTGTTAACCC GTTTAAAGAC AATGGGTAAA 70 80 

30 TTAGCTATGT TAATGTTAAA GGACTTAGAT CAAGCATTTA AAAAGAAAGA TACCGTATTA 714 0 

ATAAGAGAAA TAATTGAGCG TGATGAAGAT ATCGATGACT TATATAGTCA TATTATTAAC 72 00 

GCAACGTATC TTATTGATAA CGtCCATTTG TCGCTGCACA AGCTCATTTA GCAGCAAGAC 72 6 0 

ATTTAGAACG TATTGGTGAT CATATTATTA ACATCGCTGA AAGTGTTTAT TTTTATTTAA 73 2 0 

CAGGTACACA TTACGAACAA TAACTTAAAG TTATTACTAT AAAATCCCTT ACGATAAATA 73 8 0 

TATATTTCTA TTATTCATAA ACCCTCAAAA AAACCAAGAT TCTCACAATT AGTAATGTGA 74 4 0 

AAA7CTTGGT TTATATTGTT CT ACT AT AAA TTGTCTCGCA TCTTAGTTAT TTGCTTGCTC 7 50 0 

AATTTCATCT GTTAATTTTT CAACTTCATC GACTAAATCA GAAATATATT GAATTGTAGA 7 5 60 

TTTAAGTGGC TGTTCTGTAG TAATGTCTAC ACCTGCAATG TTTGCAAGTT CGACAGGTGA 7 62 0 

TACACTACCA CCTTTTTTCA ATGTTTCTAA CCAAGCATCA ACAGCTGGTT GGCCTTCATT 76 80 

TTTAATCTTT TGAGAAACGA CAGTTCCGAT TGTTAAGCCA GCAGAATACG TATACGAATA 774 0 

TAATCCCATA TAGTAATGAG GTTGACGCAT CCATGTTAAT TCAGCACCCT CAGTCATGTC 78 00 

TACTGCATCT C C AAA AAA TT GTTTATAAAC ATTTAGCATT ATTTCATTTA ATGT-COGr^ -P^n 
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AACCGATATT AGAAATTTTA GCAGGAATAC CAACAATTGT GTTTGGTTTC TTTGCATTAA 4 3 80 

CCTTTGTTAC ACCAGTATTA AGATCTTTCA TACCAGGTCT TGGAGAGTTT AATGCTATAA 444 0 

GTCCCGGCTT AGTTGTCGGT ATTATGATTG TCCCTCTCAT CACAAGTTTG AGTGAGGaTG 4 500 

CAATGGCATC TGTACCAAAT AAAATTCGAG AAGGTGCCTA TGGACTTGGA GCAACTAAAT 4 560 

TAGAAGTAGC AACTAAAGTC GTACTTCCCG CAGCAACATC AGGTATTGTA GCTTCAATCG 4 620 

TTCTCGCGAT TTCAAGAGCA ATTGGAGAAA CGATGATTGT ATCATTAGCG GCAGGTAGTT 4 680 

CGCCAACAGC TTCATTAAGT TTAACAAGTT CGATTCAAAC AATGACTGGA TATATTGTTG 4 74 0 

AGATAGCGAC AGGTGATGCA ACATTTGGAT CAAATATTTA TTACAGTATT TATGCTGTAG 4 800 

GGTTCACACT ATTTATCTTT ACCTTAATCA TGAATTTACT TTCTCAGTGG ATTTCTAAGC 4 860 

GTTTTAGGGA GG AG T ATT AA TATGGAAACG ACAGATAATA ATAGACAATC ACTCGTCGAT 4 920 

CAACAACTTG TCCAAAAACA TTTATCATCC AGAACGGTTA AAAATAAAGT GTTCAAACTC 4 980 

ATATTTTTAG CATGTACATT ATTAGGACTT GTCGTACTTA TTGCGTTGTT AACTCAAACA 504 0 

TTGATTAAAG GGGTAAGTCA TTTAAATTTA CAGTTTTTCA CTAATTTTTC TTCTTCAACA 5100 

CCATCTATGG cTGGCGTTAA AGGCGCGTTA ATCGGTTCAC TTTGGTTAAT GTTAAGTATC 5160 

ATTCCATTAT CAATCATCCT AGGAATAGGT ACAGCTATAT ACTTAGAAGA ATATGCGAAA 522 0 

AACAACAAAT TTACTCAGTT TGTTAAAATC AGTATTTCCA ATTTAGCTGG TGTACCATCA 5 2 80 

30 GTTGTATTTG GGTTATTAGG TTATACTTTG TTCGTTGGTG GTGCAGGGAT TGAAGCCTTG 534 0 

AAAATGGGTA ACAGTATATT GGCAGCAGCG CTAACAATGA CCTTACTGAT ATTACCAATT 54 00 

ATTATTGTTT CAAGTCAGGA AGCAATTAGA GCTGTACCTA ACTCAGTACG CGAACTTcTT 54 60 

ACGGCTTAGG TGCTAATAAA TGGCAAACGA TAAGACGTGT TGTCTTACCA GCAGCGTTAC 5520 

CTGGTATTTT AACTGGATTC ATTTTGTCTC TTTCAAGAGC ACTGGGAGAA ACAGCGCCAC 5580 

TTGTGCTAAT CGGTATACCG ACTATATTAT TGGCAACACC TAGAAGTATA TTGGATCAAT 564 0 

TTTCAGCATT ACCTATCCAA ATATTTACTT GGGCGAAAAT GCCTCAAGAA GAATTCCAGA 5700 

ATGTTGCATC GGCAGGCATT ATCGTTTTAC TAGTTATCTT AATCTTAATG AATGGCGTTG 5760 

CGATTATTTT ACGTAACAAA TTTAGTAAAA AATTCTAATT TAAACAATCA ATCTCATTTA 5820 

TCTATTAAAA AGGGAGTTTT AAATATGGCG CAAACACTTG CACAAACTAA ACAAATATCT 5880 

CAAAGTCATA CGTTTGATGT CTCACAAAGT CATCATAAAA CACCAGATGA TACAAACTCA 5 94 0 

50 CATTCTGTTA TATATTCAAC ACAAAATTTA GACTTATGGT ATGGCGAAAA TCATGCATTA 6000 

CAAAATATTA ATTTAGATAT TTATGAAAAC CAAATTACTG CCATTATAGG TCCATCTGGT 6 060 
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CTTACTGCTG TTTTTTTAGG GATTTATGTC CCAGCCATTT TTGTATTCAT ATTTAAATTT 2 58 0 

CGATAATTTT TCAGGAAGCA TTTTAATTTT ACTAATGAAG CAATATTTTT TAGATTAACA 2 64 0 

AAAATTAATA TTTACATTTT CTTAACAATT TTTTATGTAA CATTTACAGT TTCTAAAAAT 27 0 0 

GAGGTTAATA ATTCAAGGTT AAGATAAAGA TGTAATCAAT ACAAATACTA TTTGTTGTTC 2 76 0 

ATACAGGGAG GATATTTCAA TGAAAAAATG GCAATTTGTT GGTACTACAG CTTTAGGTGC 2 82 0 

AACACTATTA TTAGGTGCTT GTGGTGGCGG TAATGGTGGC AGTGGTAATA GTGATTTAAA 2 8 80 

AGGGGAAGCT AAAGGTGATG GCTCATCAAC AGTAGCACCA ATTGTGGAGA AATTAAATGA 2 94 0 

AAAATGGGCT CAAGATCACT CGGATGCTAA AATCTCAGCA GGACAAGCTG GTACAGGTGC 3 000 

TGGTTTCCAA AAATTCATTG CAGGAGATAT CGACTTCGCT GATGCTTCTA GACCAATTAA 3 06 0 

AGATGAAGAG AAGCAAAAAT TACAAGATAA GAATATCAAA TACAAAGAAT TCAAAATTGC 312 0 

GCAAGATGGT GTAACGGTTG CTGTAAATAA AGAAAATGAT TTTGTAGATG AATTAGACAA 3180 

ACAGCAATTA AAAGCAATTT ATTCTGGAAA AGCTAAAACA TGGAAAGATG TTAATAGTAA 324 0 

ATGGCCAGAT AAAAAAATAA ATGCTGTATC ACCAAACTCA AGTCATGGTA CTTATGACTT 33 0 0 

25 CTTTGAAAAT GAAGTAATGA ATAAAGAAGA TATTAAAGCA GAAAAAAATG CTGATACAAA 33 6 0 

TGCTATCGTT TCTTCTGTAA CGAAAAACAA AGAGGGAATC GGATACTTTG GATATAACTT 342 0 

CTACGTACAA AATAAAGATA AATTAAAAGA AGTTAAAATC AAAGATGAAA ATGGTAAAGC 34 8 0 

AACAGAGCCT ACGAAAAAAA CAATTcAAGA TAACTCTTAT GCATTAAGTA GACCATTATT 354 0 

CATTTATGTA AATGAAAAAG CATTGAAAGA TAATAAAGTA ATGTCAGAAT TTATCAAATT 3 6 00 

CGTCTTAGAA GATAAAGGTA AAGCAGCTGA AGAAGCTGGA TATGTAGCAG CACCAGAGAA 3 66 0 

AACATACAAA TCACAATTAG ATGATTTAAA AGCATTTATT GATAAAAATC AAAAATCAGA 372 0 

CGACAAGAAA TCTGATGATA AAAAGTCTGA AGACAAAAAA TAATAAGACG CAATTTCAAA 37 8 0 

TGTGTCTTGA AACATGATTT TGATGGTGAA TCATTATTTA GAGTACAAAG CTTGATTTAT 3 84 0 

CGAGACGCTG ATTTTGACAT TCAGTTAGTC TAcAAGCTTA TCAACTTAAA ATAGTGGTTC 3 900 

ATCATTATTT TACAAATCTA ATTATTTTGG GAGTAATAGA AAGAGGTTTG ATTATGACTT 3 96 0 

45 CATCTACTAA TGTTAAAGCT TTAATCGAAA AAAATAATAA TAAAAAAGGA AAGCATAATG 4 02 0 

ACAAAATTAT ACCAGTTATT TTAGCCGCAA TTTCAGCGAT TTCCATTTTA ACAACACTAG 4 0 80 

GTATATTAAT CACATTGCTT TT AG AAA CCA TCACTTTTTT CACCAGAATT CCAATAACTG 414 0 

50 AATTTCTATT TTCTACTACT TGGAATCCTA CCGGTTCAGA CCCTAAGTTT GGTATCTGGG 4 2 00 
CATTGATAAT AGGGACTTTA AAAATCACAG TTATTOrGAr TATA rr * r T^^A n^^^A^^' 
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ATCTCCAAAA CGTAAACTCA CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT 7 80 

TCGATATTCT AACGGAAGAA TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT 840 

TGTAATGATA GAATGACTCA CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA 90 0 

ATAACATGGC ATTAGACAAA GATATAGTAG GTTCTATAGA ATTCCTTGAA GTAGTAGGGT 96 0 

TACAAGGTTC AACTTACCTT TTAAAAGGAC CAAACGGTGA AAACGTAAAG TTAAACCAAT 102 0 

CAGAAATGAA CGATGATGAT GAATTAGAAG TAGGTGAAGA ATATAGTTTC TTCATTTATC 10 80 

CAAACCGTTC AGGTGAATTA TTTGCAACTC AAAATATGCC TGATATTACG AAAGATAAAT 114 0 

ATGACTTTGC TAAAGTACTT AAAACGGATC GCGATGGGGC ACGTATAGAT GTTGGATTAC 1200 

CCCGTGAAGT GTTAGTACCA TGGGAAGATT TACCAAAAGT GAAATCACTA TGGCCACAAC 1260 

CTGGTGATTA TTTGCTAGTT ACATTACGAA TTGACCGTGA GAATCATATG TATGGACGTT 13 20 

TAG CG AGTG A ATCTGTTGTA GAAAATATGT TTACACCTGT ACACGACGAT AATTTAAAAA 13 80 

ACGAaGTCAT TGAAGCCAAA CCTTACCGCG TATTACGAAT TGGTAGCTTT TTATTAAGCG 144 0 

AATCAGGTTA CAAAATTTTC GTACATGAAT CAGAACGTAA AGCTGAACCA AGATTAGGTG 1500 

25 AATCTGTTCA AGTTAGAATT AT CGGGCATA ATGATAAAGG TGAGTTAAAT GGTTCATTTT 1560 

TACCACTTGC ACATGAACGT TTAGACGATG ACGGCCAAGT CATCTTTGAT TTACTAGTTG 1620 

AATATGATGG TGAATTACCA TTCTGGGACA AATCAAGCCC TGAAGCGATT AAAGAAGTAT 1680 

30 TCAATATGAG TAAAGGTTCA TTCAAACGTG CAATCGGTCA CTTATATAAA CAGAAGATTA 174 0 

TTAATATAGA AACAGGTAAA ATCGCTTTAA CTAAAAAAGG TTGGAGTCGA ATGGACTCAA 1800 

AAGAATAATC ATTTTTACAC GTGTCGTAGG ATGCGTGTTT TTTTTATTCA ATATTAAATC 186 0 

GGACAGATGA AGTAGTTTTT TAAACATTCC TTTCAAAGTA AAAAATTAAA TAATTCAAAC 1920 

GAATAGGCTG GGaCATTAAG TTCTTAGGCA ATGTAAAAAA GCTGATTTCT ATTAATTATT 1980 

TGATGGAAAT CAGCTTTTTT GATATGTATT TTATAATGTA CAGCTCGTTG AGCTGCTATT 2040 

TTCCTTATAT TAAGTGCCAT TAATACAAAA CCTAGCTCTC GTTTAACTTT ATTTATTCCT 2100 

CGAACTGACA TTCGAGTGAA aCCCAAAATA GCCTTCATAA ATCCAAAAAC AGGCTCTACA 216 0 

TAAATTTTTC TATGACTATA GATTTTTTTC GTTTCTGGTT CAGAAAGCTT TTGaTTAATT 2220 

TGGGCTTTAA TGTATTTCAA AGTAAAATTA CATGTTAATA CGTAGTATTA ATGGCGAGAC 2280 

TCCTGAGGGA GCAGTGCCAG TCGAAGACAG GGGCCCCAAC ACAGAAGcTG ACATATAGTC 2 34 0 

50 AGCTTACAAC AATGTGCCGG TTGGGGTGGC TGAGACGGCA CCCTAGGAAG GGACCCGTCA 2400 

TCAAAAATTC TATTTATAGA ATTTTACAGT AATGTGACAG ACGGGCAAAG CGAAgCCATT 24 6 0 
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TGTTGCTTTT CCTATTAACA TT ATT AC CAT TTTTCAATAT 

ATATGTTAAG CAATGCACCC GCTGAAACAT CTACTCTAAT 
5 TAACTCAAAA CTCCAGTGGT GGCTTATTAT CTATCGGTTT 

CTTCAAATGG AATGACTGCA ATTATGAATT CTTTCAATGT 
GCCGTAATGG AATCGTATTA AAACTACTAA GTGTTGTCTT 

10 

TGTTTGTAGT TGCTCTAGCA TTACCAACGC TTGGTTCTGT 
GTCCACTTGG aTTTGACGAA CAAGTGAAAT GGATTTTTAA 
CAATCATTAT TATATTTATC ATATTTATCG TGTTATATTC 

15 

CGAAGCTTAA GTCAGTATTA CCAGGTGCAG TATTTACTTC 
CATTTGGTTT TGGTTGGTAT ATTTCAAATT TTGGTAACTA 
2 0 7CGCGGGTAT CATCATTTTG TTACTATGGT TATATATCAC 

GnGCTGAAAT CAATGCAATC ATTCATCAGC GTAGTGTAAT 
(2) INFORMATION FOR SEQ ID NO: 161: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
TCTTGAGCCA TCTTTTGAGC TAACTGACTA GATTGATACC CAAAAATCAT AGTT AC CAAC 6 0 

35 

ATAAACTTTA ATTTTACCGA AGTCTAAATC AGCGATATGA GTACATACAT TATTTAAGAA 120 
ATGACGGTCA TGCGATACTA CGATAACAGT ATTATCAAAG TTAATTAAGA AATCTTCTAA 180 
CCAACTGATT GCTGGAATAT CGAGACCGTT AGTAGGCTCA TCCAGTAATA GTACGTCTGG 24 0 

40 

TTCACCGAAT AAACTTTGCG CTAATAATAC TTTAATTTTT TGGTTGTTTT CTAATTCAGC 300 
CATTTTTTTA TCGTGTAAAG TTGGATCGAT ACCTAAACCA GATAAAAGGT TAGCAGCATC 360 

45 AGCTTCAGCA TTC CAAC CAT TCATTTCTGC AAATTCACCT TCAAGTTCAG CAGCACGGAT 4 20 

ACCATCTTCA TCACTGAAAT CTGGCTTCAT ATAGATTTCA TCTTTTTCTT TCATAACCTC 4 80 

ATAAAGACGT TCGTGACCTT TAATTACAAC ATCAAGCACG CGTTCATCTT CATAAGCATA 54 0 

50 GTGGTCCTGT TTTAAAACAG CTAGACGTTC ATTTTTCCCT AATGAAACAT GTCCTGTTTG 6 00 

AGAATCTAAT TCACCAGATA ATATTTTTAA GAATGTTGAT TTACCTGCAC ^A^rcrA^^ < - " 



TAAGCAGAGT CAAATTACTA 54 0 

TAAGAGTGTA ATTGGTGATA 60 0 

GATTTTAGCA ATTTGGTCAG 66 0 

TGCTTACGAT GTAGAAGATA 72 0 

CACTGTAGTT ATGGGCGTTG 78 0 

AATTAGTCAT TTCCTATTCG 84 0 

CCTTATTAGA ATTGTGTTAC 90 0 

GGTTGCACCT AACGTTAAAA 96 0 

AATTATTTGG TTAGCTGGTT 1020 

TTCTAAAACA TATGGCAGTA 1080 

AAGTTTTATT ATAATTGTCG 114 0 

TAAAGG 1186 
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TGCTAATCCA TCTAAACCAT CTGTTAAATT TACTGCATTA GAAAAACCTA CTTGCCAAAA 72 0 

AACAATGAAA ATAACATATG CAAATGATAG TGGGATTGCT ACATTCGTAA ATGGAATATG 780 

TATGCTCGTA GAAAAATTCA CCAAATGAAA CACATTACTT AAAACAAAGA ATATAATCGC 84 0 

AATACCAATT TGCGCCAAAA ACTTCTGTTT ACTTGTTAAA CCTTGGTTAT TCTTTTTAAC 900 

AACAATAATA TAATCATCTA TAAAACCAAT TAACCCAAAA CCAATCGTCA CAAATAATAA 96 0 

CAGTATGATT GGATTAGCTT GATCTACAAA TATAATAGCC ACCAAAGACG TTATCACAAT 102 0 

ACTTAATAGA AATGTTAGTC CACCCATCGT TGGTGTACCA GTCTTCTTCA TATGGCTTTG 10 8 0 

TGGACCTTCT TCTCGAATAC TTTGACCAAA TTTCATCCTT TTTAATGTAG GTATTAAAAC 114 0 

AGGT AC CAAA ACAAATGTAA TCACTAGCGC TAATAACGCA TATACAAAAA TCATAACTAT 1200 

CTCCTCTTCT TAATCCAGAC TTTTTTAACC ACTAATATAT TATCAATTTT TCAATTAAAT 1260 

AAACAAAGTT GTAATCAAAA TTTATAATTT TTCTTTTTTA CGGCATAAGA GGCCAGTATA 1320 

AAAAGTTTGC CTATAACAAA CAAGTTAATC TGACCTCGTC TACCTTAAAA TTCTCTATCA 1380 

ACACTTATTT ATAAAGATTA AATGAAGATG TTGTTTTCTA TCACAGCATT ACTTTAGTAA 144 0 

25 AAACAAATAG TGACAATACA TCCTAATTTA ATGTAGCCAT TCTTGTTAGT CCGACTTATC 1500 

CTTGTCAGTT TTACTGTCAG ATTTCnTCTT ATCATCTGAA TTTGAATCAG AATTATTCGT 1560 

CGAATTGCTG TCTACATTCT CTGGATGGAA AATTCTACGT 16 00 
(2) INFORMATION FOR SEQ ID NO: 160: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

ATTGCCTTTG TTTTAATTTT AAATCAAAAT mGCCTATGAA AGATTTAAAT CAATTAATTT 6 0 

CTATAATATT ATCATTTTTA AAG CAT ATCA TTGTTTAGTT TTTTTATAAT TGGATAAATA 120 

CTAATAGTTA CTTTATAAAA CATTACATAG AGAAAGGTTA AGGAGTGCAC ATGTCGAAAA 18 0 

AGGATCACTC TTCTTCAAAA TACCTTAATT CTGTTAAGGA AGCGCAAGAG GAGTCAAAAA 24 0 

AGAAAAATAA AAGTAATCCC AAAATTGATG TTGATCGTAC ATATATTGAA CCTCAACAAT 300 

50 TCCAATCTAA GAAACCTAAA AAAGATGATC AGGTTTTCTT CTTATCAAGA TTAAATAAAC 360 

CTGCAAAATA TAAGAAAGAC TCTAATTTCT TATCATATCT CATCTATCGC ATAGGAAAAG 42 0 
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ACATTAATGA TATGGATTTC TTTGTCTATG TATCCGACTA ATGTATGTGA TTTGTCTAAA 414 0 

TCATTGACTG CATTAATAAT ACTTTGAACG TTATCATTCA TTTTAGGTGC ATGTATATCA 4 2 00 

ATATAAGATT CCGTCTCATT TGCATTGATA AATAAATTGA GTTTCATCAT AGGTTAATGC 426 0 

CTCCTTCAAA ATT ATT AAA C CATAAATGAC CATC GAT AT A TTTAAATTTT GTTGAATGGT 432 0 

AGAAATTAAA TGTTAAGTGG CTAGAAAGCG CTAATCAATA TAAAAGATAC CTCCTGAAAT 4 3 80 

AAAAACAGAA ATGTTTTTTC AGGAGGTAGA GATTAAAGTG AATTATTTGG CAGTGTAATA 444 0 

GTAAAGGTGG TTACATACTC GTTACTTTGT GTGAATTGGA TTGTACCATG ATGCAATTCA 4 500 

ATGATGGATT TTGTAATTGC AAGACCTAAA CCATTGCTAT TATCATGTTT GCTCACTTTA 4 56 0 

TAAAAACGTT CAAATAAACG TGCTTCAGCT TGTGGACTAA TTGGTGAACC ATCATTACTT 4620 

ATTGTGAAAA TGATATTGTT GTGACTATGT TGCAAAGCGA TGTCAATGGC ACCACCAACA 4 6 80 

TCTGTATACT TAATAGCATT TATTAATAAA TTACTCAATG CTTGATGTAA CAAACGTTGA 4 74 0 

TTTCCTAGGA AATTGATGAT TCTAGGTCAG CTAAnATGAT TAACGACTTT TCATCAGCAG 4 8 00 

CAnATTGTTC ATGTCGAATG ATATCnTTAA TGAGCTG 4 83 7 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1600 base pairs 
( 3 ) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

ACAATTATTG G ATT ATT AT C AAGCAACGTT AATGGATGAC TTCCACTTAC AACAGAAATG 6 0 

CCCATAGATT CTAAATCTt T TGCATGAGCA TCTTGTGATA AGTCTTTTCC ATCATTGACA 12 0 

GTTACATTCG CACCTAATTT ACTTAATAAT TTAGCTGCTT CATAACCACT TTTTGCCAAA 180 

CCGACAACTA ATACATTTTT ATTTTCTAAC CCTGTATAAT TAAGCATCTT AATGCACTCC 24 0 

AATCCATAAA CCGATTAAAC CTGAAATCAG ACCAACAGCC CAAAATACTG TAACTACTTT 3 00 

CCATTCGCTC CATCCTATCA ATTCAAAATG ATGATGAATC GGACTCATTT TAAATATACG 3 60 

CTTTCCAGTC AATTTAAAGC TAGCGACTTG TAACATAACA GATAATGTTT CAATTACGAA 4 20 

TACTAAACCT ATAAAAATTA ATGATAATTC CTGATTAAGC ATGATTGAAA TGGTAGCAAA 4 80 

TATACCACCT AAAGCTAAGC TACCTGTATC T CC CAT AAA C ACTTTAGCAG GGTTAATGTT 54 0 

ATATGGTAAA A AT CT AAA A OTOCAAACAA '"ATAATATA ^A^AAAAT^ n ^n.!\^r— <- ~ ■ 
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GGTACCGTTA ATGATTGAAG GTCGTAAGTC TGATGAACCA ATTGCTTTAA CTTATGATGA 234 0 

AACTGGTACa gATGTTAACT TTGGTGCGTT AACTGCAAAG TTATTTGATA ATTTAGAGCA 2400 

ACGTGGTGTG GGAATTCAAT ATAAGCAGAA TGTATTAGAC ATCAAGAAAC AGAAATCTGG 24 6 0 

GGTATGGCTA GTTAAAGTTA AAGATTTAGA AACTAATGAA ACGACAACAT ATGAATCTGA 252 0 

TTTTGTATTT ATTGGTGCTG GCGGTGCGAG TTTACCATTA CTCCAAAAGA CTGGGATTAA 2580 

ACAATCAAAA CATATTGGTG GTTTCCCGGT AAGTGGATTA TTCCTGCGCT GTACAAATCA 264 0 

AGAAGTGATT GATCGTCATC ATGCTAAAGT GTACGGAAAA GCAGCAGTGG GTGCGCCACC 270 0 

AATGTCAGTG CCGCACTTAG ATACACGTTT TGTAGACGGC AAGCGTTCAT TGTTATTTGG 2760 

TCCATTTGCA GGTTTCTCAC CTAAATTTTT AAAAACAGGT TCACATATGG ATTTAATTAA 2820 

ATCGGTTAAA CCAAATAATA TCGTGACGAT GTTATCTGCA GGTATCAAAG AAATGAGTCT 2880 

TACGAAGTAT TTAGTGTCAC AATTGATGTT ATCTAATGAT GAGCGTATGG ATGATTTAAG 2940 

20 

AGTCTTTTTC CCAAATGCTA AAAATGAAGA TTGGGAAGTG ATTACAGCAG GGCAACGTGT 3 000 

CCAAGTAATC AAGGATACTG AGGATTCTAA AGGTAACTTA CAATTTGGTA CTGAAGTTAT 3 060 

25 TACGTCAGAT GATGGCACAT TAGCTGCATT ACTTGGTGCA TCACCTGGTG CGTCAACAGC 3120 

TGTAGATATT ATGTTTGATG TTTTACAGAG ATGCTATCGT GATGAATTCA AAGGATGGGA 3180 

ACCAAAGATT AAAGAAATGG TGCCGTCATT TGGTTATCGC tTAACAGATC ATGAGGATTT 324 0 

30 AT AT CAT AAA ATTAATGAAG AAGTAACTAA GTATTTACAA GTTAAATAAT AAACGAAACG 3 300 

GTAATGTCTT TTTTAATGTG ATAGACATTA CCGTTTTTTA GTGGTTAATA AAAATCATTT 3360 

TAATTGTTTC AGTTGCTTGT TAATAGTGTC TACGTAGTTC TTGTTTTTAA AGAATTGAAT 342 0 

TATCCAAATT AATACATAAA CCACAATGAA GATAATTGTG AATATGATTA GATAATGCAC 3480 

TGTTAGTGGA AACCAACCGG CAAGCATTGC TAAAGGCAAG AATCCGACAT ACGTTGTTAT 354 0 

GAAATGCATT ATAGTTGCTT TAGTAATGCT CCAATCTGTG TATTTAAAGA TAAAATCTCC 36 00 

AAGGAAAAAG ACGACGCCTA TGAGTAACCA TAAAATGATA GAAATCAACA TTACGGTAGT 36 60 

TTCTGTGAAA TGCGTATAAT ACAATATGCC AATAGTTGAT TGTGGGTTCA GTGGATAATA 3720 

TTTGCCGTCT GCAAATAACA TACTAAAGAA CAGTGAAAGG GACAAACCAA TGATTAAGCT 378 0 

AATAAATAAT GAGTTTTTCA AATTTTTCAT ATTGATAAGC GCTCCTTTAT AGATTTTAAA 384 0 

TAACGTCTAG AAGAATAGGT GTAGTGTGCA TCTTTAAGAT ACATACGTAT AAGTCCATTT 3900 

50 GGCTCTAATA ATAATTTTTC AATGTAATAC TTGTTGACGA TTTCTGATTT GGAAATGCGA 3960 

ATGAAATGTT GTGGTAACTG TTTTTCTAGT TCATAAAGTC GTAATTTTAG TTTGAATTTT 4020 
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GGTTTTAGAG GTGTTAAAGA AACATTACCA GCAATTTTAG TAGTTTCAAT CACTTATACA 54 0 

CTTACTCAAG GATTATTAAC TGTATTCAGT GGACCTGAAT TAGCAGATAT TATTCCACCG 6 00 

TTATTAACAA TGTTAGCATT AGCAGTATTT TCTAAAAAAT TCCAACCAAA ACACATTTAT 660 

CGTGTTAATA AAGATGAAGA AATTGAACCT GCAAAAGCAC ATTCTGCAAA AGCAGTATTA 720 

CATGCATGGA GCCCATTCAT TGTATTAACA GTCATTGTAA TGATTTGGAG TGCGCCATTC 780 

TTTAAAAACT TATTCTTACC AAATGGTGCT TTATCATCAT TAGTATTTAA ATTCAACTTA 84 0 

CCTGGaACAA TCAGCGAAGT T A CG CAT AAA CCATTAGTAT TGACTTTAAA TATTATTGGA 90 0 

CAAACAGGTA CAGCTATTTT ATTAACTATT ATTATTACAA TTTTAATGTC TAAAAAGGTT 96 0 

AACTTTAAAG ATGCAGGTAG ATTATTCGGC GTTACATTTA AAGAGTTGTG GTTACCAGTT 102 0 

CTTACAATTT GTTTCATCTT AGCAATTTCT AAAATCACAA CTTATGGTGG TTTAAGTGCA 10 80 

GCAATGGGTC AAGGTATTGC TAAAGCAGGT AATGTCTTCC CAGTTCTATC ACCAATTTTA 1140 

GGTTGGATAG GTGTGTTTAT GACAGGATCA GTTGTAAATA ACAACTCATT ATTTGCACCA 12 00 

ATTCAAGCTT CTGTTGCACA ACAAATTGGA ACAAGTGGTT CACTTCTTGT ATCTGCTAAT 1260 

25 ACAGTTGGTG GTGTAGCGGC AAAATTGATT TCACCACAAT CAATTGCAAT TGCAACTGCA 132 0 

GCAGTAAAAC AAGTTGGTAA GGAATCAGAA TTATTAAAAA TGACATTGAA ATACAGTGTA 13 8 0 

TGTTTACTAA TATTCATCTG TATTTGGACT TTCATCTTGT CATTATTATA AAAAAACGTA 14 4 0 

30 TTTCAAAATA TAAATATACA GAAGGTGAGA TGTTTTCTAA CATCTCATCT TTTTTTTATG 150 0 

GATCATTAAT GAAAGAAGTT TGACATTATA ATAATGGTAG CGCTTTATGT TAAAATGAAT 1560 

AGTGAGTAAT CAGCAATCAA ATTAAATTGG TTGATAGCTG TTAAGGTTTG TGGTTTTGTC 1620 

TTTGTGCTAT CGCnCATAAA GTATATAATT AAAGTAGTTT CGTTATTATA AAATATTAAT 16 80 

ATACATAGTA GATAGTAATA GAGCATCACC ATGGGAACCT ATTGAGACAC TTATTGATTT 174 0 

AAAGTGGTAT TAATATGTCG TATTTCTCGA ACGTTCCATT ATTCATTTTA AAAAGGGGGA 16 00 

CTGTATTTGT TATGACAACA CAACATAGCA AAACAGATGT CATCTTAATT GGTGGCGGTA 18 60 

TTATGAGTGC aCATTAGGAA CATTACTTAA AGAATTATCA CCTGAGAAAA ATATTAAAGT 192 0 

GTTTGAAAAA TTAGCACAAC CTGGCGAAGA GAGTTCAAAT GTATGGAATA ATGCCGGTAC 19 80 

AGGGCATTCA GCACTTTGCG AGTTGAACTA TACAAAAGAA GGTAAGGATG GCACAGTTGA 2 04 0 

TTGTAGTAAA GCAATTAAGA TAAATGAGCA GTACCAAATT TCAAAACAGT TTTGGGCATA 2100 

TTTAGTTAAA ACAGGACAAT TAGATAACCC AGATCGCTTT ATTCAAGCGG TGCCACACAT 2160 
GAGTTTTGTC ATTGGCGAAG ATAATGTAGC TTTTATAAAA AGTCGTGTTG CAACGTTAAA 
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5 



15 



20 



TGTTTGTCTT 


CAAAAGCTGG 


TACAGGTTGT 


TATGTATCGA 


TTTCAGAAGA 


TAAACGATAT 


1500 


TTATTTGAAG 


CGGTATATGG 


TGCTGGCATC 


ATACGTATGT 


ATGAATTAAA 


TACGCACACA 


1560 


GGTGAAATTA 


TACGTCTAAT 


TCAAGAACTT 


GCACATGATT 


TTCCAACAGG 


TAGACATGAA 


1620 


AGACAAGATC 


ATCCACACGC 


ACATTATATT 


AATCAAACTC 


CAGATGGTAA 


GTACGTTGCA 


1680 


GTAACAGATT 


TAGGTGCTGA 


TCGTATCGTT 


ACTTATAAAT 


TTGATGACAA 


CGGGTTTGAA 


1740 


TTTTATAAAG 


AATCTTTATT 


TAAAGATAGT 


GATGGGACAA 


GACATATTGA 


ATTTCATGAT 


1800 


AATGGAAAAT 


TTGCTTATGT 


CGTACACGAA 


TTATCAAATA 


CTGTGAGTGT 


TGCAGAATAT 


1860 


AATGACGGTA 


AATTTGAAGA 


GCTCGAGCGT 


CATTTAACAA 


TTCCTGAAAA 


CTTTGATGGA 


1920 


GATACTAAAC 


TTGcAGCAGT 


GCGTTTATCT 


CATGaTCAAC 


AATTCTTATA 


TGTATCTAAT 


1980 


AGAGGGCATG 


ATAGCATTGC 


AA'ITTTTAAA 


GTTCTTGATA 


ATGGTCAACA 


CTTAGAACTA 


204U 


GTAACAaTTA 


CTGAAaGTGG 


TGGTCAATTC 


CCAAGAGATT 


TTAATATTGC 


CTCATCAGAT 


2100 


GACCyTTTAG 


TTTgTGCTCA 


kGaGCaAGGA 


GATTCAGTTG 


TAACTGTTTT 


CGAAAGAAAT 


2160 


AAAGAAACAG 


GTAAAATTAC 


GCTATGTGAT 


AACACTCGTG 


TAGCATCTGA 


AGGTGTATGT 


2220 


GTCATATTTT 


AATCTTTAAT 


TAATCATGAT 


AAAAAGAAAA 


CCATGTTTCC 


AAAAAATTTG 


2280 


TGTATACCTT 


GAAATTTATT 


GnTTTCCAGn 


ACATCAATTA 


TGGGAAGCAT 


GGnTTATTTT 


2340 


TGT 












2343 



30 (2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4837 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

40 

AAATTGCCAG TTGGTATCGC TTCTGGTGCA GTAGTCGAAG GTTTCTTCCA AGGTATCATT 6 0 

CCGATTGGCT ATATCGTTAT GATGGCAGTA TTGTTATACA AAATTACTGT TGAATCTGGA 120 
CAATTTTTAA CAATTCAAGA TAGTATTACA AATATTTCAC AAGACCAACG TATTCAAGTT 180 

45 

TTACTTATTG GATTTGCATT CAACGCATTT TTAGAAGGTG CAGCAGGATT TGGTGTACCA 24 0 

ATTGCAATTT GTGCACTTTT ATTAACACAA TTAGGATTTA ATCCATTAAA AGCTGCGATG 300 
50 TTATGTTTAG TCGCAAATGC AGCGTCTGGT GCTTTTGGTG CGATTGGTAT CCCTGTAGGT 36 0 

GTTGTAGAAA CGTTGAAATT ACCTGGAGAT GTTTCAGTAT TAGGTGTTTC TCAATCAGCA 420 

55 
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(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 157: 

AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT ACAATACTTC 6 0 

GTATTGAATG GcTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG ACTGGCACTG 12 0 

CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC TTTGAAATAC 180 

TTTTAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA ATAAAATGAA 24 0 

TTTTCTGTGT TGGGTCCCTT CTTATAATTT AATAAATACC ACTAAACTAA ATTAACGAGG 3 00 

TGCCTTATGT ATAAAAATTA TAACATGCCC CAACTACACT ACCAATAGAA ACTTCTGTTA 360 

GAATCCCTCA AAATGATATT TCACGATATG TTAATGAAAT TGTTGAAACr ATACCTGATA 4 20 

GCGAATTCGA TGAATTCAGA CATCATCGTG GCGCAACATC CTATCATCCA AAAATGATGT 4 80 

TAAAAATCAT CTTATATGCA TATACTCAAT CTGTTTAATT ATGTTCAAAG CATTAAGGTA 54 0 

ACAAGACAAT ATCTAAGATA TCAAAGATAG AAATTTTTTG ACGTTGTTGC TGATTGTAAA 6 00 

CAT AAC CATC AATTT CAT AA TTAATAGCAT CAATACGATA AATGGTTAAG CGTACTGAAT 6 60 

CTACAAAGCC ATTATTATAA AATTTAACTT CTACAGGTTG GGCATATTGT AGCGCCTCGT 7 20 

GTAGCCGAAT GTTTAGCTCA GCCAATTGAT CATCTGATAA TACAGGACGT GTAATTTTGT 78 0 

TTTGGTCGAT AATGTATTGT TGAATCGTTT CGAATTGTTC GGGTAATGTT GCAAAAGGAG 84 0 

CCCATTTAAT CATGCCTCTT CCCATAGGTA TATTGTTATC TAGTAATTCT CTTGGAACGT 900 

TACGATAATC AGTTTCTTCT TCATAACTTG TCATCCTTAA TTCACCCCAA TCTGATAATT 96 0 

ACATTATACG AACATGTGTT CTATTTTGCA ACAAAAATTT TGTGGaAGCA TAAACGCGTT 102 0 

AATAATTAAT GCTCGTGtAA GTAAAAAAGA GGGATTAATT AAAATCGAAT AATGaCATAT 108 0 

CACaGCAAAT AGTTCTTTTA AAGTAGTTAA ATAGTTTTAG CTTTAAGGAA aTGATAAaTG 114 0 

ATTGTvAATT CTAGCTAAAA TTTAATAAAA TGAAAATAAG ACTAACATGG AGGGGTAAAA 1200 

GTAATGACAA ATGGATATAT TGGTTCTTAC ACTAAAAAGA ATGGTAAAGG GATTTATCGT 12 60 

TTTGAATTAA ACGAAAATCA GTCACGTATT GATTTATTAG AAACAGGATT TGAATTAGAA 13 20 
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CATCCTCAGC TTCTTCTTTT ATTAAGTCAT TCACCTTTTT TTCGGCATTT TTTAAAGTTG 13 80 

TGTCACAAGC TGCTGATAGT TTCATACCAC GTTGATATAA ATCTAATGAT TCCTCTAAAG 144 0 

5 ATACTGTTTC ATTATCTAAT TTTTGAACAA TTTGCTCTAA TTCTTGCATC ATTTCTTCAA 1500 

AACTTTGCGT TTCTTTAGTC ATTATTACAC CTTACTTTCG TAACTTTTGC ATCTACTAAG 1560 

CCATCTTTCA TTGTTAACGT CAATTGATCA TTTTCTGTTA AATCTTTAGT ACTCGTAATG 16 20 

w 

ACTTCGTCTT TTTTATTAAC AATTGCATAT CCACGCAACA TTGTATTAGT TGGACTTAAA 16 8 0 

TTGTTTAAGT TTTCTACTTT ATTTTTCAAA TCATTTTTAT AACTTAATAT CTTAGAATTC 174 0 

AATAATTTAA CAAGTTGGTT TGTCAATTGA AGATTATnTT GTTGTTCTTG ATTAACACTA 18 00 

15 

CTTAGTAATG CTTTTAAATn ATAACGTTGG TGCAACAGCA TTAAATCGAG GCCCCGGTGG 1860 

TCCAAAGTTG CCCGAATTnG TGGTTTCAGG CCC 1893 

2Q (2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: i 
30 AAAATATATT CCTTCACTTA ATATTCAATT 
TGCAATATTT CTGGGTGTTT TAATACTTTT 
AATTTATAAT AATAACTCTA GTAAAGGGAA 

35 

TTGTTTCACA ATATTTTATT ATTTTTTATT 
GTTATTAACT TTAAATCCTC AAAATATAAT 
AGGTTT CTTT CTGTTAAACA TGTCATCGAT 

40 

AACTAACACG GTGGAAAAAT GATGGTTTAG 
TACTTGTATA TTGGACCAAT CAAAAAAATG 
TAGGACATAA ACTAGACGTT GGTATAGATG 

45 

ATGGTGTTGA AAATGTCATG GTGAAGCTTC 
GAAATGGAGC TGGTAAATGT TGCTATGCGA 
50 TAGAATTATT AAATATAATG ATGAATACTT 

TTTATTTTTT CCTTTTATTA ATTGGTTCAT 



IEQ ID NO: 156: 

AGAGAAAAAC ATGGTGATTG TAATATGTTG 6 0 

TTTATTTCTG AATCGTAAGC TAAGGTTGGA 12 0 

AATAATTTTA TTTCCTTCAT TAAAAAACTT 180 

TGGCGGTCTT TCAATAATGG CTCTAAGTAT 24 0 

AGGCTTTATT GGTTGGTTGG TAATGACTGC 3 00 

TATTGACAAA AAAATTTATG TATTATCTAA 360 

CTGGATTTAC TGCAGGTTCT ATTTCGGCAA 420 

AATTTGGAAT AAAAGATAAA AACGATTGGA 4 80 

CTGTAGAAAA ATCTGCAGAA AAAACAGTAG 54 0 

AAAAAGTATT TCTAATCATA TAAGCCCTAA 600 

ATCTAAAATC AT CAATAAAA ACC CAAAATA 660 

AATGGTCGAT ATAATAAGCA CTTGGATTAG 720 

CCCaAAAGaA TACGTCAAAA TTAGTAGAGA 780 
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TAAAGCTGTT GAATAATTTT AGTGCCTAAA CCATCAATAT TcATGGCTTG TCTTGaTACA 13 500 

AAGTGnATCa ATCCtTcAAC AAGTTGTGCT TGGTCATTTT GG 13542 
5 (2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CAGTAAACAC CTCTGATTAC GAATATTTAT ACATTTATTT TAACACATGC ACTGATTTAC 6 0 

GACTACTAAA CACCTTTACG TAAAAAGGGT AAACATGGTT TATCTATCTT GGTTATCTAT 120 

20 TTATAAATAT TTnTCATATT ACGCATAACA ATTGCTTAAA ATATGTATAA AAATGAATAT 18 0 

ATGTGTAATA AACTTGCTAA TTATTAGATT TAATAAGCGT CAATTGTTTG AACATATTtA 24 0 

ATTAAAATCA CATTGATATC ACAGATACGA ATATTGTCGT ATAGAAATTG AAAATTCTAT 3 00 

25 TTTTTAAATG AAAGTCTTCA ACATAATTTT AAGTTTCAAC ATGAGAAAAA TCGATTAACA 3 60 

AACAACGTCA GTTGAATATG CCTTTTGAGA CATTTCAAAC TTTACAATTG TTGCTAATCG 42 0 

ATATATTTGC TTTTAGTGAT CCCTGCTATA AAATAAATCA ACGATTTCTA ATAAGTGTTT 4 B0 

30 

TGTATTGAAT TGTTCATCAA TTTGCGTTAG TTCATCCACT GCTGCGTCTC TATGATAAGT 54 0 

CAATTTATCT TCTGCGCCAT CTTTCCCTAA TAAACTCACG TACGTACTTT TATTATTTTC 6 00 

AAGATCGCTG CCCACTTTTT TACCTAACTT TGCTTCATCA CCATAGCAGT CTAATAAATC 66 0 

35 

ATCTTTAATC TGGAACATCA TACCTAAATG ATAACTATAA CTTTCTAAAT GTTCTTTAGT 72 0 

TGTATCATCG ACATTAGCGA TATCTGCTGC ACTCATAACC GCAAAAGTTA ATAATGCTCC 78 0 

TGTTTTTGTT TTGTGTATCA TTTCCAAAGT TTCAAGATCA ATTGGTTGGC CTTCGCTTTG 84 0 

40 

CATATCTAAC ATTTGACCGC CGACCATTCC AACATGACCA CTTGCTATTG ACAGCCGTTG 90 0 

TAGAACTTTT ATTTTTACTT CATCAGTTAA TCTATCATCA CTTGAAATAA GTTCAAATGC 96 0 

45 TTTAGTTAAT AAAGCATCAC CTGCTAATAT CGCAGTCCAC TCACCATATA CTTTATGATT 102 0 

TGTTAATTTT CCTCGTCGAT AATCATCATT ATCCATCGCT GGTAGGTCAT CATGAATAAG 10 80 

TGAATATGTA TGAATCATTT CTAGTGCAAT TGCGCTCTTC ATACCTAACT CATACTCGGT 114 0 

SO ATTTAGTGAA TCTAAAGTGA GTAATAACAG AACTGGTCGG ATGCGTTTAC CTCCAGCATT 12 0 0 
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AATTAAAGCA CGTGGTTGGT TACCATCTTT AATACGAATT TCATAGTTAT CGATTTTATC 11700 

GAAATATTTA TTCGCTTGTT CAGTAACGTA CTGTGTAATA CCAATTGTTT CAGCTTGTCC 11760 

ATAGTAATCG ATTGGTAAAT CTACTACTAA TCGTTGTGGC TTTTTATCAA CAAATTTAAC 11820 

TTTCCCTACT GCTTGTGTGA AATTAGAAAA ATATGATTGC AAATTATCAT TAAATTGCTT 11880 

GAAATTATTA TTTAAATTTT CATCATAATC TGCTGCTGTT GAAGAAGGTA ATAAAGCTGA 11940 

TTTTTCATTG ATATTATGCC ATTCATTAAG CTTTGTTTGA CTCTTTTCTG CAGTCGCTTG 12000 

AGTGATAAAT TCACCTGGTG TGATTGAATC TTCACTTGAT TGCTTATAAA TTGCAAAATG 1206 0 

AATTGGTATA TCTTTTAAAT CATCATTTTC ACGTAACCTT GATAATATCT CACTAGCCAT 12120 

TTGTTTACCT TGCTTTTTAA CTcGCTATCA TCTAGTTTTT TACTAAAAGT CGATCCATCT 12180 

TTTTCTTTTT TATAGTAATA AACACTATTC ATAGCTAAAC CAATCGTCAT ACCTTTAATA 12240 

TTCTTACCTT TTGTATCTCC ACCACCATAA AAATCTTGCT CTAAAATGTT AGATAAATAG 1230 0 

GCTGGTGATT TTTCTGCAAT CTTTTCAGGA TCTGTTTCAC CTtCGTGTGA TGGATTAAGT 123 6 0 

CCTAAATTTT CATTCGCTTT CTTGTCTTTT TTATCTTTTT CAGACATTTT ATCGATTTCA 12420 

25 CGTTTTGTAT ACTTAGGATT TAAATAGGCA TTAATTGTTT TCTTGTCCAA AAATTGACCA 124 8 0 

TCTTGATACA AATATTTATC TGTTGGAAAT ACTTCTTTAC TTAAGTTCAA TAAACCATCT 1254 0 

TCAAAGTCGC CGCCATTATA ACTATTTGCC ATGTTATCTT GTAAAAGTCC TCTTGCCTGG 126 00 

CTTTCTTTAA ATGGTAACAA TGTACGATAG TTATCACCTT GTACATTTTT ATCCGTTGCA 126 6 0 

ATTTCTTTTA CTTGATTTGA ACTATTGTTA TGTTTTTGAT TATCTTTTCC AGCCTGGTCA 12720 

TCCTTATGGT TACCACAAGC AGCGAGTATA AAGATAGCTG TAATCAATAA TACTAATGTA 12780 

CGCTTCATCG ACATACCCCT CTAACTATTT AATTCATTTT GCTTATCTAC AAATTGTTGC 1284 0 

TCTGTCCAAA TTTCAATACC TAAACTTTGT GCTTTTGTTA ATTTTGAACC TGCATCTTCA 12900 

CCAGCAATAA CGACATCTGT ATTTTTAGTA ACGCTACTTG TAACTTTAGC ACCTTGTGAT 12 960 

GCAAGCCATT TAGATGCTTC ATTGCGTGTC ATTTGATGTA GCTTACCAGT CAGTACTATC 13020 

GTTTTACCAC TAAATTCAGG ATGTCCTTCA ATATCTGATG TTTTGATACC TTTATAAATC 13 080 

ATATTAACAT GTTTATCTTT TAATTTTTGA ATTAAAGCAC GAATATCTTC ATTTTCTAAA 13140 

TAAGTAACTA CAGATTGTGC TACTTTATCA CCTATATCAT GAATTTCTAC TAATTCCGCT 132 00 

TCAGTTACCG TTAGTAATCG ATCTATCGTT TCATATTTTT CTGCTAACAC TTGGCTCGCT 13 2 60 

50 TTAACACCTA AATGC CTAAT ACCTAGACCA AATAATAAAT TTTCTAAAGA GTTGTC CTTA 13 320 

GCTTGTTGAA TGGCAGCTAA TAAATTATCA ACTTTTTTCT GCCCCATTCT GTCTAAAGGT 13 3 80 
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TATATTGGTA TGCAAGTATT TCAAAAAGAA TAAATTTAAT TTTCCTACTT TTCTAAACAT 9 900 

TTATCTTTAT GTATAATGTT TTCAAGTAAC TAAATTATAA ATTAAATAAA GGGAGTGTTT 9 96 0 

ATCATGCTTA CAATGGGGAC AGCATTAAGT CAACAAGTAG ATGCCAATTG GCAAACTTAT 1002 0 

ATTATGATTG CCGTCTACTT CTTGATACTA ATCGTTATTG GCTTTTACGG TTACAAGCAA 10 080 

GCAACTGGTA ACCTAAGCGA GTACATGTTA GGTGGACGTA tATTGGACCG TATATTACTG 1014 0 

CATTATCAGC TGGAGCTTCA GATATGAGTG GATGGATGAT TATGGGGCTA CCTGGTTCTG 10200 

TCTATAGCAC TGGTCTATCA GCTATGTGGA TTACAATCGG TTTAACATTA GGTGCTTATA 10260 

TAAATTACTT TGTTGTTGCT CCTAGACTTC GTGTTTATAC CGAATTAGCT GGAGATGCAA 103 2 0 

TTACATTACC AGATTTCTTT AAAAATCGTT TAAACGATAA AAATAATGTG TTAAAGATTA 1038 0 

TTTCTGGATT GATTATCGTA GTATTCTTTA CATTATATAC ACATTCTGGT TTCGTATCTG 10440 

GTGGTAAACT ATTTGAAAGT GCTTTTGGAT TAGATTATCA TTTCGGTTTA ATATTAGTTG 10500 

CTTTCATTGT CATTTTCTAT ACTTTCTTTG GTGGATATTT AGCTGTATCA ATTACAGATT 10560 

TCTTCCAAGG TGTCATTATG TTAATTGCGA TGGTTATGGT CCCTATTGTT GCTATGATGA 10620 

25 ATTTAAACGG CTGGGGAACG TTTCATGATG TAGCAGCTAT GAAACCTACA AATTTAAATT 106 8 0 

TATTTAAAGG GTTATCATTT ATAGGAATTA TCTCTCTATT TTCATGGGGA TTAGGTTATT 10740 

TCGGTCAACC TCATATCATT GTAAGGTTTA TGTCTATTAA ATCACACAAG ATGCTACCTA 10 800 

AAGCTAGACG TTTAGGTATT AGCTGGATGG CTGTTGGTTT ATTAGGCGCT GTGGCTGTTG 10 86 0 

GTTTAACAGG TATTGCATTC GTACCTGCTT ATCATATTAA ACTAGAAGAT CCTGAGACAT 10920 

TATTCATCGT GATGAGTCAA GTACTCTTCC ATCCTCTTGT AGGTGGTTTC TTACTTGCTG 10 98 0 

CGATTCTAGC TGCAATTATG AGCACGATTT CTTCACAATT ACTTGTAACA TCTAGTTCAC 11040 

TAACGGAAGA CTTTTATAAA TTAATTCGTG GTGAAGAAAA AGCTAAAACG CACCAAAAAG 11100 

AATTTGTTAT GATTGGAAGA TTATCTGTAT TAGTTGTAGC AATTGTTGCC ATCGCGATTG 1116 0 

CATGGAATCC AAACGACACA ATTCTAAACT TAGTAGGTAA CGCTTGGGCC GGATTTGGTG 11220 

CATCGTTCAG TCCACTTGTG CTATTTGCAC TTTACTGGAA AGGTTTGACA CGTGCCGGTG 112 8 0 

CTGTAAGTGG AATGGTTTCA GGTGCCTTAG TCGTTATCGT TTGGATTGCA TGGATTAAAC 11340 

CATTGGCACA TATCAACGAA ATATTCGGCT TATATGAAAT TATTCCTGGA TTTATTGTAA 114 0 0 

GTGTAATCGT TACATATGTT GTAAGTAAAC TTACTAAAAA ACCTGGTGCA TTTGTTGAAA 114 6 0 

CTGACTTAAA CAAAGTTCGT GACATCGTTA GAGAAAAATA ATTCATAAGT CTTAACAAAT 11520 

""an n ^ £^": r "7\ i\ tp,^^ ?, T'-\a l AAATTA' r - A'TAArA"* 1 * C/^^A^^T^T TATTAT^TT^ - - q c n 
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GGATCATCAA TTTCTTCACC TAAATTAAAC GCaGTgTnAG GCGCTGTTGG 


ACCAACTACT 


8100 




ar , aTP 7 ATaaT 


' | * 1 "1 ' CC* TV 7A T* TV 

1 1 ILbAAlAL 


tttatpa a Afi 

1 1 1 A 1 L-MMML? 


Iv-nl 111 1MM 


TPAATGTTCT 


AACTTTTTGA 


8160 


r 
o 


IjHI 1 1 1 1 In 1 


APT A aPP ATP 


ATAfiTA A PPT 


A A PTT A ATC; 
vj/uiVw l inn 1 V3 


PAAATGTACC 


TAAGAAAATA 


8220 




CGACGTTTTA 


CTTCTTTACC 


GAAACCTTCA 


GATCTTGACA 


TTTTATATAA 


TTCTTCTAAT 


8280 




VjAA 1 VjAVjL 1 1 


Ll 1 1 Avj AA 1 


aTa tv r*r* AT A A 
M 1 nnL \— n 1 nn 


V>VJ/V\ 1 1 V- V_VJ 1. 


CAAAACGAGA 


AAGGTTTGAC 


8340 


10 


GAAijLI I LIU 


7A TV* 7\ TV f* 7A & T 


pap/^t a atat 

LALVj i AMI M 1 


f2 A TfSrt A AT A C 

V9A 1 wVJMM 1 ML- 


PAAJVTTTAGT 
\_^wi xxx nvj x 


ATTTGGCAAT 


8400 




GATACTTCCT 


LAALGALAb L 


ALL 1 AAAloA 1 


•l"T"P TV TV TV 

1 1 I AAMo 1 1 1 


PTAPAnPtTTT 

TAf^CTAATflP 
X rWJKJ x nn X V7V. 


TTGAACTGCT 


8460 
8520 


15 


T L W 1"I"1"1 ACGT 


CATCAG L 1 AL 


ALL 1 i CALL 1 


AAolAl 1L1 1 


AACTTTTAAT 


C CTTT AAT AT 


CTTTACCAAT 


1 1 LAoA lb 1 A 


A AfiTCTACAT 
MMo 1 V_ inLn 1 


P ATP AAPTGG 


TGCACTTGTA 


8580 




GAGT CATT AA 


CA I L JAjLALL 


ar* a a RTarrT 
AvjAAA 1 AkjL 1 


THT A AT A fVl A 


TTflPATTATP 

X X \J\»rt x x n x v. 


TTTTACATTT 


8640 


20 


CGAGTCAATG 


GACCAAni\j 


oiL lAAloAA 


CI A TfZC A A A Afi 
VjA i\jrv»MMMMvj 


PAAPTAATPP 
v^nnL Xnn^ x v— v— • 


AAATCGAGAT 


8700 




ACACGACCGT 


ATGTTGGTTT 


prtb rTV" ap a 
LA 1 AL LLrALA 


APn^fACAAT 
ALxj^V^MV^MM 1 


ATf^PAGPPGG 


TTGTCTAATT 


8760 




GAACCACCTG 


TGTL IXjAALL 


rp7\ T» /">«"P TV 7v TV T 

1 AAvjL 1 AAA 1 


vjVlXAUinAVJv^ 


P AfJPTYIP AA r- 


TGCTGCTGCA 


8820 


25 


GATCCACCTG 


TV T*f* TV T\ TV 

ATGAAL GAL L 


1 VjvjLAL IIjL 1 


tt a Tfvyr r* a a 

1 lAlvjVJlt-MM 


ATnnnTTAAr 

nivjovjii 1 AnL 


TGrrrrrrTG 


8880 




AAATAAGATG 


TTTCTGTTGA 


TV /T^ TV ^'/^/^R r P r P 

AlLALLLAI 1 


pp tv a arTr* A T 

urL-AMMlw 1 V^rt. 1 


PP ATA TTT A A 
\_V_nXnX X inn 


TTTACCGATT 


8940 




AAAACGGCAT 


TTTCATTATG 


TAG1 1 1 i iLL 


ATT ACAnTAfZ 
Al lALAvlMo 


ATTPfZTAAAT 


TGGCACAAAA 


9000 


30 


CCTTCTAACA 


TTTTACTTG C 


ALAlol JAjI i 


TPT A ATPPflT 
1 V- 1 AM 1 V~ v^vj l 


TTCTAATAAT 
X X w xnn xnnx 


GTTATCTTTT 


9060 




ATACCCATTG 


/"*7VTVT<1\^" , /'"' , TV7VTV 

GAAT A L LAAA 


TAA1 I 1\jLLA 


1 LiV^n 1 1 1 vs/vi 


P TTTTrt PTTfZ 

Ll 1 1 IwLX X w 


TAATTCATCC 


9120 




AATT CTTGCG 


CTTTTTTGAT 


1 VjLA 1 1 1 1 L 1 


TT A TP PAH PR 


PTAGAAAAGA 


CTTAATTGTT 


9180 


35 


GGATCAb 1 V- 1 


L I 1 LAA1 1L>L 


afp jv T 71TZV T 2L 
Al LA1M1M1M 


TPTTTAAPAA. 

ILIA 


PAT PAGATGG 

X ^wXVa*n^ X VJVJ 


TTTGATTTTT 


9240 




TTGTCTTTTA 




Inhnl 1 Vw 1 La 


APPGATTCGT 

AvV»wAi iV»vi 


AGCGAATGCT 


CATCTTACGC 


9300 


40 


G T CCTCCTCA 


1 1 LA 1 IxA 1 IVj 


t a fznc & f l " I u ! 1 

1 nvJVV^nVr 111 


AAATTGTCCA 


TCTTCTGTTT 


CTTTGGCATT 


9360 


i II LAAAOL 1 


AMI 1 L 1 1\j 1\j 


n 71 a t* a rr'TT 1 t* 

onniALv.1 i i- 


AJVTTGP/TTTA 


TCTTCACGTA 


AAACGTTTTG 


9420 




TV TV 7\ rT>/"" 1 T 1 TV TV A 

T AAA I L I AAA 


AHj IvjAlAlU 


T A rt^TT P A A f* 


("3PPTTPTGTA 


TCAGCGCTAT 


CATTTTGTTT 


9480 


45 


TGCAAAATCT 


TV TV TV TV TV*' i'*"T v T kr P 

MAMA 1\tL 111 


f^^*a a'pr^'pnTT* 

Ll AAlolvjl 1 


f^T^PP ATTTPT 


TPPGTTTPTT 


CAGGAGAAAT 


9540 




TTGAAGTCTT 


GCAAGATTCG 


CGATATGCTC 


AACTTCTTCA 


CGTGTTACTT 


TTGTCATTAA 


9600 




TAAAAGCCTC 


CTTTAAGTCA 


TTCATCACTA 


AATTGTATCA 


AATTTCCaAT 


TAAAAATCTA 


9660 


50 


AGTATTTATG 


AGGTGCTACT 


TTAATTTCAT 


ATAAACTGTA 


TAAACATTAT 


CATTCGTTTA 


9720 




TCAAATCATT 


TTTTATGAAA 


ACAACACTCT 


TTTAATATTA 


GACAACCCAA 


TTCAATATTA 


9780 
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AACGCATTAA TAAAATTAAT ATTTTTACCA TTAACATGTA CAATGAATAA AGTTAAAAGT 6 3 00 

AATTTGACTT CTATAGATAT AAATAAACCC TCGATTGCAT CTAAGTCAGC AATCAAGGGT 6 3 60 

TTATTTTTTA AATCTTCATA GTTTGATGAT TTAAATTATC TTTTATCTAA TTCTTGTTTT 64 20 

AATAGTTGAT TTACTAATTG TGGATTAGCT TGACCTTTAG ACGCTTTCAT AATTTGACCA 64 80 

ACTAAGAAGC CCATAGCTTT GCCTTTACCA TTTTTGTAAT CTTCAACTGA TTGTTCGTTA 6 54 0 

TTGTCTAATG CTTCATTTAC AAATTTTAGA AGTGTTGCTT CATCAGAAAT TTGAACTAAG 6 600 

CCATTATCTT CCATAATCTG TTTAGCATTA CCACCTTTAG CTGCTAACTC TGGGAAGACT 6 66 0 

TTCTTCGCAA TTTTACTGCT CATTGTTCCG TCTTCGATAA GTTTAATCAT ACCTGCTAAA 6720 

TTTTCTGGTG TTAATTTAGT ATCTAATAAT TCTACTTGAT TTTTATTTAA ATATTCGTTT 6 7 80 

ACGCCACCCA TTAACCAGTT AGATGTTAAT TTAACATCTG CACCGTGTTC AATTGTTGAT 6 840 

20 TCAAAGAAAT CTGACATTTC TTTAGTCAAT GTTAATACGT GTGCATCGTA TGCAGGTAAA 6 900 

CCTAATTCAT TTACATACTT AGCTTTACGT TCATCTGGTA ATTCAGGAAT TGTCTGACGA 6 960 

ACACGCTCTT TCCAAGCATC ATCAATATAT AAAGGTACAA TGTCAGGCTC TGGGAAGTAA 7 020 

25 CGGTAATCAT CAGAACCTTC TTTAACACGC ATTAAAATTG TTTTACCTGT AG ATT CAT CA 708 0 

AATCGACGTG TTTCTTGTCC GATTTCTCCA CCATTTAACA ATTCTTCTTC TTGGCGTTTT 714 0 

TCTTCATATT CTAAACCTTT ACGTACATAG TTAAATGAGT TTAAGTTTTT CAATTCGGCT 7 2 00 

TTAGTACCAA ATTTTTCTTG ACCATATGGA CGTAAAGAGA TGTTAGCATC ACAACGTAAA 726 0 

GATCCCTCTT CCATCTTAAC GTCTGATACA CCAGTGTATT GAATAATTGA ACGCAATTTT 73 20 

TCTAAATATG CATATGCTTC TTTAGGTGAA CGAATATCTG GTTCAGATAC GATTTCAATT 73 8 0 

AGCGGTGTAC CTTGACGGTT CAAGTCAACT AATGAATACT CACCTTTATG TGTTGACTTA 74 4 0 

CCAGCATCTT CTTCCATGTG AAGACGAGTA ATACCGATTC GTTTTGTTTC ACCGTCGACT 7 500 

TCGATATCGA TATATCCATT TTCACCAATT GGTTGATCAA ATTGAGAAAT TTGATATGCT 7 56 0 

TTTGGATTAT CTGGATAGAA AT AG TTCTT A CGGTCAAACT TAGATTCTGT TGCGATTTCC 762 0 

ATATTTAGTG CCATTGCAGC ACGCATTGCC CAGTCTACTG CACGCTTATT AACAACTGGT 76 8 0 

AAGACACCTG GATATGCTAA GTCGATAACA TTTGTATTTG AGTTAGGTTC TGCTCCAAAA 7 74 0 

TGCGCTGGTG ATGGAGAAAA CATTTTTGAG TCCGTTTTTA ACTCTACGTG AACTTCAAGT 7 800 

CCTATAACTG TTTCAAAATG CATGATTTCC ACTCCTTATA ATTTTTCATA AACGTCATGT 7 86 0 

AAATTGTATT GTGTTTCATA TTGATAAGCG ACACGATATA ACGTTTTTTC ATCGAATGGT 7 92C 

........ . ................. - • — - - — » ~w„ ~~ T . A ^ a a r * t. ^; r. r. * 



30 



35 



40 



10 



15 



20 



25 



EP0 786 519 A2 

AGTTCTTTTT TACTTTGATA ATTTTATATT CAATTTGTTC ATTAATTAAA GCTTGTGGTA 4500 

TGAAAATAGG AAAGCGATCT ATTTTTACGA CACCATGGCC TTCATGCGTT AAATCAACAA 4 560 

CTGTTCCCGT TTTTATGTCA TTTTTAGCTA TTGCTTGCAA AATTTTACCT CCAAAATGAA 4 620 

CAGGTTAGGA ACAAAATTAT GCGCTTCCTA ACCTGCCATT ATATATTTCA CTATTTCTGT 46 8 0 

TTATTCTTCG ATTAAATTGT CATCAACATG ATCATTATTT ATTAACTCTT CATTTACAAT 4 74 0 

ATCATTAGGT GCAAAGACAT CTATATGACG TTCTAGATTT AAGAAATTCG CTGGTAATTT 4 BOO 

ACCACCATAT TCTCCATCTA CATTTAGTTG TAAGTCTGTG AATGATGAAA TATTAATTGC 4 860 

CTTTGCTTTT TCATAAATAA CTTTAGGATG CTTAGTATGT TCTCCTCTTG AAGCTAAAGT 4 920 

CATAATATGA CCAAGTTCTG CAAGGTTTGA TTTTTCAACT ATAATTAACG TAAAATAGCC 4 980 

GTCATCTAAC TTAGCGTCCG GCACTAATTT TTCAAATCCT GCCATTGAAT TTGTTAAACC 504 0 

TAAAAAGAAT AATAATGCTT CTCCTTGGAA AACATTACCA TCATATTCAA TTCTTAAATC 510 0 

TACAGCTTTC ATTTGAGGTA ACATTTCGAA ACCTTTGaTG TAATAAGCAA ATGGACCAAC 5160 

AATAGATTTC AATTTACTCG GTGTTTCATA AGAGACTTGC GTCAATTGTC CGCCTGCAGC 5220 

TAAATTAATA AAGTATCGAT TATTCATTTT ACCAATATCT ACTTTAGTAG AATGACCTTC 52 80 

AATGATGACA TCAAGTGCCC CCATGATGTC ATTAGGTATA TGCAATGCAC GTCCAAAGTC 534 0 

ATTAACAGTA CCCATAGGAA TGACACCTAG CTTAGGACGA TTAGGCTTTT CTGCGATACC 54 00 

30 ATTAACTACT TCATTTAATG TTCCATCACC ACCTGCAGCG ATTAATACAT CATAATTTTC 54 6 0 

ATGCATAGCT CTTTCTGCTT CAAGTGTGGC ATCACCTATT TTCTCGGTTG CATATGCACT 5520 

CGTTTCATAT CCCGCTTT7T CTAATTTTAT TAAGGCATCA GGTAATTCTC TTTTAAATAG 5580 

CTCTTTACCT GATGTCGGGT TATAAATGAT TCTAGCACGT TTCCTCATAT CTTATCCCTC 564 0 

TACTTAAAAT TCATATATTT TAACTTCATC TTTGTTTCGT CTAATAGGGA GTGGGACAGA 57 00 

AATAATATTT AACAAAATTT ATTTCGTTCT ACCCCAACTT GCATTGTCTG TAGAATTTCC 576 0 

TTTCGAAATT CTCTATGTTG GGGCCCCACC CCAACTTGCA CATTATTGtA AGcTGACAGA 582 0 

AAGTCAGCTT CTTTGTTTGG GGGCCCCGCC AACTTGCACA TTATTGTAAG CTGACAGAAA 58 80 

ATCAGCTTCT ATGTT GGGGC CCCACTAGAA TTGAAAAAAG CTTGTTACAA GCGTATTTTC 594 0 

TTTCAGTCAA CTACAGCCAA TATAACATTG TAGTGCCTAG GACATTGAAT TTATGACCCA 6000 

GGCTCAGTCT TATTTCATCA TTCTTAATAT CGTTAAAGAC CAACTTGTAT CTTAAACAAA 60 60 

50 TACTATCTCA ATATGTACAA AGCTTGTTAT TTATTCAGCA TTTTTTGCCG TTCTTCATTA 6120 

TAtAGcTTCG TCAGTTATGC TATTTTACCT TTAAAATGAT GTTGTAAATA TAATGTTGTC 6180 
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AAAGCAATAA GCGGTATGCA TACT AAA CAT AAAAATAAGT GATGAATAAC CAAATACCTT 2700 

AATTAAAATA AGCAAGCCAG TACTTAATAG GATTAGTGGT GACAGCATAA TAATTGAGAA 276 0 

5 

TTGCCATTTG TTGAAGCAAG CATCTGCTGT TTGGAATAAG ATTCTGTCTT TTTTTATATT 2 82 0 

AAACATAGGT TTGCTATCTT TTTTAAATAA AAGAAATAAT GCTCTATGGA TAAGTTCATG 28 8 0 

TAAAATCAAT AAAATAATGA ATCCAGCAAA CCCATATACA AGATTGATGA TGATATTTTG 2 94 0 

70 

ATCGACAACC GCTGTGACAC CTAACGCCCA CTTATACGTA AATAAAATCA CGAATAACGC 3 00 0 

AATAACAAGT TGCAAGATAA TAAACCTTCG CATTTGAAAA TTATTTGTCG TTAAATCAAT 3060 

15 TTTATGCATT ACCAACCCTC CCGATCATGA CATTCTTATT CTTCTTTAAA TATAGTATAC 3120 

AATGTCACAT TTAATTTAAA AAGTTCATAT CAAGAAAGTA AATTGGCTGT AATAAAATTT 3180 

TAATATACGA CTTCTTTCTT CACTTATTAA GGCGAAATTT TATCtCAAAT CATGTGCGCT 3240 

20 ATTTCAAATT GAATAATGCC ACTGTCTCAA CATGTGTTGT TTGTGGAAAC ATATCTACCG 33 00 

GTGTTACCTC TTCAAGTTGA TATTTTTCAG CTAATAATAA TGCATCACGT TGCTGTGTTG 33 60 

CGGGATTACA TGAAATATAG ACAATACGCT TAGGTTCTAA TGTAAGCAAA GTCTGAATAA 34 2 0 

25 ACGTTTCGTC ACAGCCCTTT CTTGGCGGAT CAACCATTAC AACATCTGGT TTAATCCCTT 34 8 0 

GTGCTTTCCA TTGTAAAATA ACTTCTTCAG CTTTCCCACA GACAAAAGTT GTATTATTGC 3540 

ATTGGTTTAT AGTCGCATTT TGTTGTGCGT CTTCAATTGC AGAAGGTACT ACTTCAACAC 3 6 00 

30 

CGTATACATG TTTTGCAAGT GGTGCCATAT ATAGCCCTAT TGTTCCAATA CCACAATAGG 3 660 

TATCTAATAC AACTTCATTA CCTGTCAATT GCGCATACTC AATTGCTTTA TTATATAATT 3 72 0 

TCTCTGTTTG TTCAGAATTA ATTTGGTAGA ATGACTGATC ACTTATTTTA AATGTACTAT 3 78 0 

35 

CTGTTAATTG ATCAATAATT GTATCTTTAC CATATAGCGT TATAGATTGA CGTCCCATAA 3 84 0 

TAACXTTAGA GTGGCTATCA TTAATGTTTT GTTTAATGCT TGTCACATTA GGAAATGCAT 3 900 

CTAATATCTT CTCAACAACA GCATTTTTTT GTGGCCACTT TTTACCATTA GTTACAAAAA 3 96 0 

40 

TAATCATCAT TTCGTCTGTA TGATATCCTG TTCTTACAAC CAAATGTCTC ATTAAACCTT 4 02 0 

TTTTCAATTG TTCTTGATAA ATACTTACAT TTAAATCTTT TAAAATAGAT TTAACTTCAT 4 06 0 

as TCATCACTTC TTGATGTTGT GAATCTTGTA TTAAACAACT TTCCATGTCA ATAATGTCAT 414 0 

GGCTTCTTTG ACGATAAAAG CCCATAATAA CTTCATTCTG TTCATTCTTA CCAACTGGAA 4 20 0 

TCTGGGACTT GTTTCGATAT CTCCAAGGAT CTGTCATGCC AACTGTATCG TTAATCTTAG 4 26 0 

50 AATTATCAAA ATGCGCTTTT CGCTGAAACA AATTAATCAC TTGTTCCTTT TTCATTTCAA 4 32 0 
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aGATTTACCT GTTGTCGCAC ATAACGCGGC 
TCAAAATATT GGTTTACCAA CTCCAAATTT 
5 AAGAACCGTT GATTCGTATC GATACGGTTT 

TTTTCATGGT CATCATGATG CATTGAATGA 
GCTACTGAAA AATTATGAAA ATTTAACATA 

10 

AGATAAAGGC TAGGACTAAA TAAAATACTC 
ACTTTGCAGG GAGCTTTATT TTATATAAAG 
TGTTCTAAAT TACCTACAGT GACACCAATA 
TCGTTATAAA GTAAATATGC AATATTATAA 
AAACTCATCT GTTTAGATAG CGTTTCAAAT 

2Q TTAGCTGACT TCTGTAATTT ATTTAGACGT 

TTTTCTTAAA ATCTCTTCAT CATCATTCAC 
TGATTTTCTT ACTCTTGATG ATTTCACTTC 

25 TAAACCCCGA CCTCTTTTTC CAAACAAACG 

ATCTCTACCG TTAAAAATAC CATTATCATG 
TGGaAAATCT CCAATATCCA ATGTCATCAA 

30 AGTCATACCA TCAGGTTTAT TCATACCACT 

AACACCTGCA GATGCTGTTA AATGTGTCTG 
AATTTTCGAA GCAGGAAGGT CTGGTCTCAC 

35 

CAATGACATC GGTTCTACCT TATCTGTATA 
TGTTTCTCGG TAAGCACCAA AATTACTTGT 
CGCTTGTGAC ATAGGCATTG CTGAATGGAC 

40 

AGAGACAACA CCCCTACTGC TTGCTTTACC 
GGGGTTATCT CTCATTTCGA CTTGTGCAAA 

45 TCTCTCAGTC AAGTGCTCAC CTCCCTACTA 

TCAACATAAT TATACGCTGT GTACGATTTT 
ATATTGaGAT AACTACAAGT TGTTATaAAA 

50 CGATTGAGTA AGAAAACATA GGTAATCTGA 

TCATCATATT AAAATTTATT ATTGAGTCGG 



ATTTGATATG AACGTCTTAC ATCAAAGCAT 900 
AACTTACTTT TGTAGTTATC AACTTGCTAA 960 

AAAACATATG ATGGAGTTTT ATCAATTAGA 1020 

TGCCAAAGCA TGCGCAATGA TTACTTTTAG 108 0 

TGTAACTAAT ATTTATGGTA AAAATCTAAA 114 0 

CCTTCAAAAG TAAGCATTGT AAAAATGTAA 120 0 

TCATATATCG TCATATTTTT ATAAGTTGAT 12 60 

AGTCGAATTG GTACATCAGG GTCTTTTAAA 1320 

ATATCTTCTT CAGAACTAAC CGAATCTCTT 13 80 

TGATAAGTTT TAATTTTAAC CGTTACAGTT 1440 

TCAGCTGTTT TACCTGnACA ATT CC CAT AC 1500 

GTCTGTTGCA AATGTGCGTT CAGTCCCTAC 1560 

ACTATGGTCA ATACCGCGTG CCTTGTTATA 1620 

TATTAATTCA AATTCCGTTT TCTCATATAA 16 80 

CATTACTTTT TTGGAAGCTT TACCTACGCC 1740 

AATATCATGG aCATTTTGAT AATCAATCAC 1800 

CGCTAATTTA GCTAAAAATT TGTTATAAGA 186 0 

CTCTAGAATA TCTTTTCTAA TATACTGAGC 1920 

TAATTCTGTA ATATCTAAAT ACGCTTCATC 1980 

ACTTCGGAAA ATAGACATAA TCTGCGCAGA 2040 

GACAAAGTAT CCATTTGGAC ATAATTTATG 2100 

GCCGTATTTT CGTGCTTCAT AGGATGCCGT 2160 

ACCAACAATG ACTGGTTTCC CTTTCAATTT 2220 

AAAATAGTCC ATATCTATAT GAATAATTCG 2280 

ATTTTTACTT TTATAACGCA CAAAAATATC 2340 

TTTACATAAA TCTTGCACTT AGCGATAACT 24 00 

TCAATTGCTA TTTAAGCATG ATGATGAAGA 24 60 

AATAATTCAA GCAAATTCAT TTTGTTGGTA 2520 

CTTTTGATGA TACAAATAAA TACTATCTTC 2 580 
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AAAAATAAGA ATTAATTATT TATATGTAAA CGGTTTCTAC CTCTATTTTA AATGAAATTT i86 0 

GTGACAAAAA AAGGTATAAT ATATTAATGA CATACAAAGA AATGGAGTGA TTATTTTGGT 192 0 

5 

TCAAGAAGTT GTAGTAGAAG GAGACATTAA TTTAGGTCAA TTTCTAAAAA CAGAAGGGAT 198 0 

TATTGAATCT GGTGGTCAAG CAAAATGGTT CTTGCAAGAC GTTGAAGTAT TAATTAATGG 2 04 0 

AGTGCGTGAA ACACGTCGCG GTAAAAAGTT AGAACATCAA GATCGTATAG ATATCCCAGA 2100 

10 

ATTACCTGAA GATGCTGGTT CTTTCTTAAT CATTCATCAA GGTGAACAAT GAAGTTAAAT 2160 

ACACTCCAAT TAGAAAATTA TCGTAACTAT GATGAGGTTA CGTTGAAATG TCATCCTGAC 22 2 0 

15 GTGAATATCC TCATTGGAGA AAATGCACAA GGGAAAGACA AATTTACTTG GAATCAATTT 22 8 0 

ATACCTTAGC TTTAGCAAAA AGTCATAGAA CGAGTAATGG ATAAGGGACT CCATACCGTT 234 0 

TTAATGC 234 7 

20 (2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACAAGACGTn TCTATAACTT ATCTGAAATC GCTCGTCAAG ATAAAGATTA TGCAACTATC 60 

TCATTCTTAA ACTGGTTCTT AGATGAACAA GTCGAAGAAG AATCAATGTT TGAAACTCAC 120 

ATCAATTATT TAACTCGTAT CGGCGATGAC AGCAATGCAT TATATCTTTA CGAAAAAGAA 1B0 

CTTGGCGCTC GTACATTCGA CGAAGAATAA TTAAACATCA CTACAATAGA CAGATAAATA 24 0 

TCATACGACA TGATAGGCAT TTGGGTCACT TACAATAACC CAATGTCTAT ATTATTTTGC 3 00 

TTTACGGAGA TCACTAGATT CATTTTCTGA ATCATTGATC TGCGTTTTTT CATTTTCAAG 3 60 

GCTAATTATT GTATTTTTAG TCATTTATTT TTTAAACTAC TAATGTTAAT AACTCTAAAT 4 20 

TTGATGTTGA ATTAATTTGA CGATTTTAAA G CAT AT CATC ATTTACTTTT TAATCAGAGT 480 

TACATCCAAA TGATAGATTT CACGTTATAC CTTCACGTAT AATATTATGT ATCGTTTGTA 54 0 

AGCAAATGAC TAAAAGTCTA TTAATATATA CATTTAATTA ATTGAAAGGA TTGACTACAT 60 0 

GATACAAGAT GCGTTTGTTG CACTTGATTT TGAAACAGCA AATGGTAAAC GTACAAGTAT 66 0 

TTGTTCTGTC GGAATGGTTA AAGTCATTGA TAGTCAAATA ACAGAAACAT TTCATACTCT 72 0 
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GGCGTGATCA TACGACCGTC ATTCATGCTC ATGAAAAAAT ATCTAAAGAT TTAAAAGAAG 60 

ATCCTATTTT TAAACAAGAA GTAGAGAATC TTGAAAAAGA AATAAGAAAT GTATAAGTAG 120 

5 GAAACTTTGG GAAATGTAAT CTGTTATATA ACAGCACTAA TGATnACAAT CATTTTTTAC 180 

ATTTCTATAT GCTAATGTGG CAAGATGAGC AAAACTCATT TTGTGGATaA TGTTTaAAAG 24 0 

TCATACACAC CATACACAAG TTATCAACAT GTGTATAAyT cGcCAAATCT ATGTTTTTAA 3 00 

10 

GACTTATCCA CCAATCCACA GCACCTACTA CTATTACTAA GAACTTAAAA CCTATATAAT 3 60 

TATATATAAA CGACTGGAAG GAGTTTTAAT TAATGATGGA ATTcACTATT AAAAGAGATT 420 

ATTTTATTAC ACAATTaAAT GACACATTAA AAGCTATTTC ACCAAGaACA ACATTACCTA 4 80 

15 

TATTAACTGG TATCAAAATC GATGCGAAAG AACATGAAGT TATATTaACT GGTTCAGACT 54 0 

CTGAAATTTC AATAGAAATC ACTATTCCTA AAACTGTAGA TGGCGAAGAT ATTGTCAATA 600 

2 0 TTTCAGAAAC AGGCTCAGTA GTACTTCCTG GACGATTCTT TGTTGATATT ATAAAAAAAT 660 

TACCTGGTAA AGATGTTAAA TTATCTACAA ATGAACAATT CCAGACATTA ATTACATCAG 720 

GTCATTCTGA ATTTAATTTA AGTGGCTTAG ATCCAGATCA ATATCCTTTA TTAC CTCAAG 780 

25 TTTCTAGAGA TGACGCAATT CAATTGTCGG TAAAAGTGCT TAAAAACGTG ATTGCACAAA 840 

CAAATTTTGC AGTGTCCAcC TCAGAAACAC GCCCAGTACT AACTGGTGTG AACTGGCTTA 900 

TACAAGAAAA TGAATTAATA TGCACAGCGA CTGACTCACA CCGCTTGGCT GTAAGAAAGT 960 

30 TGCAGTTAGA AGATGTTTCT GAAAACAAAA ATGTCATCAT TCCAGGTAAG GCTTTAGCTG 1020 

AATTAAATAA AATTATGTCT GACAATGAAG AAGACATTGA TATCTTCTTT GCTTCAAACC 1080 

AAGTTTTATT TAAAGTTGGA AATGTGAACT TTATTTCTCG ATTATTAGAA GGACATTATC 1140 

35 

CTGATACAAC ACGTTTATTC CCTGAAAACT ATGAAATTAA ATTAAGTATA GACAATGGGG 1200 

AGTTTTATCA TGCGATTGAT CGTGCCTCTT TATTAGCGCG TGAAGGTGGT AATAACGTTA 1260 

TTAAATTAAG TACAGGTGAT GACGTTGTTG AATTGTCTTC TACATCACCA GAAATTGGTA 1320 

40 

CTGTAAAAGA AGAAGTTGAT GCAAACGATG TTGAAGGTGG TAGCCTGAAA ATTTCATTCA 13 80 

ACTCTAAATA TATGATGGAT GCTTTAAAAG CAATCGATAA TGATGAGGTT GAAGTTGAAT 1440 

45 TCTTCGGTAC AATGAAACCA TTTATTCTAA AACCAAAAGG TGACGACTCG GTAACGCAAT 1500 

TAATTTTACC AATCAGAACT TACTAAAAAT AAATATAAAT AAAGGATGAC GTGATTAATT 1560 

AAAACGTCAT CCTTTATTTT TTGGCAAAAA TAATTCTAGG TGCGTATGTA AAATAAATTT 1620 

50 GG CAGCATTT TAAACAGCAA ATAAAAGACG CCAATTAAAT TTATGACAAA TGTATCCAAA 1680 

ATTTAATAAG TGTGCTTATA TGCCCTTTAA ATTTAAAATT TTAATAGTCA ATAACAAGTT 174 0 
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AATTGTG C CT 


GCTAAAGCAG 


AGGAAAAACA 


AATTTTTAAT 


ACTTTCGAAA 


AAAGTAGTGG 


6600 


CTATAATAGT 


TACAAAGCAG 


TTCAAGATGT 


AAAAACTCAC 


TCTGAAGAAC 


AAAGAGTAAC 


6660 


AGCTAAAnAA 


TAATTCGTTC 


GAAATTAACA 


CAATTTAATA 


GG AATTTTTC 


TTTAAAACTA 


6720 


TTGCTAATAA 


AGCTATATTT 


TGAT AC CTTT 


ATCAAGTGTT 


AAACAAAATG 


TTTGATAAAA 


6780 


OlAAACTTAA 


TATAGCTTTT 


TTAGGTGGAA 


AAATAAATGA 


ACATAGGTAA 


TAAAATTAAA 


6840 


AATCTTAGAA 


GAATTAAAAA 


TTTAACGCAA 


GAAGAACTTG 


CTGAACGTAC 


AGACTTATCG 


6900 


AAAGGCTACA 


TTTCACAAAT 


AGAAAGTGAA 


CATGCCTCAC 


CAAGTATGGA 


AACTTTCTTA 


6960 


AATATTATAG 


AGGTGTTAGG 


AACGACGCCA 


AGTGAATTTT 


TTAAAGACAG 


TGAAAATGAA 


7020 


AAAGTATTAT 


ACAAGAAGGA 


AGAACAAGTT 


ATTTATGATG 


AGTATGATGA 


AGGTTATATA 


7080 


TTAAATTGGT 


TAGTTTCAAA 


GTCAAATGAA 


TATGATATGG 


AGCCATTAAT 


ATTAACTTTA 


7140 


AAGCCTGGAG 


CATCATATAA 


AAATTTTAAT 


CCATCAGAGT 


CTGATACGTT 


TATTTATTGT 


7200 


ATGTCAGGTC 


AGATAACACT 


TAATTTAGGC 


AAAGAGATAT 


ATCAAGCACA 


AGAAGAAGAC 


7260 


GTTTTGTATT 


TTAAAGCACG 


AGATAATCAT 


CGTTTGTCAA 


ACGAATCAAA 


CAATGAAACA 


7320 


CGAATACTTA 


TTGTAGCGAC 


AGCTTCATAT 


TTATAGGGGG 


GATCTTATTT 


GGAACCGTTA 


7380 


TTATCATTAA 


AATCAGTTAG 


TAAAAGCTAT 


GATGATCTTA 


ATATCTTAGA 


TGACATAGAT 


7440 


ATTGATATTG 


AATCAGGATA 


CTTTTATACA 


TTATTAGGTC 


CTTCAGGTTG 


TGGTAAAACA 


7500 


ACAATTTTAA 


AATTAATTGC 


AGGGTTTGAA 


TATCCTGACA 


GTGGTGAAGT 


GATTTATCAA 


7560 


AACAAACCAA 


TTGGTAATTT 


ACCACCAAAT 


AAACGTAAAG 


TGAATACAGT 


CTTTCAAGAT 


7620 


TATGCATTAT 


TTCCACACTT 


AAACGTCTAT 


GATAATATCG 


CTTTTGGTTT 


GAAATTAAAA 


7680 


AAATTATCAA 


AAACCGAAAT 


TGATCAAAAA 


GTAACTGAGG 


CATTAAAATT 


AGTAAAACTT 


7740 


TCAGGTTATG 


AAAAAAGAAA 


TATTAATGAA 


ATGAGTGGCG 


GACAAAAGCA 


ACGTGTTGCA 


7800 


ATTGCACGTG 


CTATCGTAAA 


TGAACCAGAA 


ATATTATTGT 


TAGATGAATC 


TTTATCCGCA 


7860 


TTAGATTTGA 


AATTGCGTAC 


TGAAATGCAA 


TATGAATTAC 


GAGAATTGCa 


AT CT AG ATT A 


7920 


GGtATTACAT 


TTATATTTGT 


aACACATGAT 


CCA 






7953 


(2) INFORMATION FOR SEQ ID NO: 153: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
50 (D> TOPOLOGY: linear 
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ATGGTGCAAC TGGCCAAAAT GCAATGAATC ACATTAAACG TTTATTAAAT AATCCAGAAT 48 00 

TATTATTAAT GGAGGGGTAA AACATGGTAG TTGGAGATTT CCCAATTGAA ACAGATACTA 4 860 

TAGTAATCGG AGCAGGTCCT GGTGGATACG TTGCAGCAAT TCGTGCAGCT CAATTAGGAC 4 92 0 

AAAAAGTAAC AATCGTTGAG AAAGGTAATC TTGGTGGTGT TTGCTTAAAC GTAGGATGTA 4980 

TTCCTTCAAA AGCATTACTA CATGCTTCTC ACCGTTTTGT TGAAGCACAA CATTCTGAAA 504 0 

ACTTAGGTGT TATTGCTGAA AGTGTTTCTT TAAACTTCCA AAAAGTTCAA GAATTCAAAT 5100 

CATCAGTTGT TAATAAATTA ACTGGTGGTG TTGAAAGCTT ACTTAAAGGT AACAAAGTTA 516 0 

ACATCGTTAA AGGTGAAGCA TATTTCGTAG ATAACAATAG CTTACGTGTT ATGGACGAAA 5220 

AGAGCGCACA AACATACAAC TTTAAAAATG CAATCATTGC AACAGGTTCA AGACCAATTG 5280 

AAATTCCTAA TTTCAAATTC GGTAAACGTG TTATCGACTC AACAGGTGCT TTAAACTTAC 534 0 

AAGAAGTACC aGGTAAATTA GTTGTAGTTG GTGGAGGATA CATTGGATCA GAATTAGGTA 54 0 0 

CAGCATTTGC TAACTTTGGT TCAGAAGTAA CCATCCTTGA AGGTGCTAAA GATATCTTAG 5460 

GTGGCTTCGA AAAACAAATG ACACAACCTG TTAAAAAAGG TATGAAAGAA AAAGGTGTTG 552 0 

25 AAATCGTTAC TGAAGCTATG GCTAAATCAG CTGAAGAAAC AGATAACGGA GTTAAAGTTA 558 0 

CTTATGAAGC TAAAGGCGAA GAGAAAACAA TCGAAGCTGA TTATGTATTA GTAACTGTAG 564 0 

GTCGTCGTCC AAACACAGAC GAATTAGGCC TAGAAGAATT AGGTGTTAAA TTCGCTGACC 5700 

GTGGATTATT AGAAGTTGAT AAACAAAGCC GTACGTCTAT CAGCAATATC TATGCAATTG 576 0 

GTGATATCGT TCCAGGTTTA CCACTTGCTC ACAAAGCTAG CTATGAAGCT AAAGTTGCTG 582 0 

CTGAAGCAAT TGATGGTCAA GCTGCTGAAG TTGATTACAT TGGTATGCCA GCAGTATGCT 588 0 

TTACTGAACC AGAATTAGCT ACAGTTGGTT ATTCAGAAGC G CAAG CT AAA GAAGAAGGTT 594 0 

TAGCAATTAA AGCTTCTAAA TTCCCATATG CAGCAAATGG TCGTGCATTA TCATTAGATG 6000 

ATACTAACGG ATTTGTTAAA CTTATTACAC TTAAAGAAGA TGATACTTTA ATCGGTGCTC 606 0 

AAGTAGTTGG TACTGGTGCA TCAGATATTA TCTCTGAATT AGGTTTAGCA ATTGAAGCTG 6120 

GTATGAATGC TGAAGATATC GCATTAACAA TCCATGCACA TCCAACATTA GGTGAGATGA 6180 

CTATGGAAGC AGCAGAAAAA GCTATCGGAT ACCCAATCCA TACAATGTAA TAACTGATTA 624 0 

TCTATAAAGA TTCAGTCATT AAAAGCTGTA GCATATGCTA CGGCTTTTTT GTTTTAGGTA 6300 

AAGTAATGTA AGGAAATTGA TTTGAGATAT CGTTAACATG TGACATGCAT GTTATACTAG 636 0 

CGATGCTAAT AAAAGAATTG AAATGGAGGG TTCAACAATG GAATATGAGT ATC CAATTGA 642 0 

TTTAGACTGG AGTAATGAAG AGATGATTTC AGTGATAAAT TTCTTTAATC ATGTAGAGAA 64 80 
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CTATTAGAAG TAATGACCCA GTCGTATACT TAG AG CAT AT GAAATTGTAT CGTTCATTCC 3 000 

GTGAAGAAGT ACCTGAAGAA GAATATACAA TTGACATTGG TAAGGCTAAT GTGAAAAAAG 3 06 0 

AAGGTAATGA CATTTCAATC ATCACATACG GTGCAATGGT TCAAGAATCA ATGAAAGCTG 312 0 

CAGAAGAACT TGAAAAAGAT GGTTATTCTG TTGAAGTAAT TGACTTACGT ACTGTTCAAC 3180 

CAATCGATGT TGACACAATT GTAGCTTCAG TTGAAAAAAC TGGTCGTGCA GTTGTAGTTC 3 24 0 

AAGAAGCACA ACGTCAAGCT GGTGTTGGTG CAGCAGTTGT AGCTGAATTA AGTGAACGTG 33 0 0 

CAATCCTTTC ATTAGAAGCA CCTATTGGAA GAGTTGCAGC AGCAGATACA ATTTATCCAT 3 36 0 

TCACTCAAGC TGAAAATGTT TGGTTACCAA ACAAAAATGA CATCATCGAA AAAGCAAAAG 342 0 

AAACTTTAGA ATTTTAATAC ATTTTAAAAG TTAACGAAGT TAGCGTATTT TAGTCTCATT 34 8 0 

GATTAAAATG AAATGTTTAA TTTACGAAAT CTTAGGAGGG CAAAAACGTG GCATTTGAAT 354 0 

TTAGATTACC CGATATCGGG GAAGGTATCC ACGAAGGTGA AATTGTAAAA TGGTTTGTTA 3 6 00 

AAGCTGGAGA TACTATTGAA GAAGACGATG TTTTAGCTGA GGTACAAAAC GATAAATCAG 3 660 

TAGTAGAAAT CCCATCACCA GCATCTGGTA CTGTAGAAGA AGTTATGGTA GAAGAAGGTA 3 72 0 

25 CAGTAGCTGT AGTTGGTGAC GTTATTGTTA AAATCGATGC ACCTGATGCA GAAGATATGC 3 78 0 

AATTTAAAGG TCATGATGAT GATTCATCAT CTAAAGAAGA ACCTGCGAAA GAGGAAGCGC 3 84 0 

CAgcAGaGCA AGCACCTGTA GCTACTCAAA CTGAAGAAGT AGATGAAAAC AGAACTGTTA 3 90 0 

AAGCAATGCC TTCAGTACGT AAATACGCAC GTGAAAAAGG TGTTAACATT AAAGCAGTTT 3 96 0 

CTGGATCTGG TAAAAATGGT CGTATTACAA AAGAAGATGT AGATGCATAC TTAAATGGTG 4 02 0 

GTGCACCAAC AGCTTCAAAT GAATCAGCTG CTTCAGCTAC AAGTGAAGAA GTTGCTGAAA 4 08 0 

CTCCTGCAGC ACCTGCAGCA GTAACATTAG AAGGCGACTT CCCAGAAACA ACTGAAAAAA 414 0 

TCCCTGCTAT GCGTAGAGCA ATTGCGAAAG CAATGGTTAA CTCTAAGCAT ACTGCACCTC 42 00 

ATGTAACATT AATGGATGAA ATTGATGTTC AAGCATTATG GGATCACCGT AAGAAATTTA 4 26 0 

AAGAAATCGC AGCTGAACAA GGTACTAAGT TAACATTCTT ACCTTATGTT GTTAAAGCAC 4 32 0 

TTGTTTCTGC ATTGAAAAAA TACCCAGCAC TTAACACTTC ATTCAATGAA GAAGCTGGTG 4 3 80 

AAATCGTTCA TAAACATTAC TGGAATATCG GTATTGCAGC AGACACTGAT AGAGGATTAT 4 44 0 

TAGTACCTGT TGTTAAACAT GCTGATCGTA AGTCTATTTT CCAAATTTCA GATGAAATTA 4 50 0 

ATGAATTAGC TGTTAAAGCA CGTGATGGTA AATTAACAGC CGATGAAATG AAAGGTGCTA 4 56 0 

CATGCACAAT CAGTAATATC GGTTCAGCTG GTGGACAATG GTTCACTCCA GTTATCAATC 4 62 0 

A^^^A^Ar^^ r-.-^.T^T-r. ^OAATT^nrr GTA7TGCTCA AAAA n CTAT^ TTTAAAGATG 4 ^ R n 
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AAGAAAAGCA AGACGTTGAT CAATTTAAAT AATTAATATA ATACAGATGG TAGGAAACAA 12 00 

CTAATACAGT TCCTATTATC TGTATCTTTT TTTATTAAAA CAGAACTTTT TCAAATGGTT 1260 

TAACAGTCCC ATTTATTTGT GGTACAATTA GTAAGGATAA AATGAATTTC TATACAATTA 1320 

TGGGAAAGGT ATTGTGAATT GAATGGCTCC TAAGTTACAA GCCCAATTCG ATGCAGTAAA 13 30 

AGTTTTAAAT GATACTCAAT CGAAATTTGA AATGGTTCAA ATTTTGGATG AGAATGGTAA 1440 

CGTCGTAAAT GAAGACTTAG TACCTGATCT TACGGATGAA CAATTAGTGG AATTAATGGA 1500 

AAGAATGGTA TGGACTCGTA TCCTTGATCA ACGTTCTATC TCATTAAACA GACAAGGACG 1560 

TTTAGGTTTC TATGCACCAA CTGCTGGTCA AGAAGCATCA CAATTAGCGT CACAATACGC 1620 

TTTAGAAAAA GAAGATTACA TTTTACCGGG ATACAGAGAT GTTCCTCAAA TTATTTGGCA 1680 

TGGTTTACCA TTAACTGAAG CTTTCTTATT CTCAAGAGGT CACTTCAAAG GAAATCAATT 1740 

CCCTGAAGGC GTTAATGCAT TAAGCCCACA AATTATTATC GGTGCACAAT ACATTCAAGC 1800 

TGCTGGTGTT GCATTTGCAC TTAAAAAACG TGGTAAAAAT GCAGTTGCAA TCACTTACAC 1860 

TGGTGACGGT GGTTCTTCAC AAGGTGATTT CTACGAaGGT ATTAACTTTG CAGCAGCTTA 1920 

25 TAAAGCACCT GCAATTTTCG TTATTCAAAA CAATAACTAT GCAATTTCAA CACCAAGAAG 1980 

CAAGCAAACT GCTGCTGAAA CATTAGCTCA AAAAGCAATT GCTGTAGGTA TTCCTGGTAT 2040 

CCAAGTTGAT GGTATGGATG CGTTAgcTGT nATATCAAGC AACTAAAGAA GCACGTGACC 2100 

GCGCAgTTGC AGGTGAAGGT CCAACATTAA TTGAAACTAT GACATATCGT TATGGTCCTC 2160 

ATACAATGGC TGGTGACGAT CCAACTCGTT ACAGAACTTC AGACGAAGAT GCTGAATGGG 2220 

AGAAAAAAGA CCCATTAGTA CGTTTCCGTA AATTCCTTGA AAACAAAGGT TTATGGAATG 22 80 

AAGACAAAGA AAATGAAGTT ATTGAACGTG CAAAAGCTGA TATTAAAGCA GCAATTAAAG 2340 

AGGCTGATAA CACTGAAAAA CAAACTGTTA CTTCTCTAAT GGAAATTATG TATGAAGATA 24 00 

TGCCTCAAAA CTTAGCAGAA CAATATGAAA TTTACAAAGA GAAGGAGTCG AAGTAAGCCA 24 60 

TGGCACAAAT GACAATGGTT CAAGCGATTA ATGATGCGCT TAAAACTGAA CTTAAAAATG 252 0 

ACCAAGATGT TTTAATTTTT GGTGAAGACG TTGGTGTTAA CGGCGGTGTT TTCCGTGTTA 2580 

CTGAAGGACT ACAAAAAGAA TTTGGTGAAG ATAGAGTATT CGATACACCT TTAGCTGAAT 264 0 

CAGGTATTGG TGGTTTAGCG ATGGGTCTTG CAGTTGAAGG ATTCCGTCCG GTTATGGAAG 2 700 

TACAATTCTT AGGTTTCGTA TTCGAAGTAT TTGATGCGAT TGCTGGACAA ATTGCACGTA 2760 

50 CTCGTTTCCG TTCAGGCGGT ACTAAAACTG CACCTGTAAC AATTCGTAGC CCATTTGGTG 282 0 

GTGGCGTACA CACACCAGAA TTACACGCAG ATAACTTAGA AGGTATTTTA GCTCAATCTC 2 880 



30 



35 



40 



45 



55 



EP0 786 519 A2 



w 



15 



AGTCTCACGA TACACCCAGT AAGGAATCGA AACAACAGCG AGAGCAATAG CACTGACCAC 960 

ACCTTACTGG TTCACTTTAG CGAACTACGC CATCGGTTAG TAAAAATTTT ATTGTCGTTC 102 0 

GTCATTACGG TCATCGTCGT ATATGTyTCA TCATTTTGGT GGATGACACC ATTCATAACG 10 80 

TATATyACCC GgCACATGTG TcCTTACATG CATTTcATTC ACAGAAATGA TACAAATAAC 114 0 

GTG 114 3 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7953 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



20 (xi) SEQUENCE DESCRIPTION : ! 

CAACGCCTGA ACGTAAACCA TATCGTTTCG 
ACTCTCTCAT GGCGATTAAT GTTTCTTTTT 

25 CTGCTTTTTG ACGCAAAGTT GGATGACCAT 

ATTGCACCTT CCTTATTTTA ATTTGTTTTA 
ACTCATACAT TTTTATGTGT AAATATCTAC 

30 TTAGATAGTA TATGTAAATT TTCAaGAtAT 

TTGGTTTCAA GCAATGaTAC TTTAGAAATT 
TATAAATGAC GTAACTGTCA ACAGATATAC 

35 

TAAAATTGAT TTCCAAGCAG ATTTTATTTA 
AAGTAATGAA ATTTGGAAAA ACAATCGCAG 
GATGTACTAC GGATAAAAAA GAAATTAAGG 

40 

ATGATGAAGA ACCAATTAAA ACTGTTGGTA 
AAAAATTAAC TGAAGATGTC AATAGTAAAG 

45 ATTTAATTAA AAATGCCGAT GATCGTCTAA 

AGAAGTCTGA ACAAGACTTT AAGAAAGCAA 
TTAAACGTAA AGAAGTAAAA CAATTAGATG 

50 GTGATTACGC GAAAGCATaT AAAAAGGCTG 



JEQ ID NO: 152 : 

CGATTTCCTC ATCTTGACTA TTTACTAAAA 60 

CTTCTTTAGT TAATGGTAAT TCTAACTCAG 120 

CTCTAATGAT GTCTTTCATT GTTAACATAT 180 

GTTGAATGAC AGTAAAAAGG TTGTTAAGAT 24 0 

AAAGTTAACC AACTACTGCC AATGTTTATT 3 00 

GCgTAATTGC gTTAAAAAAT GaTTAAAGTG 3 60 

TATTTATCAT CTTGACTTTA AAAATTATAT 42 0 

TTAGTArTGA AGATGTGTAA TGTAATTGTT 4 80 

TCATTTAATT TAAATAGCAA GTGGAGGTAC 54 0 

TAGTATTAGC ATCTAGTGTC TTGCTTGCAG 6 00 

CATATTTAAA G CAAGTGG AT AAAATTAAAG 6 60 

AGAAAATTGC TGAATTAGAT GAGAAAAAGA 7 20 

ATACAGCAGT TCGCGGTAAA GCAGTAAAGG 7 80 

AGGAATTTGA AAAAGAAGAA GACGCAATTA 84 0 

AAAGTCACGT TGATAACATT GATAATGATG 900 

ATGTATTAAA AGAAAAATAT AAGTTACACA 96 0 

TAAACTCAGA GAAAACATTA TTTAAATATT 102 0 

TTAA^AAAA ATrAwAAn™. ATAHA A^Ar.a n a n 
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TGACACGTTT GCGAAGTGAA TTTGAATATC AAAAGCACAG TTATGATTAG CGATATAATC 2520 

AAATATTTCA TTTGTATTCA TTAACTTTAT ATTACGCTTA GTAAATTGAA TTGCAGAAGC 2580 

5 GTGACTTCCC ACTTCTGCAA TTTCTAATGT TTCATGATGA TTAATTTTTG TATCTACAAA 2640 

ATGAATGTTT GCCAATTTCG CCTCATTCAC TTTTATATAG TTAAGCACCC AAACTGCAAT 2700 

ACGCGACTTA AATCGATATT GAAAAAGTAA ATATTCAATA AAACTTTCTT TAATTTGATT 2760 

10 

GAGTGTCTCT GACATCAAAT ACCCCATTTT AAGATTGCAA TCTTGaTAAT TCGTCATGCC 2 820 

AATTTTCGTT ACTTGGcTCT AGTTCCAACA ATTGATTTAA AATAGTAATT GCTTGTTCCT 2880 

TTTGACCAAT TTCAATTAAA TAGAAATAAT AATCACTCAT AAAATCAATA TTTGTTTTCA 2940 

15 

TCGTTGGATA TGCTAATTCA AAGAAATGTT GAGCTTCTTT ATCTCGCTC 2989 
(2) INFORMATION FOR SEQ ID NO: 151: 

2Q (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
CATCAACTCC TTAATTACAC TGTAAATGAT ATGCGTCTTT TTGACAACTA TATTTGTCAA 60 
30 ATCTACACCA AAAAATATGA TTATCCACCT ATGTATGACA TTTTGAAACA AACACCTCAA 120 

CGCCTACAAG TCATAATTGT TTACTTTCGT TACACCTTCC TGCATAATTA ACAGCATTCT 180 
AATTTTAGTA TGATGCACGC ATTTTCACTA AATCAAACCA TTCAAAGGAG ACTATTATGG 24 0 

35 

CATTTACATT ATCTGCAATT CAACAAGCAC ATCAACAATT TACTGGTGTT GACTTTCCAA 3 00 

AACTATTCAA AGCTTTTAAA GATATGGGGA TGACTTACAA TATCGTCAAC ATTCAAGATG 360 
GCACTGCAAC ATACGTACAT CAATCAGAAG ATGATATCGT TACGTCATCT GTAAAAAGTA 420 

40 

ATCATCCTGT TGCTCAAAAA TCAAACAAAA CAATAGTTCA AGACGTCTTA ACTAGACATC 4 80 

AACAAGGGCA AACAGATTTT GAAACATTTT GTGATGAAAT GGCTGAAGCT GGCATTTATA 54 0 

45 AATGGCATAT CGATATTCmA GCGGGCACTT GTACTTATAT CGACTTGCAA GACCAAGCTG 6 00 

TTATTTCAGA ATTAATCCCT CAATAAACTA TATTTATAGC AACATTTTAA TTATTTCATA 660 
AAATTTTATT GATAATCATT ATCGTTCGGT ATAAAGTAAA TACTATATAC TACTTATGAG 720 

SO TGAGGTTGAT TATCATGATA ACTAACACTT TTATTTTAGG CATCACAGGC CCAACAAGTC 780 

TTGTCGTCAT TAGCATTATC GCTTTAATTA TTTTTGGTCC GAAAAAATTA CCACAATTTG 84 0 
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AATTCAGTTT ATATAAATGT AATGCATTCC TAACTAAATT AAATCAATTG AAATTGGGAT 720 

TATAACTTTA TGATACGTAC CACTACAATA AAATAATATA GTGAATAATC TACCATTAGA 7 BO 

AAAATAAGCA CAAAAAAACT AGCAACCACA CAAAAATGTG ATTAGCTAGT TAATAAGTGT 84 0 

CTAATTTAAG TTAATTGTTA ATCTATAAGA TTAATCACTT GAACGCGCAA TCAAAATAAT 900 

ACGTACAAGC TCTGCTACAG CGACTGCAGT TGCTGCAACA TAAGTCATTG CTGCTGCAGA 96 0 

TAATACTTTA CGCGCATGCT TGTATTCTTT TTCATTTACA ATGTTCAATG CCGTAATTTG 1020 

TTTCATCGCT CTTGAACTCG CATCAAACTC AACTGGTAAC GTAACAATTG AGAATAATAC 108 0 

CGCTAATGAC ATTAAACCAG CACCAATCCA TAAAGCAGTT GAACCaAATG CACTACCTAT 114 0 

CGCTGTTAAG ATAATACCTA ACATGATGAT CATATAACTT AATGAACTCC CTAGGTTTGC 12 00 

AACAGGTACT AATGCTGCTC TGAATCTTAA GAACCAATAT CCTTGGTGAT CTTGAATGGC 126 0 

ATGACCAACT TCGTGGGCTG CAATTGCAGT TCCAGCAACT OATGGTCTGT CATAGTTTGC 1320 

AGGAGATAGT GAAACAACTT TCTTTTTAGG ATCGTAATGA TCTGTTAAGA ATCCTTCACC 13 80 

TTTAACAACT TOGA CAT CAT AAATACCGTT TGCATGTAAA ATTTCTAATG CAACTTCACG 14 4 0 

25 ACCCGTTTTA CCACTAGTTG ATCTAACTTG TGAATATTTC TCATAGTTAG ATTTAACTTT 150 0 

GTGTTGTGCC CATAAAGGAA GCACCATTAA TATTACGAAA TAAATTATCA TAGTAAAAAT 1560 

TGAAGACAAT AAACTCACTC TCCTTTATAA ATATTTTACT GTCATTTGCC GTTTTTATCA 1620 

AATCATTTAC ACTTTAATAA TTTGTTTAAT TCAATATAAA GCAAAAGTCC AAAAACACTT 16 8 0 

AGACAACATG ATAATACACC AATTTGCCAC ACATGTGTAG TTATAAAATC ATAATATGGA 174 0 

AATTGAAGGT GAAAATAGTC AATATAATCA TTCAAAAACA CCCAAATCAT yGCTACACTG 180 0 

ATTCCAATCA TAGAACGTTT AAACCTAGGA TAGAAGTAAA TTGCCTGAAC AGCCATTATA 1860 

CTGTGGGAAA ACATTAATAC CAAACCATTT ACTGTAATAT CACCTTGTTC AATAATAAAT 192 0 

AATATATTCA TTATAACTGC CCAAATCCCA TATTTGAATA ATGTTACAAA TGCCAGTGCA 198 0 

TCGATAATAC TATTTTGTTT TTGAATTAAT ATCAATGAGA TAGAAATAAC TAAGTATAAT 2 04 0 

ATTGCAGTTG GGCTA7CTGG AACAAAAATC TTAAAATGCC AGGGCGTATG ACTTAATTGT 210 0 

45 TCACCATACC AT ATATAAC C ATAAATCATC CCTAATATAT TACAAATGAG TAGCATCATT 216 0 

AACCAAGAAC GTTGATAAAG TGTATATTGC CAAAATGCTT TAATTGTCAT CTGCTAAGTC 222 0 

CTCAAATTGA TTATGTTTAT TTACTAGCTT GAGTGTATTT AAAATTTGCG TTAGTTGATA 2280 

50 AAAACGTTGC TTTTCATTCA TCTGTAAACT TAAATCAATA TTGTGTAACA AGTAATCTAT 234 0 

TAATAAT^A "iTTAT^r" r,ATrTATAG^ ^A~AC^A' r ~ T T AArVTTATnAA OATAAnTTT^ ?4 n " 
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AAAATGCTTA TAATTCTTAA TCTGCACAAC TGCCCCATCT TCTGTTACCG TTGTTCCGTC 972 0 

CTCATTTATA TCTAGTACTA AGGCATTGTT ATCTTTTGTT AAAAACGTAG TTTTACCAGT 978 0 

ACCGAACTTG CCGTATATCG CAAATTTATA AAACTTGTTT GCATTTTGTT TGCTGATGTC 9 84 0 

TTTTACACCT AGTTGCGTTA AAATATCGAC AT CTTG ATT A GTTTTTTCAG TCATCTATTC 9 900 

TCCCACCTTT ACCGTGTATG ACGTTGGTTT CTCCACAATG CTAGCACCCT CTAAAACTTC 9960 

GCCGTTTGCG TCAAT CAATG TGCCGTTTTC AGTTACATTG AAATCTTTCT TAATGTCTGA 10020 

TTGGCTAAGT TTTTTAGTTA CTTTTACATA GTTGTCAAAA CCTCGTTGCT CAAGTTGTnT 10080 

AATGACTTCT TGCTCATTGC TAACTTGAAT GACTTTTGAA CCTTTTCTGG CTGTCACTTT 1014 0 

TCCGTAAGtG TATTCAACTT GAATTTGCTA TCTTGTTCTT TTTGTATTCT GTAATATTCA 10200 

ATTACAAGGC TTTGTAAATA TTCTTTGCCA CTCTGTAATT TTTCTACTTC TTTATCTTTC 10260 

CATTCGTTTA TGCGTTCAAT TTCTTTATTT GCTAAATCGT TGATTTCATT CTCTTTAGTT 10320 

GTGATTGCAT CCAGTTTCTn AAAAACCCAG TTAGCACTGT CTAGATCAGT nACTTTGAAT 103 8 0 

CGGTCGTCTT GTTCGAATGT n 104 01 
25 (2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

30 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TTTCTCTCTA TTATTCTCGA TGCGTAGATA ATTGTTTAAA TTTAAGTTTA TAGTAATGTT 6 0 

GAGTTTATAA TTTCATATAT CTAAAAACAG GTGTTGTATA TATAATCATT CATCTAGTTA 120 

TACTTACTTT AAAAATAATA TAATTTCATG CGATGCAATT CATTGATGGA TGTTTTTAAT 180 

CTTAATCAAA TCCAaATAAA G CAT AT ATTT TTAAATTCAC TTTCTTTCGA ATCGATTTTT 24 0 

ATCTCTTGnA TTAAACTTTT CCATTGTTTC ATTAAAGCTC TCTGTCATAT CTATTC CCAT 3 00 

TGAATTCGCT AAACATAACA ACACAAATAA ATTATCACCT AATTCTGCTT TAATCGTATT 360 

TGCTTCCTCT GAATCTTTCT TCTTTTTTTC ACCATAGGTA TGATTTATTT CACGTGCAAG 420 

TTCGCCCACT TCTTCAGTCA ATCTAGCTAA GTTAGCTAAT GGTGAAAAAT ATCCTGTTTT 4 80 

50 AAATTGTCCA ATATATTCAT CAACTTCACG TTGCATTTCT ACCATTGATT TCATTTCTAC 54 0 

GTTCTCCTTA TATTGCATTT CTAATATAGT ATATATCAAT TTGAAGTCTC ATGCATGTTT 600 
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CTCCTTTCAG CATTTTGTTG AGCCTCTCAT CAACTTTTAT CCACGAGTCA TGCAAGTGAT 792 0 

ATTTATCATC AAACGACTTA ACGCCAATTG CGTGCTGTTC ATTATGATGT TGTCTACACA 7 98 0 

GTGCTAACAC ATGTTTGTCG TAGTGATTCA TTTTGTTTCT GTTCATGCCT CTGCCGACTG 804 0 

CTTCATAATG TGCCAGGTCT GCGTGAGGCT TTCCGCATAT TACACAGTTG CGGTTGATTG 8100 

TAGCCCAATA TAATAACGCT TTATCTTCGC TTAACAACTT ACTCGTTTCT ACACTCATAG 816 0 

GTATTTGATG ATGAAACATA AACGCTATAA TCAGTTCTAT TAACTCCCTT GCAACTTTCA 822 0 

TAGAACAGTC GCGCAGACTG ATTTCTTCAT AACCTTTCAT AATTTCCAAT TCTGTTTGTA 82 8 0 

ATAATTTTCT AGTTGATTCT ACTGGTTCGC CCCAGTGAAG TTCTATATCT CTACACATTG 834 0 

CGAATATTTT TTTGCGTTGT TCTATAGATA GTTTTTTATT GTCCGGAACC TCTACTTCTG 84 0 0 

CTTTTAGTGG ATATCCGTTT TCTAGTAAGT CAATGTGACT TTGTTCAAGT TCAACACCAG 84 6 0 

TAGCAACGAC GGAATAAGTA CCGTCATTGT CTTTCTGGTA TCTTGTAATG TATTGCATTT 852 0 

AAACCACGTC CTAGAACGGT AAATCATCAT CATTGATTTC TATTGGACCA TTAGCATTAG 85 8 0 

CGAATGGGTT TGATTGTTGA CTCATTGGCG TCTGTTTCCC ATTTGCTTGC TGTTCTTTTT 864 0 

GTTTCATCTC ATCAGTTTTA GGTTCTGGTT TATTAACTAC TTCATCGTCT TTATTCCAAA 870 0 

CTTTTACATA TGAGAGTCTT ACAAAATACT TGCCTTGTTC CTCGTTAAAT TTATTTTTAA 8 76 0 

GTACAATAGT TCCGATTTTG TTAATTAATT GATCTGTGTC AAAAGTTAAA TCTGGTAAGT 8820 

TCAATTTAAT TCCTAATCTA CTAAGTAACT CGATATATTG TTTTTCTTGA TAATCTTGTT 8 8 80 

GGAATGGTGG GACGAATTGG TTGTGTTTGT ATTGTTTACC TTCGTTGTTT TCAAAAACAA 8 94 0 

TCGTGAAGTA TCTGTTTTCT CTGTCGTTAA ACTCGACATT TGCAACTTTT ACTGTAAATT 90 00 

CTCCAGCTCC TAAAAAGTCC CCACCTTTCA TGAATGCCTC TTGATTAGTT TCTTGAATGT 906 0 

ATTGTGTTCT ACCAGTGATT TTCATAATTT TTATACCGTC CTTTTAATTA ATTTTTAATT 9120 

ACCATTTCTA ATTGCTTGTA CAACATCGTT AATACTTGGA TTAATGAAAC GTTTGTTGTT 9180 

AATTTTGATG TTGCTTGAGT GTCTTATCTT TGTCTCGAAT AAATTTGATG GTTCAGCGTT 924 0 

AAGTACATAT TGATAAGTTT TTTCGCCGTC TTGCTCATGT TCTTCTATTG TCATTCTTGC 93 00 

TAACACGTCA GATTGACTGA TGACTGCTTT TTTTATTTGG TCTTGTGCCT CTATCGTGAT 93 6 0 

TGTTGGATTG ATAGTACTTC CCTCATCATC TTTGTCTTTG TTAATGCCCT CGTGTCCGCT 94 2 0 

TATAGCAAGA TGAAATTGAT AATGTTCTTG TAATTTAGAA ATATAACGAT AAATACTTAC 94 8 0 

AATGCGTGTA GCACACTCGC CCCAATCATT AAATGTCGGT TTCTTTGATT TACCGTCCAT 9 54 0 

^•i^CTr^^-'- 1 ^ m A^T r, «A m A T ^n^Mir^T tt tvt^^t" -^n ^.^"^"^r, ~ T\ p, ^ a r. n ^ or^- 
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AGACTAAAGA AAGATGTTTT GTATCCATTT TGTGCTATGT TCAGCATCAT GTTTAATGCA 6120 

AAACCTGTCT TACCCACTGA GGGACGCGCT GCGATGACGA TTAATTGTGA TGGTTCTAAT 6180 

CCCCCTATTT TGTAATCCAT TAGCTTGTAA CCCGTCTTAA TTTGCTTCTT AGGGCTATCG 624 0 

CTGTATAACT CTTCGACAAA CTCCTCAACA AACTTCTTGG TTCCATCTTC TTTTTTGTTA 63 0 0 

GTAATTGTTT TTAAATCCTT GAGTTCATCA ATCAAGTTGT TAAAGTTTTG GTTCGTAGGT 6360 

TGTTGTTTGA ACTCAGTTAC CAATTCGTTA GCTTTGTTGA GCTGATAACT TTC CAATAAT 642 0 

TCTTGTTGAT AACGTTCAAA GAAGCCATAT CCAATGAAAT CGGAGTTGTA AAGTTTAGTT 64 80 

ATAGTATCTG CATCTAAAAA TTCTTTATCT TTAGTTGCTT TTAAATAGAT TTCTTGATGA 654 0 

TCTATCTTTC CGACGTCCAT TACATAATTG AAAAAGGTTT TAAACTTTTC GTTCGTAAAC 6600 

ATGTAATCTT TAACTCTTAT CTTTTCTAAT ACGTCCGGTT GTTTAAGTAG CGTAGCGATT 6660 

ATTGT A CTTT CAATTTCGAA TTGTCCGTAA TTCATTCGTT TTCGCCCCCA AATTCTGCCA 6720 

ACTTATTCAT GAACTTATCT AGCGCTATTT TTCTTTGTCT GACATATTCG GGGTCATTCT 6 780 

GCATTTTCCA TTGGTGTGTA GCGGTTTCGT TATCTACTGG CTCGATAGAT ACTTTTTTAG 6 84 0 

GTTCCTTACG CATGATTGCT GGTAAGTTAG GCGGGTACGG GTTGTTACTG TTGATATAAA 690 0 

CATCTACCGC TTTTACAGTT GGTTGATAAT CTCCATTTTG ACTTAATACA TCAATCCACA 6 960 

TTTCTAACTT CGGTTTATCA AAATCAATGT TGTATACGTA CCTAACTTTT TTAATAATTT 702 0 

30 CTAATGCTTG TGTTTTGCTC ATCGGCATTA GTCATCACTC AATTCTTTTT CCATTTGTGC 7080 

AATGACATCA TCAGTAGTAT TTTTTCTAGG TGCTATTTTA TTTTCTGCAT CTTCTTTTGT 714 0 

TTTGACATTC TCTTTAGCCC AGTTGTTTAA AACTTTAATT AAATAGCCAC CATGCGCACT 7200 

TTTGCTTTTA GTGTACTCAA CACCTACTTT TACAACTTCA AAAGCGTTTG TACCTATATC 7260 

ATCAATAGCA AACCCTAATT GTTCCATTTG ATTAGGTGTT AACTTATCAT CCAAATTTGC 732 0 

AATTATATAT TTTATTGAAG ATGAGAAGAC GGCTTCTCTT TCTTCTTCTT TATTCTTATA 73 80 

TTCTTCTTCT TTTTCTTCTT CTCTTTCTTC TTCTTCTTCT GTATCGTTAC GTAACGTTAC 744 0 

GGTAACGTTA CGTTTTGCTT CTAGTAACTT TTTCTGTTTC TCACGATAGC GTTGTTGTCG 7500 

CAATTTATTT TTTTCTTTAT GCTTAGCTTT GCTATCTAAG CTTTGATGCT TCTCCCAGTT 7560 

TGTCACTTTT ATGACACCAT TAACTTTTTC AATCATGCCC AATGTCTCAA AAGTTTGAAT 7 62 0 

TGCTAACCTT ATTGAGTTAA TAGGTCTATT AAATTCATTT GCTAACATTT CTTCGTTGTA 76 8 0 

CGG CAAGTTT TCGGATAGCA TAATATAACC TTGTTCATTG TACTTTCCTG ATAAAGTTAG 774 0 

TAACTTAACC CAAATAGTTA TGATCGTATC TCTTTCGGGT AAAGCTTCGA TATATTTGAT 78 00 
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ACATTTAAGT TAACCATCTC AGCTTTTCCG TTTTTATATC CACTAATAGT TGATCTTGAT 4 32 0 

ACGCCAGTTT CATTGTGCAA ATCTTGGACA CTTACGTTAT CTCTAGCCAT GATTACCCTT 4 38 0 

5 AAATTAGTTG CGAATACTtC GTTCAACTTC ATTTATTCCA CCTCTATATA TGCATGTCTT 444 0 

ATTGTTATGT TGTCATACTT TAGTAATTCG TCCGGATTGT CATCTAAGCG CTTTGCCAGC 4 50 0 

GTATCTTTTT CTTTATCCAC ATCATCGTAA TGCTGATATT CAACTTCTGT AGGTATTCTT 4 56 0 

10 

ATATCAATCG TTGCGTTTAT ATATGCTTGT TGTTGCATTA GATCACTTCA TTTCTCTTTT 462 0 

TCTTTTACGT CTGACTTTCA CTAAGTCCTC ATATACCATC CATTCTTGAC CTGTGTATTT 4 6 BO 

AGGCGCTTTA CATATCCACG TTAAATTCAC ATCTCTATAC TGATATCTGA ATATCTTCGC 4 74 0 

15 

TTTGATGTTG GCAACTTCAG TCGCCTTACC TTTAACGTCT ATAACTTCAA CCAGTTTCCC 4 8 00 

TTCCTTCCAC AAAGAGAAAT CGGCTATATA CGTAATCGGT CTTTGTTTCC CGAATTTAGG 4 860 

2Q TTGTAATTCA AATTTCGGTT GTATTTCGAT ACGATCATAG TTAGTGCCAT T CAT ATT ACT 4 920 

TTCTAAATAT TGGTAATATT CGCACTCTAC TTTGCTATCA AATACAATTC CTTTGTACTC 4 98 0 

AACTTTCTTA GCATTGTATT TACTCATTGT GCCACCTCTA AATATCAAAT ATCGTTGCTT 5 040 

25 GCAATCCTAG CTCTTGCTCA TATAGAAGCC CGTGAGCGCC TTTGAATCGT TTTAGGTCAC 5100 

TATCAGTCAT AATTTTCTTT TCGTCGCTGA AATGGGCTCC TGTGAGCGAA TAAACTTCAT 5160 

TTACGTTGTC TTTATACTTG ATGACCTTAA TATCTTCCGT GCCATCTTCT CGGTATAAGT 5220 

30 AATATTTTTC TTTCGGCATT TTTTAACACT CCTTAATGTG TGTTTTCTTC CAGTTGATTT 5 2 80 

CATTCATGAT TTTCTTTTCA ACTCTGTCGT AATCATCGAA AGGCGATAAC TCGTTATTGT 534 0 

CCAACAATCT ATTGACCGCC CAACCAGTCT CGATATATAC ATTTGCTACA ATCGGGTCGC 54 00 

35 

TTTGCTTTGT CTCTTCATAC ATCGATTTCA ATAAGCTTTT GAATTGCATT ATATTCATGT 54 6 0 

GAAAAACCTC TGAGTCTTCT TGTAATACTC AAATTCAATT ATTCCGGTTT CGCCGTCTTT 5 520 

GTTTTTGGCT ATGTTACATT CAACAATAGA TTTGCCAGTG ATACTGTCAT CTTCGTCACG 5580 

40 

GTTATAATAA TCATCACGGT AAAGTAGCAT CGCTAAACTC GCATCTGCTT CTATTCCGCC 564 0 

TGATTCTTTC ATGTCCGATA GCATTGGTCT TTTATCCTGT CTAGACTCGA CACCACGATT 5 7 00 

CAGTTGTGAA AGTAGTACGA TGATTGCGCC TGTCTCGTTA GCGATTATCT TTAAGTCACG 5 76 0 

TG AT AT CTTT TCTACTGCTA CACGTCTATC AACTTTCGCA TCAGTATCCA TCAGTTGAAG 5 8 20 

ATAATCTATA AAAATAACTT GTTGCCTGTC TGAATGCCTC ATTGtTGCGC TCGCACATCT 58 80 

SO TGCGGTGTGA TATTACTTTT ATCAGAAATA TCGATGCCTA ATTTCATGAT TTTATCCATC 5 94 0 

GCATT CGTTA ACTTTGTTAA GTCATCCGGC GTTAAGTTCC TGATTTCTTT TAT CTTTG TT 6 00C 
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TACAAGTATT GGAACTAATG TAATGATGTA ACTCACTTCC CCAAAACCTC CTTGACTCGA 2 520 

TCTAAGATGT CTTTACACTC CGCTACTTCC GAAGCCTTTT TCTCCACGTT CTGAAACACT 2580 

TTCGAATTCC TCCACTTGCT TTAGTTCAGG TGTCCATATA GGCACGATAA CCAATTGAGC 264 0 

TAGTTTGTCT CCTTCGTTGA TTTGATAAGT TCCGTATTGT CTTATGGCGT CACTCAAATC 2700 

GATTTCTCCT TTAATATCAA AAACACCTGG TGTGATATAA CCATTCGATG CAATAGCGTC 276 0 

ATTCTTGATA TTAATCCCTA AATTGCCGTG ATATCCCGCG TCTATCTTGC CTGTTTCAAT 2820 

CACTAAATGC GTTTTACTAC TTACACCACT ACGGCTAGTT AATAGTCCGA CATAGCCCTC 2880 

TGGTATGCTT ACAGCTACAT CTGTTTTAAT CACTGCCTTT TCTTGTGGCT CAAGTACGAC 2940 

AGTTTCAGCT GAGAATATGT CATAACCTGC ATCCGTCTTA TGATTTCGTT CGGGCATTCT 3000 

AGCATTTTCT GATAATAGCC TTACTTGTAA TGTGTTAGTC ATTTTCCTGC TCCTCCCTAG 3060 

CTGTAGCAAA CGCTATTCTC AATTTCAATC TTTCAACAAT ATGAATTAGT GCGGTATTGA 3120 

GGAATATTTC AAATTCTTCA ATGTTCTCAT CTATAAAATC AAGTATTTCT TCCTCTTGTT 3180 

CACTGTCAAA CTCGCTTAGT ACATCCCAAA TATTTATGTC GCTTTTGCTC GTTTCTAATA 3240 

CTCTTTTGAT TATTTCTGAA TTACTTTTAT TACTCATTTT CCTTGTTCCT CCTCATATTT 3300 

ATAGACAACT TGACCTGCCA TAATCCCTAC TGCTTCATCA AGTTCAATAC CTTCTTTAAC 33 60 

TGAATGTTGA ATAGCATTTG TCATTCCCTC AAGTATTTCA TCAAACGCTT GTGCTCTCTT 34 20 

30 ATACACGTCC TCAATCTCTT TTAGTAATCC CTCTGTGTCA TTACCGTTAT ACGCACTAGC 34 80 

ACTGATCACT GATTGTTCAA TTTGTTCGCG GTT ATT CATC ATTTCCATCT CCTCTAAAAT 3540 

AAAGTTAGTT GCTTCTGCTC CTCGTATTCC AAACCATGTT GCTTTATATA TGTTTCGAGC 36 00 

TCTTCCGCTG TATCAAATGT CTTTTTCACG CCTTGCCAAC CTGGCACGAT ATGCCCATGa 3660 

AAGTAATAAG TGCCGTTCAC TACATGGATA TGTGCCACTC GTTCGTTATC CTGATACAGA 3720 

TATCTCTTAG ATCCGAAAAA TTGGTTTAAG TATTCTTTAC ATGCGCTATC GGTTTTAGGC 3780 

ATTTATGCTT CCTGCCATTT CTTAAACATT TGGTTATAAG TAGTATCAAA CCAGTACGGA 384 0 

TCACGTGAAT GTTTTTGAGG CACATTAAAC AAATGTGGCT TCTTCTTACG TAGTTCAGCC 3 900 

TCTTTACGTC GTTGCCTAGC CATTTCACGC TCTTTGCTCT CTCGCTCCAT GATTTTGGAT 3 960 

AACACAATTT CTTTATACTC AGCTAAGCGC ATACCATAAG GTGCATGTAA GGCTTCTAAC 4 020 

AACGCCCAGC CACCTCGTAC TCTTTTTGCA ACCATTCCTG GAGTTAAAC C GTTCTTTTTT 4080 

AT CAATTCAT TTTCATGTTC GGTAAATTTA TATGGTTTAC CGTTAATCTT TACGATACTC 414 0 

ATTTATTCCA CCTCTATACA TTTACTTTTT TTAATCCAAT CCTCTAATTT GTGCGTGTTG 4 200 
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TTTAAATGGT 


CAT A TTT PTT 

X X X X \— X X 


aptpta arrr 

ML iu A HAo^L 


1U11 ^jAw 1 1 


L 1 LL 1 L 1 MLfL 


a at apa arr a 

MM 1 ML\MA\jLA 


720 




GATAACGCTA 


AAGPTTPJTGT 


MM A ML A LA 1 1 


AAA PP ^'* # TV 1 *T w 1 1 


L 1 lo 1 M 1 L 1 o 


1 1 L 1AM 1 Lvj 1 


780 


5 


TCTTTAATAT 


ATTCCGAAAC 


ATT A IPfiTTT 
ftl i nnw\ AAA 


ptta apaaTP 

Li 1 MALAA 1 L 


VjML 1 1 oL 1 MM 


AP A PTPTPPT 
MLiML 1 L 1 viL 1 


84 0 




GTTTTCTTAC 


TATAACCTGC 


TGTAATTGPT 
a \j a MM a 1 LjL 1 


LjL 11111 IhL 


LAI 1ALA1LL 


ITTPaTTfiTfl 
Mil LA 1 1 A 1 A 


900 




TATTCATCTG 


CGAATPTPTT 

v» w/m x \v x w X X 


TTGTTTTTPP. 

AAVJAAAAA LVj 


1 1 Lnl 1 1 U>\1 


TTa ppAPPa a 

1 1ALLALLAA 


PTPTPP r*p PT 
L 1 L 1 LoLVjL 1 


960 


10 


ATACGl'TaTT 


TAAAATTAAA 

x rvw\ X x /vv\ 


AAAGGATTGG 


PTATA ATPaP 


ppa appr*ar*a 

LLAALLLALA 


TaPlTPPTTT 
1 ML»M 1 L L 1 1 1 


102 0 




ATTCCTAATT 


GCGATAAGGG 


AAAPGPAGTA 


PGATAriTPa a 


TaTPPra^a p 

1M 1 LL 1 ALAL 


TSTPhTH aTa 

1 Al LAI AAl A 


1080 


15 


TCTCATTTAA 


GGTATPAAAA 


ML A uu^nU 1 A 


TAPTVIPPA AT 


1 1 LMLj 1 L 1 1 l 


LLl 1 AAl 1L1 


114 0 


TCCGCCAATC 


TAGATATGAT 
x nvjn x m x w-/% a 




ATTPTaTfian 

Ml lUlAlunu 


LAu i i L 1 A 1 L 


AoAAAlb 1\j 1 


1200 




ATGTCAAPAP 


AAA P'l u 1 W PP IV P 

AAnV. 1 x x LM w 


AMM1 1 V,L 1X1 


TTATTa AAhT 
1 1M 1 IKnnnl 


a a tr P r p/™ ,r n r pp 
AA1AL ILllb 


TV TA TP TV A TTrT 1 

AA 1 LxAA 11 Lo 


126 0 


20 


CGTTCTTTCC 


TGPTTOATP-T 

X Vj X X VJM. A VJ X 


fiTT(^ATT& T A 
Vj 1 1 VjM liAln 


PGTTPa AT Af2 
V-Vj 1 1 LAAlAlj 


pprrrrT a a a 
LoL 1 L 1 1AAA 


L 1 LAALiu A 1 1 


1 J Z 0 




TTAPPTPTTp 


GTATAPTAPA 
O Inl ML 1 nUi 


A AP ATA ATT A 


PTT a PTYlTT'a 


1 1 1 L 1 \j 1111 


LLjAaVjIAI 1A 


1380 




GACGGTACAA 




T AT ATTTP.TA 


TrTcrrrtfi a a 


TP p a nnr* tpt 

1 L LMLIjv? 1 Lj 1 


LAI 1A1 1 I LA 


144 U 


25 


P TT PTT A A A T 

w x x x x n/\/i x 


V— i A v — MMO A A O 


TTT ATP AT a A 


1 IAuuAIAAI 


Parana rptp 
LALALAAL 1 L 


ATCTT CTAAC 


1500 




TTTCGAACTG 


TTnATAATTT 

X X VAAV X riTi X A A 


TAATPPGTAT 

A MM A UV^VJ 1 A A 


TTPTTTTT A^ 


TPAT^xA ATAP 
1 LM 1 unA 1 ML 


pptpppt an 
LL 1 LLo 1 ALA 


1 C C f\ 




AATATG TTT A 

X X VJ X X X M 


niLi i v-MMMvj 


TPTPTP A ATP 
IblL 1 LMM 1 L 


t a rTTnT a a 

1 ML 1 1 V- 1 1 MA 


TUTPTPTRTP 

1M1L1L1A1L 


1 L 1 LVjL I L 1 1 


1620 


30 


TAACTTTTAr 


ATfArfTTTT 

M A v»M\_ w A A A A 


A a PTPTTPPP 
nrtL 1 w A 1 LLL7 


I- 1 1 1 MML-M 1 


LALMLL-MMAL 


A A T A T\ P A TP TV 

AA r AAGATG A 


1680 




CTAGTAATAT 
>— x nu x *».r» x m x 


AATTGPTATP. 

MM A A Ox— A rt A Vj 


ATT A APPAPA 


Tr**a T( 1,1 ra rrr 
1 V— Ml w InL 1 L. 


LoMLALL ill 


VjLLL I LAI LA 


174 0 




AATCAGAPTG 


ATPArrrAAr 

M x ^nv. A LnnL 


TTTGPGA APT 
AAA OLvJrtrtU A 


LnL 1 1 uuLoL 


ptpt a p a tp a 
L 1 L 1 ALA 1 LA 


tp a tt a pppp 
lLAl IAoLLCj 


1 D A C\ 
1 O U U 


35 


TCATPATAAT 

x v^n x v«*n A /Wl 


AT AT A PTTGP 

MAMAMLA 1<JL 


TPAP/TTAPAT 
1 LMvj 1 1 nLrt 1 


A PTT A PPT A a 
MU 1 1 ALL 1 MM 


ptp ATapaTr* 
L 1 LA 1 ALA 1 L 


PPT APT A A P A 

LjL 1 AKj I AACiA 


i o c r\ 
I86 0 




ATAATAGTPT 


PA A A A TTT PT 


TTAAPPAPPA 

A AMMLLMLLA 


PTa a apAPPp 

LiAAALALLL 


PHTPTT 7A A 'T W T* 
LAHj 1 1AA1 1 


T A TPP A T A A T 

1 A 1 LLxA 1 AA 1 




40 


TTGTATAGCT 


TGTTTTAATG 

X X X X X X \J 


PGTPTPTTTT 

w vJ A^AwAAAA 


TTPTTTf*2 AT A 

A A ^- A A A wMAM 


TPTPT ATT AT 
ALAL1MA AMI 


PPPPTA Tp p I"?'P 
LvjV— \ M 1 L 1 1 L 


1 Q Q Pi 


ATCAGCTGAC 


ATTAACTCAC: 


TGTCATATT C 

x sj x x *i x x 


ATATAATAGT 


T PTP ATA TTT 
1^1 vJA A M A A 1 


PATTAPTAP.P 
V— M A AMLAMLjL 


n xt n 




TACTACTAAT 


AAGTTTTPAT 


PTAPATPAAT 


PGTTAPPGTT 


TTPTTTP-GP A 


tptpp a tptp 

A L 1 LLM A L A L 


9 1 PI Pi 


45 


TCCTTATCTT 

X \— V_ . A A *1 X >w X X 


AAPTTGTG PP 


TPGTATTTGP 


GPTPAGPTTP 


TTPTTT APTP 
A A L A A A ML 1 L 


TPTnppTP a a 

1 L 1 uLL 1 LAM 


z 1 o u 




CAACTGTAAA 


PGTPTGATTA 

\_» VJ X V- . X VJ j \ X X AY 


TPTPTAGPAG 

A V^ A V^ 1 /AvJ V. — nu 


TAGTAAAATG 

A no A MMMM A \J 


TTP A T^TPPT 
1 1 LM IVj 1 Vjvj 1 


tptpptpttp 

1 vj 1 LL ILt 1 IVj 


o o o n 
z z z u 




AATCTTTGAA 


TGTTGTGACT 


AAGTATTGCG 


TCACTTCTTA 


TCACTCCTTT 


GAATGATTCT 


2280 


50 


AAGTTTITCT 


ACGAATAAAA 


GTATTAGTAC 


AACACTCAAT 


GTAGCCAACA 


TATTTTTTTG 


2340 




CTTTGCAAAA 


TCTACTATAA 


CGATTAAGAC 


T AAT AA PA TT 


PPAATTP^PP 
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GTTGCTATAG AGCCTGAGGC TTCTCCAGTA TTGAGCGGTG GTGAGCCAGG TCCACATAAA 54 0 0 

TTACAAGGTT TAGGTGCTGG ATTTATTCCA GGCACTTTGA ATACAGAAAT CTATGACAGT 54 6 0 

ATTATTAAAG TAGGAAATGA TACAGCGATG GAAATGTCTC GTCGAGTTGC TAAAGAGGAA 552 0 

GGTATTTTAG CAGGTATTTC ATCAGGTGCT GCGATTTATG CTGCCATTCA AAAAGCAAAA 558 0 

GAATTAGGAA AAGGTAAAAC AGTAGTAACA GTATTGCCGA GTAATGGTGA ACGCTACTTA 564 0 

TCAACACCTT TATATTCATT CGATGACTAA TTAATGTCAT TTAAAAGAGT GAGTTATCTT 5700 

TTTGAGATAA CTTGCTCTTT TTTTCTACCA TGTATATTTT TAAAAATATG AG CG TTAAAT 57 6 0 

TAAACATTTT TCTGATAAAA ATATCCAGTG AATGATAAGA TAATAAACGT ACATACTAAT 5820 

AACTAGTAAA TAGCAGGAGT AAATTTTATT AGAGTTAAAC AATACATAAT TAAAGGGTGG 5880 

TTAACATGAC TAAAACAAAA ATTATGGGcA TATTAAACGT CACACCTGAT TcATTCTcAG 594 0 

ATGGTGGAAA ATTTAATAAT GTTGAATCAG CTATAAATAG aGTGAAAGCC ATGATAGATG 6 00 0 

AAGGTGCTGA CATTATAGAT GTTGGAGGTG TTTCAACGAG ACCCGGTCAT GAAATGGTTT 6 060 

CATTAGAAGA TGAGATGAAC AGAGTATTAC CTGTTGTTGA AGCTATTGTC GGTTT 6115 

25 (2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 01 base pairs 
(3) TYPE: nucleic acid 
30 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

35 

TAGATACTGG GnTAAAcaTc AAAAATAtyT GCtTaTTCaC GTGTTTAcGc TCCCtCAAAC 60 

GCAACGTTAA TTGCGTGTAA TCATTTAGTG TGAATTcAGA CGCTTCTTCC ATGACTATGT 120 

CTGATATGCC TTTTATCGAC TTTATTTTCT CTGGGTTATC TAATCCTTTA AACAAAAAAA 180 

40 

CTGCGCCGTT TGGCAATTCA ACTTTGTTAT CAGTCTTATT CCAAAGGCAC ATGTCCCAAA 24 0 

TACCAAAGTT TATCAAACAA TCTTTAACAT CTTCGAACAA ACTATCTTTA ATTGTTGATT 3 00 

45 GTACTTTTCT AAGCCACAGT ATACGCCTAG GATATTTCCA ATCTTGCAAT GCTTTGAGTA 3 60 

CAACTTTTTG TATAACGCCG TGAGACTTAC CGCTCGAACC TCCACCGTAA TGkACTTCAG 4 20 

TGAAGTtATC GTAATTGGTT AGTATTTCGA ATATGTTTCT ATTGAAAACA TTAGACGGTT 4 80 

50 TGTTAAAGTT TAATTTAACT TTCGTCATCG TACTCACCAA TATTAATCTC AATATTCTTC 54 0 

TGAGTAATTT CTTTTTTATC GATATACGCA CCATGTACTT TTAGTATGTG GTCAATAGAT 600 

55 
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AATTGTTATA GCAGAAAATA ATTGTAAAAC AAGTTACTTC ATTATTTAGA ATGATGGGTG 36 0 0 

TAGAATAAGT ACAATTGTTG CATTTTATGA AGTAAAGTAA TTTTTTAAAT ATAGAGTAAT 3660 

AGAGGAGATT GAAATAATGA CACACGATTA TATTGTTAAA GCATTAGCAT TTGATGGAGA 3 72 0 

GATTAGGGCT TATGCTGCTT TGACAACTGA AACTGTTCAA GAAGCACAAA CGAGACATTA 378 0 

TACATGGCCG ACAGCATCTG CTGCAATGGG AAGAACAATG caCAGCAACA GCTATGATGG 3 84 0 

GCGCAATGTT GAAAGGTGAT CAAAAATTAA CTGTCACTGT AGATGGCCAA GG AC CTATTG 3 900 

GACGAATTAT TGCCGATGCA AATGCTAAAG GCGAGGTGCG TGCTTATGTA GACCATCCAC 3 96 0 

AAACTCATTT TCCATTAAAT GAGCAAGGTA AACTTGATGT AAGACGAGCG GTAGGGACAA 4 020 

ATGGATCTAT TATGGTTGTT AAAGACGTTG GAATGAAAGA CTATTTCtCT GGAGCAAGTC 4 08 0 

CaATTGTTTC AGGAGAACTT GGTGAAGATT TTACTTATTA TTATGCTACA AGTGAACAAA 414 0 

CACCTTCATC GGTAGGTCTT GGTGTATTGG TAAATCCTGA TAATACGATT AAAGCAGCAG 4 2 00 

GAGGATTTAT CATTCAAGTT ATGCCAGGTG CCAAAGATGA AACAATTTCA AAATTAGAAA 4 260 

AAGCAATTAG TGAAATGACA CCAGTTTCTA AATTAATTGA ACAAGGATTA ACGCCAGAAG 4 320 

25 GATTACTAAA CGAAATCTTA GGTGAAGACC ATGTGCAAAT TTTAGAGAAA ATGCCTGTTC 43 8 0 

AATTTGAATG TAATTGTAGT CATGAGAAAT TTTTAAATGC TATTAAAGGA TTGGGCGAGG 4 44 0 

CTGAGATTCA AAATATGATT AAAGAAGATC ATGGTGCTGA AGCAGTATGT CATTTCTGTG 4 500 

GAAATAAATA TAAATATACT GAAGAAGAAT TAAACGTGTT GCTAGAAAGT TTAGCGTAAT 4 560 

TTAATTTAAA TCAATACGCT AAAATGTTTA TTTTTAGCGG TTTAGTGAAA TGTAGAACTA 4 62 0 

AATAGTTGTA TAATCCTTAG TGATTTTGTT TGCTTTCTAG AATTTATTTG ATAAAATAAT 4 680 

TCTATATCCG ATAAATAAAC TAAGATTTCA ACAACTAACT AAAAAGGAGT GTTCTTAATG 4 74 0 

GCAGAAAAAC CAGTAGATAA TATTACTCAA ATTATTGGCG GTACACCGGT AGTCAAATTG 4 800 

AGAAATGTAG TAGATGACAA TGCAGCAGAT GTTTATGTAA AATTGGAATA TCAAAATCCA 4 860 

GGTGGTTCTG TAAAGGATAG AATTGCTTTA GCAATGATTG AAAAAGCAGA GCGAGAAGGC 4 920 

AAAATTAAAC CTGGCGATAC AATTGTAGAA CCAACAAGTG GTAATACAGG TATCGGTTTA 4 930 

45 GCATTTGTAT GTGCTGCTAA AGGATATAAA GCAGTATTTA CTATGCCCGA AACAATGAGC 5 04 0 

CAAGAGCGTC GTAATTTATT AAAAGCATAC GGTGCGGAAT TAGTTTTAAC GCCTGGATCA 5100 

GAAGCGATGA AAGGTGCAAT TAAAAAAGCT AAAGAATTGA AAGAAGAACA TGGTTACTTC 5160 

50 GAGCCACAAC AATTTGAAAA CCCTGCGAAC CCTGAAGTTC ATGAGTTAAC TACAGGTCCT 522 0 

GAGTTATTAC AA^AATTTGA A f^C A AAA T A^^AT^rCT ^rn^^^T^n t^ t .-s-" r - n - 
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TTTCTTCCTA AGCCAAGCAC AAGGTGGCGG 
ATCTAAAGCA AAAATGTACG ATAATAATAA 
5 GGCAGATGAA GAAAAACAAG AATTAATTGA 

ATTCAAAGAA ATGGGATCTA GGATTCCTAA 
TGGTAAAACA TTACTTGCTA GAGCGGTTGC 

W 

TAGTGGTTCA GACTTTGTAG AGATGTTTGT 
ATTCGATAAT GCTAAGAAAA ACGCGCCTTG 
TGGTCGTCAA CGTGGTGCAG GTGTTGGTGG 

15 

CCAATTATTA GTTGAAATGG ATGGTTTCGG 
TACAAACCGT CCTGATATCC TTGACCCAGC 

20 AATTCAAGTT GGTCGTCCAG ATGTGAAAGG 

AAACAAACCA CTTGATGAAA CGGTTGATTT 
CTCAGGTGCT GATTTAGAGA A CTT ATT AAA 

25 TAAAAAGAAA ATTGACATGA GAGATATCGA 

TGCTAAGAAA TCTCGAGTTA TTTCTAAGAA 
TGGTCATACA ATTATCGGTA TGGTACTTGA 

30 TGTTCCACGT GGACAAGCAG GTGGTTATGC 

AATGACTGAA CAAGAGTTAT TAGATAAAAT 
AGATATTAAC TTTAACGAAG TATCAACAGG 

35 

AATCGCACGC TCAATGGTTA CGCAATATGG 
CGGTCATAGC AATGGTCAAG TATTCTTAGG 
AAGCCAAATC GCATATGAAA TTGATAAAGA 

40 

ACGTTGTAAA CAAATTTTAT TAGAGCACAA 
ATTAACAGAA GAAACATTAG TTGCTGAACA 

45 ACCTGAAATT GATTATGATG CAGCTAAAGT 

TGGTAAATTC GGTAAATCTT ATGAAGAGAT 
TGACGAAAGT GAAGATCGTA AAGAAGAAAA 

50 TAAATCTGAT GAAAAAGATG AACCAG CACA 

CGATCCAAAT CACCCAGACA ATAAATAATC 



TAGTGGCGGT CGTATGATGA ACTTTGGTAA 18 00 

ACGTCGTGTT CGTTTCTCTG ATGTAGCAGG 18 60 

AATTGTTGAT TTCTTGAAAG ATAATAAAAA 1920 

AGGTGTCTTA CTTGTTGGAC CTCCAGGTAC 1980 

AGGTGAAGCT GGCGCACCAT TCTTCTCTAT 2 04 0 

TGGTGTTGGT GCGAGCCGTG TTCGTGACTT 2100 

TATCATCTTT ATCGATGAGA TTGATGCTGT 2160 

CGGTCATGAT GAACGTGAAC AAACCCTAAA 222 0 

TGAAAATGAA GGTATCATTA TGATAGCTGC 22 80 

CTTATTACGT CCAGGTCGTT TTGATAGACA 234 0 

CCGTGAAGCA ATTCTTCATG TTCATGCTAA 24 00 

AAAAGCAATT TCACAACGTA CACCTGGTTT 2460 

TGAAGCATCT TTAATTGCTG TACGTGAAGG 2520 

AGAGGCAACG GATAGAGTTA TAGCCGGACC 2580 

AGAACGTAAT ATTGTTGCTC ATCACGAAGC 2640 

TGAGGCAGAA GTAGTGCATA AAGTTACTAT 2700 

AATGATGCTA CCTAAACAAG ATCGTTTCTT 2760 

CTGTGGTTTA CTTGGTGGAC GTGTATCAGA 2 320 

TGCTTCAAAT GACTTCGAAC GTGCAACACA 2 880 

TATGAGTAAA AAATTAGGAC CATTACAGTT 2 94 0 

TAAAGATATG CAAGGTGAGC CTAATTATTC 3 000 

AGTTCAACGA ATCGTTAAAG AACAATACGA 3 06 0 

AGAACAATTA ATTTTAATTG CTGAAACATT 312 0 

AATTCAATCA TTATTCTACG AAGGTAAATT 3180 

TGTTAAAGAT GAAGATTCTG AATTTAATGA 3 24 0 

TCGTAAAGAG CAATTAGAAG ATGGACAACG 33 00 

AGATATTGCT GAGGATAAAA AAGAAGCTGA 3 360 

TCGACAAGCC CCAAATATCG AAAAACCTTA 3420 

GATTATATTC AGTACCTCTT TCTATGATAA 34 80 



55 



EP0 786 519 A2 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

GAGGTTTCTA GACAAGCTTT TAATAACTTA CCAAACTCAT TAAgrTGGTT gTGtTGGACT 6 0 

5 GCCtATTATC mAAGtATTAT GaGTTGTTTA ATATTAGtGC TAArACATAC GAAGAGTGGT 12 0 

TTAAACAATT TAGTAGTAAG AAAGCACAAT TCAGTATTAA TCTCACGGAT AAATGGATAA 180 

TTCAAATCGC ATATGGTAAA TTAATAATAA TGGCTAAAAA TAATGGCGAT ACATATTTTA 24 0 

w 

GAGTTCAAAC AATTAAAAAG CCAGGTAATT ATATTTTTAA CAAATAT CGA TTAGAGATAC 3 00 

ATTCTAATTT ACCAAAATGT TTATTTCCGC TTACAGTGAG AACACGACAA AGTGGCGATA 3 60 

CATTTAAACT GAATGGGCGC GATGGTTATA AGAAAGTGAA TCGCCTGTTT ATAGATTGTA 420 

15 

AAGTGCCACA GTGGGTTCGG GATCAAATGC CAATCGTATT GGATAAACAA CAGCGCATTA 4 80 

TTGCGGTAGG AGATTTATAT CAACAACAAA CAATAAAAAA ATGGATTATA ATTAGTAAAA 54 0 

20 ATGGAGATGA ATAGCGTTAT GCATAATGAT TTGAAAGAAG TATTGTTAAC TGAAGAAGAT 600 

ATTCAAAATA TCTGTAAGGA ATTGGGAGCA CAATTAACAA AGGATTATCA AGGTAAACCA 660 

TTAGTATGCG TGGGTATCTT AAAAGGCTCA GCAATGTTTA TGTCAGATTT AATTAAACGA 72 0 

25 ATTGATACCC ATTTATCAAT TGATTTCATG GATGTTTCTA GTTATCACGG AGGCACTGAG 78 0 

TCAACTGGTG AAGTTCAAAT CATTAAAGAT TTAGGTTCTT CTATTGAAAA TAAAGACGTA 84 0 

TTAATTATTG AAGATATCTT AGAGACTGGT ACTACACTTA AGTCAATTAC TGAATTATTA 900 

30 

CAATCTAGAA AAGTTAATTC ATTAGAAATA GTTACTTTAT TAGATAAACC AAACCGTCGT 96 0 

AAAGCGGACA TTGAAGCTAA GTATGTAGGT AAAAAAATAC CAGATGaATT TGTTGTTGGt 102 0 

TACGGTTTAG ATTATCGTGA ATT AT AC CGA AACTTACCAT ATATCGGTAC GTTAAAACCT 1080 

35 

GAAGTGTATT CAAATTAATT TTTTAATCAA TTTCAGTTAT TATTACTATG CGTTTGAGAA 114 0 

ATAATAGTGT AGACTCAAAA ATATGAAAAA TGTATTTCAT ATATATTTAA TTTTAGACAA 12 0 0 

GACATATGTC TTGAAAAGTT GAAAAATATA GAGATTGATA AAACTAATAC GGGTGTGAAT 12 6 0 

40 

GACATTGATG TTAAGCTCAA TTACTAGCTT ATAAAACATG TCATATGTTA CAATTTTTGT 132 0 

TAGTTTTATT ATGGGAAGTA GGAGGAAATG ACGCATGCAG AAAGCTTTTC GCAATGTGCT 13 8 0 

45 AG7TATCGTA ATAATAGGCG TTATTATTTT TGGTCTATTT TCATATTTAA ACGGTAATGG 14 4 C 

AAATATGCCG AAA C AG C IT A CATATAATCA ATTTACTGAG AAGTTGGAAA AAGGTGACCT 15 00 

TAAAACTTTA GAAATCCAAC CACAACAAAA TGTCTATATG GTAAGTGGTA AAACGAAAAA 156 0 

^° TGATGAAGAC TATTCATCAA CTATTTTATA TAACAACGAA AAAGAATTAC AAAAAATTAC 162 0 

TGATGCTGCT AAAAAGCAAA ACGGTGTAAA ATTAACGATT AAAOAAGAA^ AAAAAPAAAH i £ Q " 
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AG AT ATT CAA ACCACGTGTA CTCAAAATGA TAGCTTGGTA TGTACCTCCA ATAGTAATTT 96 0 

CAATAACTTT GTCTGTTGAA CACTAAGAGC AATTTTAATT TCATAATGTG TTGTAAACAT 1020 

TTTTTTTGAT TGGAGTTTTT TTCTGAGTTA AACGATATCC TGATGTATTT TTAATTTTGC 108 0 

ACCATTTCCA AAAGGATAAG TGACATAAGT AAAAAGGCAT CATCGGGAGT TATCCTATCA 114 0 

GGAAAACCAA GATAATACCT AAGTAGAAAG TGTTCAATCC GTGTTAAATT GGGAAATATC 1200 

ATCCATAAAC TTTATTACTC ATACTATAAT TCAATTTTAA CGTCTTCGTC CATTTGGGCT 126 0 

TCAAATTCAT CGAGTAGTGC TCGTGCTTCT GCAATTGATT GTGTGTTCAT CAATTGATGT 1320 

CGAAGTTCGC TAGCGCCTCT TATGCCACGC ACATAGATTT TAAAGAATCT ACGCAArCTC 1380 

TTGAATTGTC GTATTTCATC TTTyTCATAT TTGTTAAACA ATGATArATG CAATCTCAAy 144 0 

ArATCTAATA GTTCyTTGCT TGTGTGTTCG CGTGGTTCTT TTTCAAAAGT GAATGGATTG 1500 

TGGAAAATGC CTCTACCAAT CATGATGCCA TCAATACCAT ATTTTTCTGC AAGTTCAAGT 1560 

CCTGTTTTTC TATCGGGAAT ATCATCGTTA ATTGTTAACA ATGTGTTTGG TGCAATTTCG 1620 

TCACGTAAAT TTTTAATAGC TTCGATTAAT TCCCAATGTG CATCTACTTT ACTCATGCGT 1680 

25 TTGATAAAAA CTTAAATAAT ATTAATTCGG TCATCAGTGG CGTTAAATCT TTTATCATTT 174 0 

TTAGTTATAG TTGATAAATT TATATTTATA AGCATATATG GATATTTCAT CAAAAATTTT 1800 

TATTTATATA AATCCGAACT GCATACATAT TTGTTTAAAT AAGAGGTATT ATTTTTCGGG I86 0 

AAATTGCTGT CTGAGTTAAA AGGATTAGTT TTATAAAATG AGTTGAACTA TAGCCAAAAA 1920 

CGATTAAAAT ACTGATAATC CATTTTTGtA TTATGTTAGG GACTTTTTTA CTTAATTTTA 1980 

ACCCTATTGG aGCmAATATA ATACTCCCTA TTATAAGGAA TAAGGCGTCA TATAAaGGGA 2 04 0 

TATAAC CTTG AATAAGTTTG ATGACAAAAG CACCAATTGA AGATATAAAA GCAATTACTA 2100 

TACfATTAGC GACTACAGTA TTCATTGGTA ATTTGAATAA AACCAATAAT ATAGGAATAA 2160 

TAATGAAGGC ACCACCTGCA CCTACTATAC CTGAAATAAT ACCAATGAAA AGGCCAATGA 2220 

TAACTAATAA ATATTTATTA AATGAAGACT TTTCGGAACT AGGTTtCACT TTAATAAACA 2280 

TTAATGTTAA TGCAAGTAAA GCAATAATGA TATATACCGT ATTTACAAAT GTAGCATCAA 234 0 

ATAAATTTGC TAGAAATGCA CCTAACATAC TCCCT 23 75 
(2) INFORMATION FOR SEQ ID NO: 148: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TTATTTAATG ATT ATT CT AT ATATGATAGT ATAATGAAAT GTAGATAGGT ATTTAATTTA 2 8 80 

ACAGAGGTGA AATTGAGATG TGGAATTTTA TTAAATGtGT GkTTAAATTC GTATTTAGCT 2 94 0 

TAGTTGCTAT TACAACATTA GTTGCTGGTG TTGGTGTAGT AGCATTTGCT TATATCTTTA 3 00 0 

AAAAAGATTT TGAAGATATT GAAAGAAAAA CTAAAGAAAT TATTTCTGAT ATTGAAAGTA 3 06 0 

AAAATAACTA ATAACATTTA GAGGCTGGGA CATAAATCCC TAAAAAACAG CAGTAAGATA 3120 

ATTTTCAATT AGAAAATATC TTACTGCTGT TCTCTATTTn ATcArnTACTt CGTATTGAAT 3180 

GGCTTCGCTT TCCTAGGGTG CCGTCTCAGC CTTGGTCTTC GACTGGCACT GCTCCCTCAG 3 24 0 

GAGTCTCGCC ATTAATACTA CGTATTAACA TGTAATTTTA CTTTGGAAAT ACTTTTAAAA 3 3 00 

AATAAGACAC TTTGGCCCAA CTTGGCACAT AAATGTAAAA TTCAAT 3 34 6 
(2) INFORMATION FOR SEQ ID NO: 14 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) t.knctw ■ 23*75 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 7: 

GTTGAAGAAA GAAATATAAC AGTCAATTAT AATTATAACC TTGTTGAAAT CGACGGTGAC 6 0 

AAAAAAGTGG CTACATTCGA ACATATCAAA GCATACGATA GAAAAACAAT AAGTTATGAT 120 

ATGTTACATG TAACACCACC TATGGGTCCC TTAGATGTAG TAAAAGAAAG TACACTTTCA 180 

GATAGTGAGG GTTGGGTAGA TGTTAACCCA ACCACATTAC AGCATAAAAG CTACTCTAAT 24 0 

GTATTTGCAC TTGGTGATGC TTCAAATGTA CCTACTTCAA AAACAGGCGC ACTATTcGTA 3 00 

AGCAAGCACC TATCGTCGCT AATAATTTAT TGCAAGTGAT GAATAATCAA ATGTTAACGC 3 60 

ATCATTATGA TGGTTATACT TCATGCCCTA TTGTTACTGG ATATAATAGG TTAATACTTG 42 0 

CAGAGTTTGA TTATAATAAA AATACTAAAG AAACAATGCC GTTTAATCAG GCCAAAGAAC 4 80 

GTaGAAGTAT GTATATATTT AAGAAAGATT TATTACCTAA AATGTATTGG TACGGCATGC 54 0 

45 TAAAAGGATT AATATAATAA AGTACAGAAA ACAATAAATT TTTAATGAAA AATCTTTTAC 6 00 

TATAAAAGAT TAAGTATTTA AATGACGTGT CAGTGTTGTG TTTATATGTC GTGAATTTTT 66 0 

AG CTCT AAAT AGTATAAGAT TGAAAAAGTT GTTACTGTTT TAAATGATCA CGATGAAGTC 72 0 

50 ATTCAATAAG AATGATTATG AAAATAGAAA CAGCAGTAAG ATATTTTCTA ATTGAAAATC 7 80 
ATCTCACTGC TOTTTTTTAA AGGTTTATAC CTCATCCTCT AAATTATTTA AAAATAATTA 
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AGTAAAAGTA ACATTAAAAA TAGCAATATT 
AAGAGTCAAA TTAAACAATC CATATTTTTA 
5 ATGCCATCAA AAGTAGATGC AAAAGCTTCT 

GATAGTAAAC CTGAAGACAC TATACTAAAG 
AAAAATAAAA CATTTACTAA TGGtAAAGTT 

10 

ACGATTGATG GTAATATCAA AGTTGAATAA 
CATAATTTCT CTAATATCGG TGTTATTTAT 
TTGCCAGTAA AGTGAGTGAT ATTAAGGTCT 

15 

ATTTCTGGTG GCGCATTTTC TCCCAATGTA 
ATTGATGTAT GGATGGTGCC AGACCAGCTT 

20 GGATTATTGA ATAACTTAAA TGCTGTAGTC 

GTACGCGCCA GTCTTTCTTT AGATTCTTTT 
TCAAAATGAT TTGTACATAT ATTATCATAA 

25 ACAATTGCAT GGTTCAATGA TTTGTCCATC 

ATTTCTTTCA ATAATTGGAT TGCTTCTGTT 
CCAATCATAT AACATACAAA ACCATTTGCT 

30 

ATAGTTAATC CTGACTCATT CATACCATCC 
ATTTGAGCTA AACCGCTATC TGTAGGTTGA 
GGGTGGTAAT CATAATTTCT AACCATGAAG 

35 

TCTTTTAAAT CGGTAAAACG ATAATGTCCA 
TTGAGTATAC TTTGTAGTCC CATTAATTCT 
ATTTGATAAG TTTCATTTAC ATCTATATCG 

40 

TCTCGATTTT TTAGAAGAGG TGTTTGTTGA 
AAATGTGAAC CTCTAAAAGT CATGATATCT 
45 TTCTTTTTAG TTGTAATATA TTGTAAATAA 

TTAAATAAAG TTATATTTTA CTAAATGAAA 
TATAAGACTT ATCAATCAGT TAAAACATAT 

50 

AACTTAAAAT CATCTTATCA TAAATATCAA 
TGTTAACATT AGCATATAAG GTCATGTCAC 



AAAGTTGTTA TTGGTACGCT ACAAATCGAC 10 8 0 

AACGATCATG GTGACATTGA ATTTAAAAAC 114 0 

ACTAAACAAG GAGATATTCG TTTTAAGTAT 12 0 0 

CTAAATCCGG GAACGGGTGA TAGCGTAGTT 126 0 

GGGAAAAGCG ACAATGTTTT AGAATTTTAT 13 2 0 

ATAAAGGATG TAAGCACCGA TATTAGGAAG 13 8 0 

TTGTTGGCAA AAGTTAAGTC GGTATCTATA 144 0 

TGACCATCTA ACCATGATTT GAAATCTATT 1500 

AAATATGCAG TTAATGTTTC AGGTTGATAC 1560 

TTGAATAGTT TACTGTAAAT TTCATACTGA 1620 

ATATCTAAAT TATCATTAGT TTGTGAAATG 16 8 0 

GTATAATTAC GATTTTCATG TGTTAATATT 174 0 

CGAACATCTA TTGATCTCGG TGTCACTTCA 1800 

AGTATGTAGC TAAATGAGCT TCTGTGTGGT 186 0 

ACATTTCGGC AATTTTCAAG AATTAGACGA 192 0 

GGTTTCTTCC GGTGCATAAA GTTATAGCCC 198 0 

ATTCTTCCAG TTACCCTTGA TACAGGACCA 204 0 

TAAAGTAAGT AGCGACCATC ATAAGTTGCA 2100 

TCTTTGCCTT GAAAGACCGT GCAaCCACTT 2160 

AAGTTTAAAA TAATTTGGCG TGTTGGCATT 222 0 

TCCCATATTT GAGGTGCGTA TGTTTGGAAT 22 8 0 

AAACGTGGGA CaCnTTTTTT CCATTCTTTT 234 0 

AGCCATTTAC CAGTTTTAAC ACCTAACTCG 2400 

GATGTCACTT GTTGCATATC ATCGGCCCCT 24 60 

ATAGTAATCG TATGTATATT GAATGTCATG 252 0 

TATAAAATTG TTTGAGGTGA TTTCTCGGTG 2580 

TTTTATAGAT GGTGGGGATA TTGAGTTAAA 2 640 

TCTTAAGTTA GCATT CACGA TAATAGTCAT 2 700 

GTTGAAACAG AGGTTCCTCG GCATTTTTGA 2760 
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TATGGTAAAA CATTTACAAG AC CAT ATT CA ATTTTTAGAG CAGTTTATAA ATAACGTTAA 36 60 

CGCATTAACT GCAAAAATGT TGAAAGATTT ACAAAATGAA TATGAAATTT CATTAGAGCA 3 720 

5 GTCTAACGTA TTAGGTATGT TAAATAAAGA ACCTTTGACA ATTAGTGAAA TCACGCAAAG 3 7 80 

ACAAGGTGTA AATAAGGCCG CAGTAAGCCG ACGAATTAAA AAGTTAATCG ATGCTTAATT 3 84 0 

AGTTAAGTTA GATAAACCAA ATTTAAATAT TGATCAACGT TTGAAATTCA TAACCTTAAC 3 900 

w 

TGACAAAGGT AgAGCATATT TGAAAGAACG TAATGCGATT ATGACAGATA TTGCGCAAGA 396 0 

TATTACTAAT GATTTA 3 97 6 
(2) INFORMATION FOR SEQ ID NO: 146: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

25 GCTACCTAGG CATTTAAGAG ATCAAAAAAT GTATGAATAT GAACGTTATT TTTATGAGCA 6 0 

AGAACTTAAT GGCGTTGATG aAGGGGAAAT TTTAAAGAAG TTAAAAGACC CACAAGATGT 12 0 

TGCAGCTGAA ACAAAAGCTA GAAGTGTTAT TGATTATGCT GAATCTAAAC CAACATTTGA 130 

30 

AAATATTTCA AGAGCTGTTG CTGCTTCATT AAGTTTAGGC ATTCTATCTA TTTTTGTCAT 24 0 

CCTTATACCA GTATCTATAG TTGGATTATT TGTATTAGCA TTATTTTTAA TATCACTTTT 300 

GCTGCTGTTT TGTCCAATTA TTTTATTAGC ATCAGCAATA TCCAGAGGAA TTGTGGACTC 360 

35 

AATTAGTAAT GTATTTTTTG C CAT AT CAT A TTCAGGATTA GGATTAGTAT TTATCATTGT 4 20 

CATATTTAAG ATTTTAGAAT ACATTTATCG TTTAATCTTA AAATATTTAC TTTGGTATAT 4 80 

TAAAACTGTC AAAGGAAGCG TTAGAAAATG AAGAAATTCT TTTTTATTGG GCTTTTAGTG 54 0 

40 

TTTGTTGTCT TTTTTACAGC AGCAACCATT ATTTGGTTCA GCTATGATAA AAACAAATAT 6 00 

GGTACTAAAC AATATGATAA AACATTCAAA gACGATGCTT TTGACAATGT ATCTATAAAT 66 0 

45 TTGGATAGTA CAGAACTTCG TATAAAACGG GGGAATCAAT TTAGAGTTAA ATATGATGGT 72 0 

GACAATGATA TATTAATTAA TATAGTAGAT AAGACGTTGA AGATTAGTGA TAAAAGGTCT 780 

AAGACAAGAG GATATGCAAT TGATATGAAT CCTTTTCATG AGAATAAGAA AACGTTAACG 84 0 

*° ATTGAAATGC CTGATAAAAT GATTAAACGT TTAAATCTAT CATCTGGAGC AGGAAGTGTT 90 0 
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CACTATCATA ACATGCATCA GCTACAATAT ACTCCGGTAA ATAACCGAAG nTATTTTgAA i860 

TCATTGTTAA AAATGGAATT AAAGTTCTAG TATCTGTTGG GTTTTGAAAT AGGTCATAGG 192 0 

ATAAAACAAA TTGAGAATTT GTCGCTATTT GTAAATTGTA TCCTGGCTTA AGTTGGCCAA 198 0 

AGTGTCTTAT TTTTTTAAAG TATTTAAAAG TAAAATTACA TGTTAATACG TAGTATTAAT 204 0 

GG CG AG ACT C CTGAGGGAGC AGTGCCAGTC GAAGaCAGGG GCCCCAACAC AGAArcTGAC 210 0 

ATATAGTCAG CTTACAACAA TGTGCCGGTT GGGGTGGCTG AGACGGCACC CTAGGAAGGG 216 0 

ACCCGTCATC AAAAATTCTA TTTATAGAAT TTTACAGTAA TGTGCCAGAT GGGCATAGCG 222 0 

AAgcCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC AGTAAGATAT TTTCTAATTG 2280 

AAAATTATTT TACTGCTGTT TTTTTTAGGG ATTAATGTCC CAGACTCTTT AGTTTATTTA 234 0 

TTTTCAATAT AACAATTGTC TAATCAAGGA TTAACGAATA TTTAAAGATA GTTTGACGCA 240 0 

ATATTAGAAA CAACCTATAA TAATAGTTTG TTTGTGGATT AACTATTATA AATAAAAGCG 246 0 

GCGTAAAGAC ATATAAACCA ACTACTTGAA CAATATAACG TTAATAACAA TCTATACTGA 252 0 

TACATTACGC CTAGATAATC TTTGATGAGC ACATGTAAGA AAAAGTGATA TGGTGTATGA 258 0 

25 CTTCCGACAC CATCGATAGA TAAACCTAAT TTTTGGGCTA GTCGTAAGGC GCGCAATACA 264 0 

TGAAACTGAC TTGTtACACA AACAATTTTA ACTGCTTCAT GATACAAATT GTTGATGATT 2 70 0 

TGTTTAGAAT ATAAAAAGTT TGTGTATGTA TTTATAGAGT GAGATTCCAT TAGTATATCT 27 6 0 

GTTTTATCAA CACCATGTGC AATCAAATAA CGTTGCATAG CTAAAGCTTC AGAAATTGGT 2 82 0 

TCGTCTGGTC CTTGTCCGCC AGATACAATG ATCTTTGTTG CTGATGCTTG TTGTTGATAG 2880 

ATATCAAGTG CACGATCTAA ACGCGCTGCA AGCATTGGTG TGACAAATTC GGTAAAAATA 294 0 

CCAGCACCTA ACACAATTAT GATATCAACT TCTTTGTTGT ATGATCTATG TCTATATGAT 3000 

ACTGTCCAAA CGAGATAACA AATAAAGGTT AGTAACAGGG AAAGACATAA TATAGCTAAC 3060 

CACATAGACA AACCTTTCAC AATAGGTGAC TGAATCGTAC TTATAAATAG AAGTGCTGAT 3120 

GTGTAGAGTA CAAATTTATA TGAAAAAGAT AATAATTTTT TAATAAATAA GCGACTAGAA 3180 

GTATGAGAAA ATAAATATCT ATGTTTGAAT AGCATGATAA TACTGATTAT TATAAATGTT 324 0 

45 ACAAACATAG ACCAAGGGAA AGTATAGGTC ATGATGCTAT AGATGAGTGA CAAAAATATC 3300 

GATATGACAA CTAAGATGTA GCATGTTAAA TTTAACGTCA GAGTATAGTT GAAAATTAAC 3360 

GGACAAATAA CGATAAGTAT AAATATTAAT AATAAATTCA ATAACATACT GACACCTCGC 342 0 

50 TTATAATAAA TATTAAATAT AAATGTAGAT GATTTAATTT ATTAAAGCAA GGAGAAAGCA 34 8 0 

GCAACATGTA AATCTTAATT TGTTATATTA TATATGGGTC AATATTTTTG TGTTTTTTAG 354 0 
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AGGTGATTAT CCTAAAAATG CTCATGAGGT CGCTATTAAT GATAAGTTAG CTGCAGACAA 6 0 

CATTAGAGTC GGGGATAGAT TACATTTTAA AAATAATTCA ACTAGTTATA GAGTTTCTGG 12 0 

5 TATTTTAAAC GACACAATGT ATGCGCATAG TTCCATTGTG CTATTGAACG ATAACGGATT 180 

TAATGCATTG AATAAGGTTA ATACGGCATT TTATCCAGTG AAAAATTTAA CACAACAACA 24 0 

ACGTGATGAG CTTAATAAAA TAAATGACGT TCAAGTTGTG AGTGAAAAAG ATTTAACAGG 3 00 

w 

TAATATTGCG AGTTATCAAG CAGAGCAAGC ACCGTTAAAT ATGATGATTG TTAGTTTGTT 360 

TGCTATTACA GCAATCGTTC TAAGTGCATT TTTCTATGTT ATGACGATTC AAAAAATATC 420 

ACAAATTGGC ATTTTGAAAG CAATTGGTAT TAAGACAAOA CATTTATTGA GTGCGTTAGT 480 

15 

TTTACAAATT TTAACACTAA CAATAATTGG GGTAGGTATT GCTGTGATCA TCATAGTAGG 54 0 

ACTATCATTT ATGATGCCGG TAACGATGCC TTTTTACTTA ACAACGCAAA ATATTTTATT 6 00 

P0 AATGGTGGGG ATATTTATAT TAGTAGCGAT TTTAGGTGCC TCACTATCAT TTATCAAATT 660 

ATTTAAAGTG GATCCTATCG AAGCAATTGG AGGTGCAGAA TAATGGCATT AGTCGTTGAA 72 0 

GATATCGTCA AAAATTTCGG AGAAGGTTTG TCTGAAACAA AAGTTTTAAA AGGTATTAAT 7B0 

25 TTTGAAGTGG AACAAGGGGA ATTTGTCATT TTAAATGGTG CCTCTGGTTC TGGGAAAACA 84 0 

ACATTGCTAA CGATATTAGG CGGATTGTTA AGTCAAACGA GTGGTACAGT GCTTTACAAT 900 

GATGCGCCAT TGTTTGATAA ACAGCATCGT CCTAGTGATT TACGATTGGA AGATATTGGT 96 0 

30 

TTTATTTTTC AATCTTCACA TTTAGTTCCT TATTTAAAAG TGATAGAGCA ATTGACACTC 102 0 

GTAGGTCAAG AAGCGGGAAT GACCAAACAA CAAAGTTCAA CAAGAGCAAT ACAACTTTTG 108 0 

AAAAATATTG GTTTAGAAGA TCGCTTGAAT GTATATCCGC ATCAGTTATC TGGCGGTGAA 114 0 

35 

AAGCAACGTG TTGCGATTAT G AGAG CATTT ATGAATAATC CGAAAATCAT TTTAGCAGAT 12 0 0 

GAGCCCACAG CAAGTTTAGA TGCCGATAGA GCAACAAAAG TTGTTGAGAT GATACGTCAA 126 0 

CAAATTAAAG AACAACAAAT GATTGGTATT ATGATTACAC ACGATCGAAG ATTATTTGAA 13 2 0 

40 

TATGCAGATC GAGTGATTGA ATTAGAAGAT GGCAAAATAA CTGATTAGTG GCTTGTAAAG 13 8 0 

ACGCTAAATG TTAATGATTT AAGACATAGT AGTATAAAAG TTAGATAACA GAATACGATT 14 4 0 

45 TGGGTTTACA AAAAACAGGC TGGGACATTA AGTTCTTAGG CAATGTAAAA AAGCTGATTT 150 0 

CTATTAATTA TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTTATAATG TACAGCTCGT 156 0 

TGCATTCATA TAGCTTGAAG TCACGTTTAA AACCATATCT ATCATTATGG TATGCATATC 162 0 

o0 TTTTAAAACC TATTCTTTTG TTATTAGGAC ATATAAATTC ATCATTAAGT TCGTCATATT 168 0 
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GTGTGGATTG GATTTTAAAA TCACCCTCAT AAATACTGTC ATCAATATGA TAAGTTACAA 12 0 

TTTCACCTAT TATTAAATCA GCCCCATCTA ATACATCTCC AAGCAATATC ATTTGCGmTA 18 0 

GTTTACATTC GAATCTCATT TTCGCATCTT TAATTCCTGG CGTCTTAATC GTTGTAGATG 240 

TTAAAAGTGA TAATTCTGTA CGACTCAACT CACTGTCACC ATATGCTAAC GGCGCTGCAG 3 00 

TCTCATTAAT ATCTTGAACA TTATCTTCGT CTGTAATATG CACAACAAAG TCTCCAGTCC 360 

GTTCTATATT TAATGCAGTA TCTTTTCTCT TACCTCCTGC ACGTTGAACT GCAATAGCAA 42 0 

TCATTGG CGG ATGATTATTA ACAATATTAA AAAAGCTAAA TGGTGCTGCA TTTACTGATG 4 80 

CATCTTGATT TAATGTTGTA ACAAAAGCTA TAGGTCGTGG AATAATTGAA CCAATTAATA 54 0 

ATTTATAGTT TTCTCTAGCA GTTAATGATT GTGCATCAAA CGTATACATA ATAC CTACCT 6 00 

CTTTTCTAAG TATATCTAGG TATTTCTCCG ATTTTGGTTA ATTTAAACAT CTATTCTCCT 660 

20 CTGAAAATCA CTTGTATTTA TTTAGCAAAT CTTTTGAAAT ATGACACATA TGCATATCTT 72 0 

CTGGATATTT TTCTAAATGT TGCTGATGTT CTTCAGCACT TTTAATGTAG TTAGACAGCG 780 

GTAAGACTTC CACTGCAATT TGATCTCTGT CTTTACGTCG TTCAATGAAC TGACGCGCTT 840 

CAATTAAGTG GTCATCTACA CAACTATATA AACCCGTTCG ATACTTTTGT CCAATATCAT 900 

TTCCTTGTTG ATTCACACTG TAAGGATCAA TGATTTCAAA TAAATAATTC ATAATGTCTG 960 

TAATTGTTAA CATACGATCA TCGAAATGAA GTTTGACACA TTCAGCATAA CCATCATACG 102 0 

GACCGTCTAA TTTAGAGCTT CTTCCATTTG CTCTTCCTGC TTCTGTATGT ATAATTCCAG 1080 

GTATTGTTGC AAAAAATGCT TCAACACCCC ATAAACATCC TCCTGCTACA TAAACAACTG 114 0 

CCATATTTAC ACCTCATCAT CCTTTTTTAT ATTTTTAACA AGGTTATACC ATTTAATACC 120 0 

GCCATGACAT GATTCTGATA CACCTTCATT ACGATACCCA TATTTTTCAT AAAATGAAAT 126 0 

TAATGATTCT CGACATGTTA ACGTTACACC ATGTCGATGA TGATTCTTAG CAAGAGTTTC 1320 

AAAATAGTTT AGTAAGCGAC CTGCAATACC CTGACCTTGA TAATTTGGTG CTACAACAAG 13 80 

ACCTAACACA CTAATATAGC CACCTTCACT ATTATTTGTG GAGACATTTT TAAATAAATC 144 0 

ATCGCTAATG TAACGCTCTT TTATGACTGG ACCGTTG 14 77 
4$ (2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 976 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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15 



AACCTTGTTG CATAAGCCAA TTTAAAGTTT GTGGCACAAG CGAAATGTGC GTGATTCGTT 900 0 

CATTTTTAAT CATCGTTAAA ATTTGTTCGG CATTGAATTT ATCAACAATG CGCACAGTAA 906 0 

AACCTTCAAT AACAGCTCTT AAAAGTACAC TGAGACCCGA AATATGATAA ATCGGCAAGA 9120 

CAGATAGCCA ATTAGTGTCA CGATCAAATC CCAAGCTCTC TTTACAT CCG ATTGCACTGG 918 0 

CATAATGATT ACGAAACGTT TGTGGCACCG CTTTTTGAGG GCCCGTTGTC CCTGATGTAA 924 0 

ACATAATCGA TGCAATGTCA TCTAAATTAA ATGATGTATT TAATATGTTG GACGGCGACT 9300 

CTTTCGGCAC CACAGTTTCA TTCGATGTTT CATATTGGAT AC C CATTGTG TTGTCCAACA 93 60 

AACTGTTCGT TGTAATATCC CTTCCAGCGA ATTCAATATC ATCCAGCGAT ACAATTTGAA 942 0 

ACCCTCGTAA TTCCAGTGGC AAGGTACAAA AAATCAATTG TACATCGATT GACTTCATCT 94 BO 

GATTCGTCAT CTCATTAGGT GTCAACCTTG TATTAATCAT CGCAATTTCA ATATTTGCCA 954 0 

20 ACCAACATGC ATGTATTAAA ATGATCGATT GAATCGAATT ATCTATGTAT AGCCCAACAC 9600 

GAGATTGTTG ATAAGC CTTG AGTCTTTTAG CCAATAGACT CGCTTCACAG TATAAATTTT 9660 

GATAAGTATA AGATTCTTGA CCGTCTGTTA TCGCAATATG ATGTCCATTT TGTTGTGCTT 972 0 

25 GTTTATATAA CCAAAAGTCC ATGCGTTATT CCTCCAAAAT CATTTACATT ATAATTATAA 978 0 

CGATTTTATG ACATTCTAGC AGTGGTTATG TTTAAAAATA TAAAAAAGTA GACGAATTGA 984 0 

TGCATTGATA TGATTGTTAT AATGCTCAAT ACATATCGTT ATATCATTCG TCTACTATTA 99 00 

TCAGTTATTT TTATTTAATT TTAGTGTCAT TCTGTCATTT TGATGTGGTG ATTTACCCAT 9960 

TGTTGCCACA TCATCTGCAA TGTCAATTGG TATACGGTTC ATGTCTTGTA ATGCACTTAA 10020 

ATGGAATACT TCATCATCTA AATTTTCAAT GAGATATACA TAATATGTTA CCTTGTCCTT 10080 

TTTATATTTT AACGTTTTCC AAAAGTC CGG CTTGCAATTC AATACATTAT CCGGAATATA 1014 0 

TTCAATAAAT AAGTAACGTT TGCTGCCTAC TTTGTCTATG AAATATTTTG CAGTGCCTTT 10200 

TTCTATACCT CTTATATGTG CATAGTCTGC TGAAAAGTAA ATACTACCTA TTGTTTCATT 10260 

ATGTTGTTGT ATTTCAAATC GTTGGCCTAC TATTTTATTA TTTGTGCTAC nGGGGACTTA 10320 
(2) INFORMATION FOR SEQ ID NO: 144: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1477 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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PPTTTATTTA 

V— L X X X n XXX 


ggtatppgtp 


PT Zi TTTT Zi A ZV 
O 1 AX 1 X Innn 


ATGTPATPPG 


PTTCATAAAG 


GCGGCTTTGA 


7200 




PPPTGTTPPG 


TTAJVAAAAAG 

X XnAAAAAAw 


APAAGT P AGP 

A\_AA^7 X LAVjL 


AAGPAAGPAT 


TCACATAAAC 


AT AAC CA TTA 


7260 


5 


ATATGGTTGT 

i \ x n x x x \j x 


AATTGAGTTA 

X 1 O.AO X X A 


TAT PP APT AA 

X A X v. L/\v> x nr\ 


AGGGGGGPGA 


AATTCGAGTC 


GCCCCTCTTT 


7320 




T A A T ATG P PT 


gaatgpgppa 


PPAPATPTTG 
A\— A x >— x x\j 


TTPAAAATAA 


TAACCTGCTG 


GTGTAACATC 


7380 




tpptggata zv 


TPZVPPTTTZlP 


p a p pAAG PAT 


PGCTGTAAAA 


TAGCGGCTTA 


AACCATATTC 


7440 


10 


o 1 AX U\_L~vj 


PPAATAAPPA 
C V-AA 1 nnvv A 


PTTTTGPAPP 

L X X X X wLTVw V— 


ATfiAPTTTTP 


AAAGTATCAA 


TTGCCGTTTG 


7500 




P B / ir j'wi iii » rp t\ 
lllni l > — r\ 


atgppappta 


PAPGAAATGG 


TTTT A AT AP A 

X X X X AA X *V*~n 


APAAPTTTPA 


CATTGTATAA 


7560 


15 


1 itlnl LAAA 


TT A A ' i " i ■ A TP.T 


P Pa AP AAPG A 


TGTTGPPTTT 

XwX X w^-^ X X X 


TCATCAAGGG 

X Wll X W**fiWW w 


CTATTGGAGG 


7620 


T A TT^P'I ** i *p p A 
1 Ai 1L» 1 1 l-l-A 


TPPGPTZi fPT 


PATPAAGPAT 

LA X LAft\J\*A X 


GG AG AT AT PT 

VMAVM X A X X 


TTAAATGGCT 


CTTCGATATA 


7680 




AALiAaL-L 1 vj 1 


T P zi pgpg CT A 

1 V_aLv7V v LtL. 1 A 


ATAAPTGTAA 


PTGTGTGAAA 

W X w X >w X w/VW 


TCTTGACGAT 

x x x wi>wNjr> x 


CCAAGGACTC 


7740 


20 


All ILaL-Al L 1 




GAAAGTGAAA 


GTPT A ATT P P 
o x v» x x x \— v- 


CGTAACACTC 


TAATTTGATG 


7800 




UAlbAl 1 IbA 


pppptpp ZiTT 


TTA ATTTA AT 
X lnnl X xaax 


TPTGGTPGGC 


TTTGTTGCTT 

XXX X X Ww X 1 


TTAATGACTC 


7860 




IALjI 1L?1 1 in 


TTTH Zi T Zk Zi CZ P 
1 1 IVjAIAAoL 


PGPTPGpTGT 

X VvVJL X w X 


PGPTCPATAT 

v.wL X w w\ X A X 


GCTACTGAAA 


ATGAAGGCAG 


7920 


25 


TAPTi TP A A A P 

1 ALA I Lr AAAL 


A 1 1 1 on 1 ALn 


ATf^PPATGZiP 


AATAGTTGPP 


PTTGCAGPAG 


GCGTATTTTC 


7980 




P A A TP A A TPT 


A PT A A ' l ■* l ~ l " i ■ A. 
AL 1 AA 1 1 1 1 A 


^TP-PTPiPTT P 
VJ X vT\> X X X L 


ATAPGTTTPA 

AXALwX X X LA 


AATGATTTAT 

X ^7^2 XXX X 


TT CTATTATC 


8040 




TTPP A A PP A T 


TPP^PP A, A ^P^P A 

luL 1 Lnnl 1 A 


PATGTTTPAP 

W\ X *s7 X X iwiv 


TGAGGPAATT 

XwAWNJLAAX X 


GTTTCATGAT 

XXX X X 


CATACCAATC 


8100 


30 


TrT'l*TT A A A A 


cz rn TT A P Zi TT 


PP PPG AAATA 

\_ L L V- U AAA. X r\ 


TGPJVTTTCCT 


TTGTCATCAA 


TCAATTCGAT 


8160 






X LnLun 1 vjllt 


TTAAAGTGAP 


TTTCGGTGTT 


ACAATTTGTG 


ACTTAAATGG 


8220 




pf^n PT A TAT 

Lx LAL 1A1 Al 


TT A T Zi Zi Zi Zi Zi T 
1 1A1AAAAA1 


GPAAAGPTGT 


PAAPTTPATP 

UAftU X X WA X w 


AAATCATCCT 


CTATACAACT 


8280 


35 


1A1 ilLillu 


TAATTTAPPT 

X AA 1 X X nL v_ X 


GTTGATGTAT 


AAGGTAAAGT 


AT CAAC CTTT 


TCAAAGTGTT 


8340 




T PP/ i^f" A r »■ i " i " i • 
1 AL 111 


A T A TTTPGPT 
Mini X X v-vJv. x 


AAATGTTGTG 


ATAAATATGC 


AATCAATTGT 


GCCTTTGAAA 


8400 


40 


1 Vj 1 LnL 11 


APTGAPAAAA 


TATAATTTAG 


GCACTTGGCC 


CCAAGTATCA 


TCAGGATGCC 


8460 


a p n p a t a p 


TG PGT P APTG 
x \j vj x lav x vj 


ATAPCTGGAA 

A X Avv X ^VJ*V* 


ATTGctTCGC 


TACCGTTT CA 


ATTTGATATG 


8520 




P ATZi A ZiTATT 
VjAlAAAlAl 1 


TT p a p pp. r 1 p zi 


PT A AT A ATT A 


AATCTTTACG 


TCGGTCATAA 


ATCATGACAT 


8580 


45 


AAL Li 1 LA 1 \j 


Zi T r*T Zi TTTP Zi 
nltlnl X 1 LA 


GP A ATGTPAP 


P CGTATTAAA 


AT AAC CATTT 


TCAAACGTAC 


8640 




r^r^r* ' 1 w ! 1 A A ATP 
CCU 1 1 AAA 1 L 


TP TT CZCZ Zi T Zi P 
lull ubn 1 Av, 


AAATATAPAT 

AAA X A X AV»rt X 


T PAT CACATT 


GGCGCCTTTA 


AT CATT AATT 


8700 




CTCCATGACC 


TTCTTTATTA 


GGATTTTTAA 


TTTTTACGTC 


AACATTGGCA 


CTTGGCATCC 


8760 


50 


CTACAGTGTC 


AGGACGTGCA 


TGCAACATTT 


CCGGTGTTGC 


TGTTAAAAAT 


TGCGAACATG 


8820 




TCTCAGTCAT 


ACCAAATGAA 


TTATAAATTG 


GCAGGTTATA 


TTGTAATGCC 


GTCTCTATCA 


8880 
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AAATAGCATC TCCTCGTGTT GATTATTTTG GTTGGCTGAC CAATATTTAT TCTAGCACGT 54 0 0 

AGAGATGCAT TTTTTGTGAC AATGGTAGAA CCTTTTCtGa ACCATACGCA TAGCGTATGG 54 6 0 

TTTTCTTTTT ACAATTAAAG AGCCAACCGT TGTTATAGTC TAACAATGGT TGGCTCCTCT 552 0 

TATTTTATGT GCTAAAAATT TATAGGCAAT TTTATTACAA CAATGTACAT TTAAGGTGAC 5 53 0 

CTTCATGCCA AAATCGCATC ACTCATTTAA TGGAAGCAGC ACGTCTTCAT ATAAAGTACC 564 0 

GATCCCTAAT TCAACGCATG TAGTACCACA TCTTCAAAGC TTGATAGTTC CCATGCGCAC 57 00 

ACCACGTTTC ATACTAGCTA TGCGACTCAA CTTGGTTCAT AAACTCTTTA ATATAAGTCA 576 0 

ATGTTTCAAC CATCGCTGGT GGTCTTGGCA CATGTCCTTC TGCCATTTGA TAAAATGTTT 5 82 0 

CATGCGTGGC ACCTTTTAAC TCTAGTTGGT CCGCTAAATA ATACGCATGA TGAATACCAA 5 88 0 

CTTGCTGGTC TTTCCCTCCA TGTACAATTA ATATTGGCGG ACTGTTTTCA TTAATGTTTG 5 94 0 

20 GAATCGCTTG GCGTGCCTCA TATGCCGCTC GATCTTTTTT CGGATGACCA ATCATTCTTC 6 00 0 

GTAGCATGCC TCTTAAATCG ACACGTTCTT CATACATTAA ATCAATATCT GAGACACCAC 6 06 0 

CCCAGATTGT ATAACTTGTT ACTGGTAAGT CTTGAAATGT CAACAATCCT TGTAAACCAC 612 0 

25 CTCGCGAAAA ACCAACCATG TGGATAAATG CATGTGGATA TTTATCATGT AGCAACCTTA 618 0 

ATAATTG CGT CACATCATTT AAATCGCCAC GGTAAAATTC GTCTTTGCCT TCACTCCCAT 6 24 0 

TGTTACCTCG GTAGTATGGC CCAATCACTA AAGTTTGACT ATCTGAAAAT TGCATTAATC 63 00 

TACCTGCGCG CACACGTCCT ACTTGACCTT TGCCACCTCG CAAATAAACT ACAATGCGAT 63 6 0 

TTACTTCATG ATGTGGTGTC AT CATT AAAG CTTTTACTTG TAAGTCATCT GACAAATATG 64 2 0 

TAATTTCTTC GAATTGATGC GTAAAATATT CAATTGGCAT TCGTTTACGT TTGATAAAAC 64 80 

CCAAGTGATT GCACCCTCTC TACGCATTTT AAAATGGTAC TATCTTGCAG TAAGAAACTC 654 0 

CGTTGTGCGA GTTCAATATC ATTGATACAG TTAAACAACA CTGGCCCTGC TGTTTCTAAA 6 6 00 

TAATCGTTCT TGCTTACCAA TGATTCAACT TCGATAAAAT ATACATCTTT TACAAAATCA 6 66 0 

GTTTGATCAT GTGTTTCAAT GGTATATTGT GCTATGTAAT AAATATTTTT AACTTTGGCG 6 72 0 

CCTGTTTCTT CATATAATTC aCGTGTAACT GCTTCAGCAC TACTTTCCCC GCGTTCCCTT 673 0 

TTACCACCAG GAAATTCAAT CCCCCGTAAA TTATGTTTGG TAAAAAGCAA TTGATTTTTA 6 84 0 

AACGTTGGAA TAGCTAGCAC ATGATTGCCA TCTGCTATCT CATTATCCTT TTTAAATGTC 6 90 0 

AAATTAACTT GACGATTATC TTTATCCCTA AACTTCACGC GCATCACATC CCTACATTGT 6 96 0 

ATGTTAATAT AATAGTTAAT TACTATCGTT GGAGGCATTA ATTATGAAAA AGATATTCTT 702 0 



30 



35 



40 



50 



10 



75 



20 



EP0 786 519 A2 

ATGTTTATTA GACCTGAATC AAAAGAAGAA GCTACAAAGA TTAAACCTAA CTTCACTATC 3 600 

GTTTCTGCAC CACATTTTAA AGCAGATCCA GAAGTTGATG GTACTAAATC TGAAACCTTT 3 660 

GTCATTATTT CATTTAAACA CAAAGTCATT TTAATC GGCG GTACTGAATA CGCTGGTGAA 3 720 

ATGAAAAAAG GTATCTTCTC TGTAATGAAT TATCTCTTAC CGATGCAAGA TATTATGAGC 3 780 

ATGCATTGCT CAGCAAACGT TGGTGAAAAA GGCGATGTTG CATTATTCTT TGGTCTATCT 3 84 0 

GGCACTGGTA AAACAACCTT ATCGGCTGAC CCACACCGTA AACTAATCGG TGATGATGAA 3900 

CACGGCTGGA ATAAAAACGG GGTCTTTAAT ATCGAAGGTG GCTGCTATGC AAAAGCAATT 3 960 

AATCTTTCCA AAGAAAAAGA ACCACAGATT TTTGACGCAA T CAAATATGG TGCAATTTTA 4 020 

GAGAACACTG TAGTTGCAGA AGATGGTTCA GTGGACTTTG AAGACAATCG TTATACAGAA 4 080 

AACACGCGTG CCGCTTATCC AATTAATCAC ATTGACAATA TTGTAGTACC ATCTAAAGCA 414 0 

GCACATCCAA ATACAATTAT TTTCTTAACT GCGGATGCAT TTGGTGTTAT TCCACCGATT 4200 

TCAAAGTTAA ATAAAGACCA AGCAATGTAT CATTTCTTGA GTGGTTTCAC TTCTAAATTA 4260 

GCTGGTACAa GCGTGGTGTG ACAGAACCTG AACCATCATT CTCAACATGT TTCGGAGCAC 4320 

25 CGTTCTTCCC GTTACACCCT ACTGTTTACG CTGATCTATT AGGTGAACTT ATCGATTTAC 4 380 

ATGATGTTGA TGTTTATCTT GTTAATACTG GATGGACTGG CGGAAAATAT GGTGTAGGAC 444 0 

GTAGAATCAG C IT AC ATT AC ACACGTCAAA TGGTAAACCA AGCGATTTCT GGCAAATTGA 4 50 0 

30 AAAATGCAGA ATATACAAAA GATAGTACGT TTGGTTTAAG CATTCCTGTA GAAATTGAAG 4 560 

ATGTACCGAA AACAATTTTA AATCCAATTA ATGCTTGGAG CGACAAAGAG AAATATAAAG 4 62 0 

CACAAGCAGA AGATTTAATT CAACGTTTTG AAAAGAACTT CGAAAAATTT GGTGAAAAAG 4680 

TTGAACATAT TGCTGAAAAA GGTAGCTTCA ACAAATAAAT TTGAATACTA AATCaAAACC 474 0 

ACCCGTGTGA ACGGGTGGTT TGTTCTGCGG CTATAAGCCT TCCTTACTGG CCAGCCCTAA 4 800 

AAGGGCACTG ACAAGTCAGC CAACTGCACT ACTATTCCAG CAACCCTAAA GGGTTACTCT 4860 

TTTTTCTTTC TTTTTTTATT TTTCTCTCCA GTGAAAGGAT CTAAATATTC TTCCATTGAG 4 92 0 

ATTTGGTCTG CAACGATATC CTCTTGTAAT TGATTACGAA TATAATTTTC AATCACTTTT 4 980 

TTATTTCTAC CTACTGTATC CACATAAAAT CCTTTACACC AAAACTTTCT ATTTCCATAT 504 0 

CTATACTTTA AGTTAGCATG TCTATCAAAT ATCATTAAAC TACTTTTTCC TTTTAAATAG 5100 

CCAACAAATG ATGATACCCC AAGTTTGGGT GGTATACTAA CTAACATATG GATATGATCT 5160 

50 TTACATGCCT CTGCTTCAAT TATCTCTACA CCTTTTCTTT CACATAATTG ACGCAATATA 5220 

ATCCCTATAT CTTTTTTTAT TTTTCCATAT ATCACTTGTC TTCTGTATTT AGGTGCAAAG 52 80 
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CAGAAACtTT 


ACCTGTTCCA 


AATGTATCAA 


TTGCAATTGA 


CACTGGTTCT 


1800 


p. r* a a purr* a iv 

Vj^wAMV— Hv_v_AA 


*T^f O f A *^ Ik 

TCGCATATGC 


CAATTGTACT 


TCACATTGAT 


CTGCTAAACC 


TGCTGCAACA 


1860 


A T A TTT'PT 21 P 


c*c A r** A T A A of 

i~ LALA 1 AACG 


TGCAGCGTAT 


GCAGCTGAAC 


GGTCTACTTT 


TGTAGGATCC 


1920 


T^APfAPTflA 




ACCA TGACGT 


GCATAGCCAC 


CGTACGTATC 


AACAATGATT 


1980 


i i 1 V_ L_ 1 vj 


1 1AAILC- 1GC 


ATCACCTTGA 


GGTCCACCGA 


TTACAAAGCG 


TCCTGTAGGA 


2040 


1 1 OA i I .tvjA 


A I 1 I ALj 1 1 I Vj 


TT CATTAATC 


AAGTTTTCTG 


GAACAGTTGG 


ATAAATGACA 


2100 


HjIULI 1 ±AA 


1 G 1 TTuliXi 


AATTTGTTCA 


AGTGTCACAT 


CCTCAGCATG 


TTGTGTTGAT 


2160 


ALtiALAA I Lb 


TAT CAATACG 


TACTGGGTTA 


TCATTTTCAT 


CATATTCAAC 


AGTG AC CTG A 


2220 


ACTTTAC CGT 


CTGGTCGTAA 


ATAATTTAAC 


GTACCATCTT 


TACGCACATC 


TGATAAACGT 


2280 


TTTGCCAATT 


GATGTGATAA 


ATAAATTGCT 


AGAGGCATAT 


ACGTCTCTGT 


TTCATTCGTT 


2340 


f /~v"""P a a o a a 

GCGTAAC CAA 


Ik /"Ilk HWtH> Ik % 

ACATTAAACC 


TTGGTCACCT 


GCACCTGTTG 


CTTCAATTTC 


TTCTTCGCTA 


2400 


TCTTTATCAC 


/■*» ik m ik ik ■« 

GATACTCTAA 


TGCTTTATCC 


ACGCCTTGTG 


CAATGTCAGG 


TGATTGTTCA 


2460 


T CAAT CG CAG 


ttaaaattgc 


CATTGTTTCA 


TAATCATAAC 


CATATTTTGC 


TCTTGTGTAT 


2520 


L CAATTT CI 1 


TAA 1 i\j 1 1 1 C 


TCTAACAACT 


TTCGGAATAT 


CAACATATGT 


TGTTGTAGAA 


2580 


ATTTCGCCGG 


CGATCAATGC 


CATACCTGTT 


GTAACAGTTG 


TTtCACAAGC 


TACACGTGCA 


2640 


TTTGGATCGT 


CTTTTAAAAT 


AGCATCTAAT 


ATTGCATCTG 


ACACTTGGTC 


AGCGATTTTA 


2700 


1 L IXKjGTGTC 


CTTCTGTAAC 


AGACTCTGAA 


GTAAATAATC 


GTTTGTTATT 


TAACATAGTT 


2760 


TGCTCCTTTA 


AATTTATATT 


ACGAAAATTC 


TCTCTCTGTG 


AGCTAAATAA 


AAAAGACCTT 


2820 


f?*A A PTTN ■ 1 "1 'A 

L I AAL T ATT A 


A T*T> A f ft f )k ^ 

ATATAGAGAG 


AAGGCCTAAT 


ACGTCCATTC 


GCTCTTATCG 


TT CAG AC CT A 


2880 


TTTGTCTGCA 


AAcGGTTTGG 


CACCTTTCTT 


TTATAAAAAA 


GAGGTTGCTG 


GGTTTCATTG 


2940 


GG1 CCATGTC 


CCTCCACCAC 


TCAGGATAAG 


AGAATCCGTT 


AAAAATAATA 


GTACCTAATT 


3000 


A A TV* A TV T""P A A 

AAIXjAAI I aa 


TGTCAATTTT 


ftir» Jk ^ ik ik fk m ik ik 

TCACAAATAA 


ATTTACAGTA 


AAATATTGTA 


GATTAATTAT 


3060 


ij 1 1 AA I I Vj 1 


T* A 1* A t *'T' A A T^T^ 

I AT A CT AATT 


AAA T*/*"' r T> ^ Ik Tk /~i 

AAATGTAAAG 


GCTTACATTT 


AAATTATCGC 


TTTGGAGGGA 


312 0 


I i IALjVjAIXjI 


f A fT^ A Z" 1 A /"""A O 

LAGTAGACAC 


A '"P A O A /"^TV"' A A 

ATACACTGAA 


AfA A /^T A A A A 

ACAACTAAAA 


TTGACaaATT 


ACTGaaAAAA 


318 0 




ATTTTCAACT 


TTCGACGACA 


CAACTTTATA 


ik rr> n ik ik ik fw% / wnii 

ATAAaATCTT 


AGACAATAAC 


3240 


CjAAWjCjG I AT 


TAACAGAACT 


TGGTG CTGTT 


aatgcaagta 


ctggaaaata 


TACTGGTCGT 


3300 


TCGCCTAAAG 


ACAAATTTTT 


TGTCTCTGAA 


CCTTCATATA 


gagataacat 


TGATTGGGGA 


3360 


GAAATTAATC 


AACCTATCGA 


TGAAGAAACT 


TTCTTGAAGT 


TATACCATAA 


AGTACTAGAC 


3420 


t* A t^t 1 * p a ^ A 


AAA A AC A ^r: A 


a " m ;.Tn "^n-n 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

nCTAGGTATT TTAAACCTAA TCTAGATAAA CTAGCTTCGT AAGCAGCTGC TACATTTTCA 6 0 

CGACCGAAAT CCTCAAAATA TAATTTTGAA GTAATAAATA AGTCTTCTCT AGCAATACCA 12 0 

GTTGACTCCA AT CCGGCACG AATGCCAGCA CCTACTTGTT CTTCATTCCC ATAAACTTTT 18 0 

GCGGTATCAA TACTACGATA TCCTTGTTCA ATGGCATACT TAACACTTTC CATGCAATTT 24 0 

TCATCATTTT CCACACGAAA TGTCCCTAAA CCAATTTGTG GCATCGTGTT TCCATTATAA 3 00 

AATGTTTTAA CCTCCATAAA TATCGCCTCA CCTTTTTGAT GTATTATACC CTGTTATCAT 3 60 

AACAAATCTG AGTTGAATAC ATGAGAAAAA ACACTTAGAG CAATCAACCA CTAAAATTCT 420 

AGTAATATCT CTCAAATATT AATCAAATTG TAAAAGTAAT TCTGTTTAAT TTATGACAAA 480 

CTAAAAAAGC CGAAGTAACA ACATATAGTC ATCACTTCAG CCTAACATTT AATTGAATGA 54 0 

TTCAATTTTA TCCATCATTT GTTGTAAGTC TTCCACGTTG TATTGAATAC GACCATGGAA 60 0 

TACAAATTTG TTAAAGAACT CGTCTAATTG TTCAGCACCG ACAAGCACTT TGACAGCACT 660 

ATTTTGATTA TAATTTGAAA TCGTTACATC GCCTTCATTT TTAAGATTAA AGTATAAAAT 720 

25 TGAAGTTGGT GTATATTTGG CACCTAATTC TTTTTGTAAG TCTTCAGCCA ATTGTTTAAT 780 

CGCCTCAATT TGATCTGAAT AATTTACAAA TGATAATGAA CGTTTGTCAT CATTTTGATC 84 0 

CATCACAATA GTTTGCGGTC TAGATTTATC TAAATCCAAT GTATCAAATA CTTGTTCCAT 900 

TGGTGGTAAA TCTTTAAATT GACCGCCACT AATACCATTA TAAACATGAC CTTTTAACAA 96 0 

TTGAGAATCA ATAATATAAA GACCAGTTCT TGTTAATACT AAATGACTAA TTCGTTCAAT 1020 

ATTATTAAAG CCATCCTTTG GTAAAAAGAT ATTTGCCATA ATGTGCATAT CTTCTGGTCG 1080 

AATTCGTTTT TCTTTAACTA ATCTTTCACG AATACCAATT AATCTCATGT CCGTTACATA 114 0 

TTCACTATGA TTTTTCGAGA ACAATTTTAA TGCGTCAATC TCACGATCTT TTGTACTAAC 1200 

CATGTGATTA TAATCTTCTT GTTGTTTTGT AATTGTCTTT TTATTTTGAA TACGCTCTTT 12 60 

CTCTAAAGCT TCTTCATGAG ACTTTTTAAT GTTTTGTTCT TGTTGTTCAT ACTTTTCTTC 1320 

TGTTTGTCGC TTAACTTTTT TCTTACTACC TAAGGCAACT AAAAAAAGGA CAAAAAAGAT 13 8 0 

45 TAATGCAATG AgCTACTGCA ATAATGAGTC CAATGACTAT CGGTGAAGAT AAATCCATCA 144 0 

CAACAACGCT CCTTTTTAAT ATATGAATAA CTTTAATTAT AATAGAaAAG CTAAAGATTT 1500 

TCGATACATA TTATCATTTA TAT AC CGAAA ATCTTTTATT TAGCTATATT CAATTCATCT 156 0 

50 TATTATTTTA CTGCGTCTTT TAATTCTTCC ACTTTGTCTA ATTTTTCCCA TGGGAATAAG 162 0 

ACATCTGTAC GTCCAAAATG ACCATAAGCA GCAGTTTGTT TGTAAATCGG TTGTTTCAAA 1680 
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TTaTATATGA AATTGTTATA 


GATTATTTGA 

VJA X X X\ XXX \JA 


GTAPCITAflTA 


ttit p a a rr a r* 


A TTT A A A A TP 
/ill x/tAAAl 


6180 


ATACTATATG 


TTTTCTGAAA 


AAACAATTAA 


TG A pcv^tt™ IT 

X X X X X 


A ITTT^ A AT 
/vrt x x i/vr\i>\X 


AATPTY1APTA 


624 0 


CTATAGGCAT 


CTCATTGATA 


TGATTCTTAC 


TAAPAGA PAT 


TAAAATPAAA 
x nnAn i \ — riAn 




b J 0 U 


GTCTCTATAG 


AGCGTTCTCT 


TTATTAT CTT 

x a x x n x \v x x 


CTAQTTAPAA 


r\ x x Jr\ X X vi/\x X 


pjpf near* 


o J 6 0 


TGTTGTTGCT 


CATTCGATTC 


TAAAGCATCA 


TATAATTGAG 
x x x x nj^vj 


ATAPTGTATG 


P^"2P A A PTTY"2T 


b 4 ^ Q 


TCTACAATCA 


TTTTCACACC 


GTTTCGTAGT 


TTATTAAPAP 


P rSTTTfi T P A T 

COX X X VJ A v r\ X 


TTf: A P PT A T C 


on 


GCAATCATAT 


TTGTTAATGT 


TCCAAACCTT 


GG A P T AAT AA 


V- X X \m\X 1 0%J X 


TTPPPJ^AaTP 
1 1 L-^_Li*jAMx \j 


C C A P» 


ATTTGTATGC 


CTCCCATTGG 


GTGTGCTTGT 


aclaatttotp 


T A TTTTr' A A f2 




DO 0 0 


AATTGATCAT 


CTTGATCCAA 


TTCATTTAAA 


X WW XXX X x o 


PAPPTGTPOP 




£ C d Pi 

b be U 


ACATTATATA 


TGTCTACTGA 


TTCTTGGTTT 


TTGTATGAAA 


AATAATACAA 




o / zu 


ATGTTCACAT 


CTTCTAAATC 


TTTTTTCAAA 


ATTAAAGACT 


TATTTTCTAT 


TA ATTT" A ATA 


ft /oU 


ATTAGTTCAG 


CAGTTCTTGG 


AGGCATTGGA 


TTTGAATTTA ATTGAATCAT 




XT Q A n 


TTTTGATTAA 


ATTGATGTTG 


GTCTTCAATA 

vJ X v— X X WVt X *» 


CTTAAGCTAT 


TCCATATCCA 


A ^^^P^P AAA ^^^P/*"* 
nil InnAi x V~ 


Ck Pi Pi 

b y u u 


TCTTTCAAAT 


GTTCAATCAT 


ACTTTGGAAA 


ATGCCCaTTT 


CTGTTGGACG 


\- U L. 1 AAA 1 tJA 


b y b 0 


TACTTCAAAT 


CTGCAATATG 


ATTT P PTf^ T A 


CGTCTATGTA 


CTAATTTTTT 


AAA ATPa RTVi 


T a ri 

/ 0 Z 0 


TCATATTCAG 


CACATTCTTT 


TAAAAATAAA 


GAAACTAAAG 


TATCAAGCGG 


TPP1TTPPPP 


/ 0 8 0 


AAATGATGTT 


TTTTAATGTC 


ATTTAATTTG 


TCTTTAGTTA 


AGTACTTGAA 


ttit a rr* tct 




ATCATTGTAC 


CTCTTACACT 


TGGTAAATGA 


GCAGAACGAC 


TCGTCATAGT 


A A TTYYIT AAT 
MAI l\jV?lAAl 


"7 O Pt Pt 


TTTGGATGAT 


GAGCAGCAAC 


ATAACGGACA 


ACATCTAAAC 


TGGCAAGGCC 


TPTAPPA ATA 


•7 0 c n 


ATCGCAATAT 


CGTCCAGTTC 


ATTTACTTCG 


TCTAACGTAT 


TATATGTTGG 


ATAAfiHPCTA 


' JxU 


gcGA^ATATC CTTTTTTACC 


CTTTAAGTTA 


TATGGATCAT 


GGTAGGCAAA 


TGTAPPAPAT 


77 on 
/ j 0 u 


GTTAAAAATA 


CATAATCGTA 


CGCTTGCCAT 


GATTGTCCTG 


AATTTGTAGT 


APATATf^TAA 


7 4 A Pi 


TAAGTTAAAT 


TCGTTTCATC 


GATATTAGAA 


TTTGTATAAA 


TCTCTTGAAC 


TTT A TT A T A A 
1 1 Xi\X lAlnn 


■7 c 

'DUU 


TTAGTTGATA 


TATTTGGATA 


TTTl-I-I'CGTG 


AACATAGATA 


AATAAGATTT 


CATATAATGT 


7560 


CCGAATACAA 


ATCTCGGTAA 


ATATGCAG 








7588 


(2) INFORMATION FOR SEQ ID NO : 14 3: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10320 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

TOPOLOGY: linear 
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CTCGTTTCTG GAATAAGATG AATGTCAAAA CTGTTATCAT GCTTATCAAA TACCGTTAGA 43 8 0 

CTAACACCAT CCACAGTAAT AGACCCTTGC TTAACTAACT GATTATTAAT ATGTTGGCTA 444 0 

CATTGAATCG TAATAATTTT TGCATTGGCT GTTTCATTTA TTTTTGAAAC TGTTCCTAGT 4 50 0 

TCATCTACAT GACCGAGGAC AAAATGTCCA CCAAACCTAC CGTTACCACT CATGGCACGC 4 560 

TCTAAATTTA CTTCTGATTG TCGCTTAACA TCTGCTAAAT AGGTTTTATT TTCAGTGCCT 4 620 

TTAATTACTT GAACAGTAAA AGATGTCTGA TTAAAATCAA TCACTGTTAA ACATGCACCA 4 680 

TTAACACTGA TGGAATCACC AATATGCATA TCTGCCGTAA TCTTATGTGC TTCAATTTCA 4 74 0 

ATCGTCCTGA CTGATTGACG AATTTGAACA CTTTTAACGA CACCTATTTC TTCAACGATG 4800 

CCAGTAAACA TGCATCATCA CTTCTTTCGT AAAGTTAATT TAACATTTTG ATTTAATAAC 4 860 

TCGGAATGAA CAATTTCAAA TTGGTTCGCA TCTGGTATCT CAATCACATC ATTTGTTTGA 4 920 

TAAAATTGAT AATTTCCAGA TCCGCCAATT AATTTCGGGG CATAATAGAG AATAAATTCA 4 980 

TCTATATAAT TAGATTGGAG AAATTCTGAA GTAGTGGTTG GACCTGCCTC GACTAGCAAA 504 0 

GTTCCAACTC CTCTTTTATA TAAATTGTGA AGAATTGTTG TTAAATCGCA AGACTTCAAG 5100 

25 TAAATAATTT CAATATGTGT TTGATTGGTT GTTAAATTTG GATTTTCAGT ATATATCCAA 5160 

ATTGGTGTTG ATTCATCTTG ATAAATTTGC TGATTAAAAT GAATATTCCC AGACTTAGAC 5220 

AATATTACTT TTATAGGGTT TTTTCCATCT TGAATACGTG TAGTATATTG TGGATCATCT 52 8 0 

AATTCAACTG TACGTCTTCC AGTTAACACT GCGTCGTGTC GATGTCTTAA CTTATAGACA 534 0 

TCTTGTTTAA CCTCTTTGTT AGTAATCCAT TGACTTTGTC CATTATCATT CGCTTGTTTA 5400 

CCATCTAAAC TTGCAGATAC TTTCACTGTA ATTTGTGGCA GTTGCTTTGC TTTTGCTTTA 54 60 

AAAAAGTCTT GGTATAATTG TGATGCCCGT TCATCATCAA CGCATTCAAC CTCAATACCG 5520 

TGAGCCCGTA ACGTCTCATC ACCATGTGTG TCTAACGAAT TGTCTTTTGT TGCGTATACT 5580 

ACTTTTGCTA TCTTACAATC AATTATTTTG TTAACACAGC GTGGTGTTGA ACCAAAATGA 564 0 

CTACATGGCT CTAACGTAAT ATAAATCGTC GCACCTTCAG CATTTTGTTG TGCCATATCA 57 00 

AGTGCTTGAA CCTCCGCATG CTTGTCACCT TTTCTCAAGT GTGCACCAAT ACCAACAATC 576 0 

^5 CTACCTTCTT TAACTACAAC AGCGCCAACG GGTGGATTAA CACCTGTTTG ACCTTGTACC 5820 

ATATTTGCAA GTTGAATCGC ATAATCCATA AATTGACTCA AATGATCACC TCTATAAACA 5880 

AAAATCCTCA CAT CATG AAT TAAGATGCAA GGAGaAAAAT TTATCGTTAA ATAAGCCTAT 594 0 

TTGTACACAT TTTTACAAAT ACGCTACATT ATCTTTGTCG ATAATTAACA TTCTTTCTCC 6 000 

CATCCAGACT TTAACTGTCG GCTCTAGAAT CTCACTAGAT CAGCCACTAA TATGAAACAT 6060 
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T AGATTTT A A 


TA A ATTAPPO 
X /\r\n 1 1 nuL L 


ATTTCAATTG 


CaCTTACTGC 


rrvi / M l»p/n ft A A ft 

TGC IT CAGCA 


C CTTT ATTG C 


2580 




CAGCTTTCGT 


APPTPPTPTT 
n^L 1 uL 1L1 i 


TO O IV P A P P*I**T' 

i llalAll It 


GTTCAATAC I 


TTCAGTCGTT 


AAAATAPPAA 

AAAAI AllAA 


2640 


5 


ATATGAPTPP 

■** X *V X W«VV_ X VJVJ 


InLAl 1AL?1 x 


TO ftTPR TTP A 

1 VjAI LAI 1 LA 


CTTTAGaAAl 


ACCTTTCGCG 


ft pfpTP A TT A P 

AlTTCATTAl 


2700 




AAA CAT AA TP 


A Tlx & Tn an a o 

t\ i AA 1 VjALjAL 


fTR PPA OO^**/^ 

AGCACCGC 


P A TV TT A AO /"I ft 

GAATTACG la 


TPPT A A TPT TV 

TCCTaATGTA 


ft wwii ft PTP PAT 

ATT ACT G CAT 


2760 


10 


PATAATTTPP 


TP ATP APP PT 


AA1 1111 1AG 


PTA / ♦* I' TV A 

L TACT AAAGG 


ft ft llWfa TI/-V TV ft ft /-» 

AA ITT CAAAC 


P P TV <•"( »PP P ft 

GCACCTGGCA 


2820 


P.AAATGPTAP 


A TP A ATATTP 
nlLiinlni x\j 


1 L 1 1 LAI 1 AA 


OA TO A TVTPP 

LA 1 LAi Lj 1 LL* 


A A TP A A APTA 
AAT LAAAGTA 


T CTTTTG CA C 


2880 




PTT P AAPT A A 


1L11 LLAL* 1 V> 


aTaaaaTOTiT 
Al AAAA1 LAI 


T A A A TPP A PT 

I AAATCvjAlT 


ft ft PT A PP ft TH ' 

AACTACGATT 


PPft ft / WIUIM T*P TV 

GCAAlITT CA 


2940 


15 


A A TPTTTT P P 

nn X L X X X ILL 


a a ; i w i * a a ■ i ~ ? _ t • a 

AA1 1 AM 1 1 1 A 


POTTO A A A A T 

LL IT CAAAAT 


TPATPTTA A A 

T CAT G IT AAA 


ft TPPTPPT ft T 

ATCCTCCTAT 


ft mhi ft ft ft mp ft p 

ATTAaATGAC 


3000 




tUil ill 1 Hi 


111111 Lvji I 


TPP A T ft mk ft ep 

r L CAT AT AAT 


CATGATTATG 


TACCGTTTCT 


ggtacgataa 


3060 




A '1 " 1 'PT 
111 Uinl 1L1 


*I **T*^^H 'O O A AT 1 IV 

1 1L1 IjLAAlA 


TP A »mpnpKT 

TCAATGCCAT 


ATTGTTTTAA 


TCCCrCAAAT 


ttacttggat 


3120 


20 


Inl iHLl IAA 


Ta A Ti TTT a T7\ 
1 AAA 1 loAIA 


TGTT CGATG T 


TAaAATATTT 


TAAAATCTGT 


G cag caatat 


3180 




P ATA 

VjM 1 1L1 


o a a a t 1 p* f ** i *p a 

LAAAIL 1 I LA 


I LAaaAClTA 


TV Ti/™» <T» K TV TV r >V T» 

ATGCTAAATT 


TGCAGTTACT 


gtatcatatc 


3240 




PTTPPTPA AT 
V- 1 1UL 1 LMA 1 


T a STTPRTR T 

1 AA 1 I LA I A I 


O 0/~ , /™ , /^T , T\ JV 

\s LULvjTAATT 


TGTTTAACAA 


TPPT A TPPP ft 

T C CTATGC CA 


cgaccttctt 


33 00 


25 


P APPTAPTATa 


a aTa atoatp 
AA 1 AA 1 LA TO 


ClaCCATGTT 


ft TT/*i IV m TV T IV 

CATTGATATA 


CTTCATAGAC 


GATT CAAGTT 


3360 




pappappaor 


A 1 LALAALL* 1 


TO A PT TV TV^"/"' l\ 

I LjAL 1 A IXjCjA 


A A ATATPPPP 

AAATATlGCL 


TPT A TV P*-»^* A 

TGTAAGgCAC 


P f~* TV TV TV T*PT1 TV 

GCAGaATGTA 


3420 




APPPTa fSTT 
AbLLjlALAl 1 


r l " T 'P A TP TTP T 

1 1 LAIG1 lb 1 


cgaattgcac 


CTTTTGTCAG 


rri iv /"* ft ft tv m o 

TACAACTAtC 


TCTTCATCTG 


34 B0 


30 




T TT TV A A A P P A 

I 1 iaaaalla 


TACATATCAA 


ATPTTPPPA A 

ATGTTCCGAA 


TV T/VTV*im ft 

ATCTGTAGGC 


ATTTTCACTT 


3540 




TTPPPTT A A ZV 
x IVjLLI 1AAA 


1 1 LAAI 1 1 L 1 


PPTTPTR ti i-tw 1 1 

LtLj I I L I AA 11 


TTTTACGATA 


TTPA ATTA A A 

IT CaATTaAA 


TPATPA ATPP 

TCATCAAT CG 


3600 


35 


T A ATPATOTT 
1 AA1 LA x LI 1 


t a a ttp a tpt 
1 AAI IGAl \3 1 


I 1 I IL1 1 IAA 


A CTTTTGTAA 


ATCTTGTCCT 


TTCGCCATCG 


36 60 


I bLLb 1 LAI L 


ft TT p A T A 7v TP 

All LATAATl 


TlAlAAATGA 


p a pp a r> r~*r^i^ri 

CACCaGCGGG 


/■Ml W »t^/1 /-I ft Slf* ft 

CTTGGCACCA 


GTAAGTTTAG 


3720 




OTR LRTf a a p 


a p p op p ( i ■* i *pt 

AGLLGLTTlT 


GTGTGTCCaT 


TTCTAGCTAA 


TACGCCTTTA 


TCTTGTGCTA 


3780 


40 


PT A ATPP AAA 


T a a a to n oo a 
i AAA 1 Lt AL LA 


OO A OO TV TT A A 

UbALUAI IAA 


AA ILI 1 I AtjL 


TTP A PT A PT A 

IT CAltACT A 


PP A TP A A TP A 

GG AT CAATG A 


3 840 




P P TTTTT PP P 

VJ7<*~ X X X X X UUL 


aptpzv a TOT a 

AL* 1 LAA l\j 1 A 


OO ttp a T* a a o 
LL* 1 1 LA 1 AAvj 


o a ot a a tto p 
LAL 1 AA1 ILL 


1\J 1 1 Vj 1 1 Lt 1 A 


T'PT* A P ft T>P TV T 1 

1 L I ACATGAT 


3 9 00 




PAATAPTPAP 
^- .rV_r\ x rtL- x \ — r\L 


lVjlAAAl IvjL 


oTaooa a aoa 
Kj 1 ALLAAAVjA 


TOTOOO A OTT 


a TP a TP A A OO 

AlLAlLAALL 


ATTTG T A C CA 


3 96C 


45 


A ATPPA A APP 


ttp too a aTa 

1 lL*lL*LAAiA 


I L 1 1 1 ALjALA 


OTOOTPOPP A 

L HjL* IGCGlA 


T A TT A A TPPP 

TA1T AATCCC 


CtTGCTTCTT 


4 0 2 0 




tpppptat a a a 

1 HjLLM 1 AAA 


a TTa a tp^tr 
All AA 1 OLr 1 A 


I X A I LVj I i LA 


TOO A TTP A PT 

ILLAI ICAGl 


A ft f** PP /' »• I ' ft PT 1 

AAC CGCTACT 


ft TV ft T^P ft PPfflm 

AAATCACCTT 


408C 




CATTTTCACG 


ATTCTCATCA 


TCTACTACAA 


taattggttc 


TCCATTTTTT 


AAAGCCATTA 


4140 


50 


AAGCACTGTC 


AATATTATCG 


AATTGCATGC 


TACCCCTCCt 


AAAAACCAAA 


TGCTCTTAAT 


4200 
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CATTAAAACG TGTGTCTTTT GAAATTTTAC CTATATTTGA AACAAGTTTA TCTTTACGAT 78 0 

TTTTTCCATT CTTTTGAAGT TCTAGCATAG GAGAAATTAA CATCATCCCC TCGATTGGCA 84 0 

ATTCTACTTT TTCAAGTAAA TTTAATAAAA TCAAACCGCC AAGTCCTACC CCTAATACAT 900 

AAGTAGGAAT TTTATATTCA TTAGCTATCT TTAACCAGTC TAGCAAACTT TCGTGATACG 96 0 

TTTGAAAGTT TTCAATTTGT CCTTTATTAG CTCTTGAAGT TTGACCTTGA CCAGGCAAAT 102 0 

CTCCCATAAT CACATGATAG CCATTTCTTC TTAACATCGT AATAACATAT GCATATCTTC 1080 

CCGTATGTTC TAATATATTA TGAGCAATAA CAACGACGCC TTTCGCATCA TTTTCAGCTT 114 0 

CCCACTTCCA CATTATTATA CTGCCCCTTT TTCATTAATC TTCAATAACA TAATTATAGC 1200 

AAATTCACTA TGTAGATTTC TATTTATAGT ATTATTGTTG TCCATATTAT TATATATAAA 126 0 

TGAAATCAAC ATCAATAATA GTGTAATTAT ACATAATTAT TTTTGATTGT TTTTGATGAA 1320 

AACGCTTTCT CGAATATTTT TTTCATGCTA AACTTATTGT AAACACAAGG GTTTGGAGGA 13 80 

GTAGCAATGG CACTATTAAA GAATTTTTTT ATCGGATTAT CTAATAATAG TTTTTTAAAC 144 0 

AACGCAGCAA AAAAAGTGGG CCCACGTTTG GGCGCCAATA AAGTCGTTGC CGGAAATACA 1500 

25 ATTCCAGAGT TAATTAATAC AATCGAATAC TTAAATGACA AGAATATCGC TGTTACGGTA 1560 

GACAATTTAG GGGAATTTGT CGGTACAGTT GAAGAAAGTA ATCATGCTAA AGAACAAATT 1620 

TTAACAATTA TGGACGCGCT TCATCAACAT GGCGTAAAGG CACATATGTC TGTTAAATTG 16 80 

30 AGTCAGTTAG GTGCAGAATT CGACTTAGAA TTAGCTTACC AAAATTTAAG AGAGATTTTA 174 0 

CTTAAAGCAA ATACTTACAA CAATATGCAT ATAAATATTG ATACTGAAAA ATATGCTAGC 1800 

CTGCAACAAA TTGTTCAAGT TTTAGAT CGC TTAAAAGGCG AATTTAGAAA TGTTGGTACT 1B6 0 

GTAATTCAAG CATATTTATA CGATAGCCAC GAATTAGTTG ATAAGTACCA AGATTTACGA 1920 

TTACGTTTGG TTAAAGGTGC ATATAAAGAA AACGAATCAA TTGCATTTCA ATCTAAGGAA 1980 

GACGTAGATG CAAATTACAT CAAAATAATT GAACAACGTT TGTTAAACGC ACGCAATTTC 204 0 

ACTT CAATTG CAACACATGA CCATCGCATC ATTAATCATG TAAAACAATT TATGAAAGAA 2100 

AATCACATTG AAAAAGATCG TATGGAATTC CAAATGCTCT ATGGTTTTAG ATCAGAGTTA 2160 

GCAGAAGAAA TCGCAAATGA AGGCTATAAT TTCACTATTT ATGTACCTTA TGGCGATGAT 2 220 

TGGTTTGCGT ATTTTATGAG AAGATTAGCA GAACGCCCAC AAAAC CTATC TCTTGCTGTA 2280 

AAAGAATTTG TGAAACCTGC TGGCTTAAAA CGTGTTGGCA TAATTGCAGC TTTAGGAGCT 234 0 

50 ACAGTTATGT TAGGTTTAAG TACAATTAAA AAATTATGCC GTAAATAGAG CAAGACATAA 2400 

ACAATAATTT AGGAGTCTGG AACAATAATC AATGTTCTAG GCTCCTAAAT GTTATATTGG 24 60 
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AAGTAAATTA GTTGTTTTAT ACGATTCAAA TGATATTTCA 
AGCTTTTTCT GAAAACACAA AAGCTCGTTT TGAAGCATAT 
5 TAAAGATGGT AATGATTTAG AAGAAATTGA TAAAGCGATT 

AGGACCAACG ATTATTGAAG TTAAAACAAC AATCGGATTT 
AACTAATGGT GTTCATGGGG CACCTTTAGG TGAAGTTGAA 

10 

TTACGGTTTA GATCCTGAAA AACGTTTTAA TGTTTCAGAA 
AAATACTATG TTAAAACGTG CTAATGAAGA TGAATCTCAA 
ATATGCAGAA ACATATCCTG AATTAGCAGA AGAATTTAAA 

15 

GCCTAAAAAT TATAAGGATG AATTACCACG TTTTGAACTG 
TGCTGATTCT GGTACTGTTA TTCAAGCAAT CAGTAAAACT 
20 ATCAGCAGAC CTTGCTGGTT CAAACAAATC CAATGTAAAT 

TGAAACACCT GAAGGtAAAA ATGTGTGGTT TGGTGTACGT 
(2) INFORMATION FOR SEQ ID NO: 142: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 758 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 2: 
TAGTAGTATT TATTAAATTA TACGAAGGGA CCcAACACAG AAAATTCATT TTATTGAATT 6 0 

35 

TTACATTTAT GTGCCAAGTT GGGAAAAATG TCTTATTTTT TCaAAGTATT TAAAAGTAAA 12 0 

ATTACATGTT AATACGTAGT ATTAATGGCG AGACTCCTGA GGGAGCAGTG CCAGTCGAAG 180 
ACCGAGGCTG AGACGGCACC CTAGGAAAGC GAAGCCATTC AATACGAAGT ATTGTATAAA 24 0 

40 

TAGAGAACAG CAGTAAGATA TTTTCTAATT GAAAATTATC TTACTGCTGT TTTTTAGGGA 3 00 

TTTATGTCCC AACCTTTTTA GAATATTAAA TTTCTACAAT TTCGTCATCT TCAACAATAA 3 60 

45 AGCCCATTGT ATTGACGCTG TTATTTAAGA AAGTCAGAAT ATAACGCATT ACTTCATCAC 4 20 

GTTCTGGCTC ATTGTGAACC TCGTGGTAAA AACCTTGCCA AGCTTTAAAA TATAATTCAG 4 B0 

GTGTTTGATA TTTTT CTTTA AACT CATCAA TTGCCCTAGT ATCAACAATT AAATCCTTCG 54 0 

TTCCATACAT TAATAGCGTT GGCATTGGTT GAATGTCATG AATATGAGCC ATCGTATCTT 6 00 



TTAGATGGCG AATTAAACAA 132 0 

GGTTGGAATT ACTTACTAGT 13 8 0 

ACTACAGCTA AATCTCAAGA 144 0 

GGTTCACCGA ATAAAGCAGG 150 0 

AGAAAATTAA CATTCGAAAA 1560 

GAGGTATACG AAATTTTCCA 16 2 0 

TGGAATTCAT TATTAGAAAA 16 3 0 

TTAGCGATTA GTGGTAAATT 174 0 

GGTCATAATG GTGCATCTCG 18 0 0 

GTCCCTTCAT TCTTTGGTGG 186 0 

GATGCAACTG ATTATAGTTC 192 0 

GAATTTG CTA TGGGTGCT 1978 
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ACACAACAAC ATCGTAACAA CTTGTTTATG AGAGAAATnT TAATTTTCAA ACTTAGTTAT 618 0 

TAAGAAAnCA TTAAGATGTG TATGCAGAAA TAAATTTTAT AGCATTTAAT TGTGAAGAAT 624 0 

5 ATTATGATAT TGCTATCGAG GTGAAGGTTA TG 627 2 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 
w (A) LENGTH: 1978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AAATGATGTT TTACAATAAA TATAnAAACG TATCAACATA TATCATCATA TTTTTAGTTT 60 

20 CAAGTGCAGC CTTTGCAATA TTCTTGTTAA GTGCGnACAT TAGTGCTCAC TCGGAACAAG 12 0 

TGTACGAAAT GACTGACCAT CAAATTAAGA ACAATACGAT AAATAAAGCA TACGAACATA 180 

AAGACCCTAC AAACAATAGC GAACAAAGAG ATGGGAAAGT GTTCGCTTTA ATAAATTGAT 24 0 

25 ACATTGTCAC AACGTTATTT TGCCTATTTT TGCGmAATAG CGTTTTTTAT TACwTTTTTG 300 

CTGATsTTAA ATTTGTTATA TTTTGTTAAA GTATTATAAT GATTGAATAA ACAAATTGAA 360 

GGTAGGTTTT TTAATTGAGT AATTCTGATT TGAATATCGA AAGAATTAAC GAGTTAGCTA 420 

30 

AAAAGAAAAA AGAAGTAGGA TTAACTCAAG AAGAAGCAAA GGAGCAAACA GCCTTAAGaA 4 80 

AAGCTTATCT TGAGAGTTTT AGAAAAGGGT TTAAACAACA AATTGaAAAT ACTAAAGTAA 54 0 

TTGATCCAGr AGGTAATGAT GTAACACCTG AAAAAATTAA AGAGATACAA CAAAAAAGAG 600 

35 

ATAATAAAAA TTAAATCACA AATCTGTAAA GAATTTTCTG ACATTATAAC TTGAAATAAG 660 

TATfTTACTT ATCTTTTTAT TTTAAAATAA GTTATAATGT ATTTGATAAA ATTGAAGAAG 720 

40 GGAAGATACA CAAGATGTTT AATGAAAAAG ATCAATTAGC TGTTGATACG CTACGTGCAC 7 80 

TAAGTATCGA CACAATCGAA AAAGCGAATT CTGGTCATCC AGGATTACCT ATGGGAG CTG 84 0 

CCCCAATGGC TTACACTTTG TGGACACGTC ATCTGAATTT TAATCCACAA TCTAAAGATT 900 

45 ACTTCAATAG AGACCGTTTC GTATTATCTG CAGGGCATGG TTCAGCATTA TTGTATAGCT 960 

TGTTACATGT TTCTGGTAGT TTAGAATTAG AAGAATTAAA GCAATTTAGA CAATGGGGTT 1020 

CTAAAACACC AGGTCATCCT GAATACAGAC ATACAGATGG TGTAGAAGTT ACTACCGGAC 10 8 0 

50 

CACTTGGACA AGGTTTTGCT ATGTCAGTAG GATTAGCTTT ACAGAAGATC ACCTAGCAGG 114 0 

gAAATTTAAT AAAGAAGGAT ATAATGTTGT AGATCATTAC ACATATGTAT TAGCTt CTGA 1200 
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TTTATTTTAA AGTTAAAAAT TCACCAATAG GACCAAGTAA TAGTACTGGA ATAAATGTCA 4 3 80 

AACCACTTAG TAAAACGATA AATACGATTA GTGATACGCC AAAATAAGGT TTATCAATCG 4 44 0 

5 

CTATTGTATA TTTATCTTGA TGGTATGATT TTTTATTCAC TAAACTTGAT GCAATCATTA 4 50 0 

ATTGCAAAAT AATTGGTATA TAACGAGAAA GCAACATAAT GATTCCTGTA GAGATATTCC 4 560 

AGAATGTTGT ATCATCTTTC AGTCCTTCAA ACCCTGATCC ATTGTTCGCA GCAGCTGATG 4 6 20 

10 

TCATTTCATA CATAACTTGT GAAATACCAT GAAAAGACGG ATTCGTtATa CTTtCACTTG 4 6 80 

CTCCAGGAAT CATAAAAGCA AGTGCTGAAA ATACTAAAAT TAAAATTGGG TGTATGAGAA 4 74 0 

15 AGACTAAGAC AATACATTTC ATTTCACGGG CGCCAATTGG CATATTTAAA TATTCTGGTG 4 800 

TTTTACCAAC CATCAAACTG CATATAAACA CCGTCAGTAA GACAAATATC AATAAATTCA 4 86 0 

TGAGTCCTAC GCCTTCGCCA CCAAATACAA CATTTAGCAT CATTAATACC ATTGGTC CTA 4 92 0 

20 ATCCACCTAT AGO C G TT AAG CT AT CATG CA TGTTATTAAC A.G AA. C C C G TT GTAAATGC CG 4 990 

TCGTAATAAC TGTAAATAGT GCTGACAAAC CTGCTCCAAA CCGTACCTCT TTACCTTCCA 504 0 

TATTCGGTCC ATAAATGCCT AAATTCGCTA GTATTGGATT ACCACGATAC TCACTCCACA 5100 

25 

TAGTTAATGT AAGAATTGCT ATAAAAATGA AAAACATTGC GACAAATAAT ATCAACGCAT 5160 

GACGATGTAC TCGTTTACCA TGT CTACTTA ACATGCGACC AAATAAGAAC AACATTGACA 5220 

TAGGAAGTAA CATCATACTG CCCATTTCTA TAAAATTGCT CCAAATATTT GGATTTTCAA 52 8 0 

30 

AAGGTGTTGC AGAATTTCCT GCTAAAAATC CTCCACCATT CGTAC CAAGA TGTTTTATTG 534 0 

ATTCAAGTGA TGCAATAGGT CCAAATGCAA TATGTTGAAT ATGTCCGCTT AAAGTCCGAA 54 0 0 

35 TCATTAAATT AGCATGCAAC GTTTGTGGTA CaCCTTGAGT CATCAATAAA ATACTAATTA 54 60 

AACATGATAA TGGTAAAAGT ACTCGGACAA TAAACCGAAC AATATCTTGA TAAAAATTAC 5520 

CAATGATATT AGTTAATCCA GTTAAACGTC TCAACATCGC TATACAAACG GCGTAACCTG 5580 

40 ATGCACTAGA TGTAAACATT AAATATGTCA TTACAATCAT TTGCGTTAAA TATGTCACAT 564 0 

CTGaTTCACC GTTATAGTGT TGtAAATTAC TATTTGTTAA AAAAGATATT GCTGTATTAA 57 0 0 

ACGCTAAATC TATCGATTGG TTTAAATTAT GATTTGGATT TAAAAAAAGC CATTGCTGAA 576 0 

45 

CTATTAGCAA TACAAATGTT ATAAACCCCA TAAATCCATT AAATGCCAGA AAATGTTTGA 5 82 0 

CATATGTTTT AGCTGACATG TGTTCTAAAT CTGTGCCGAT AATTTTAAAA CACATATTTT 5880 

CAAATCTAGT AAATATTAAA TCTACTCTTG ACGATTGCAC CAATGCTACG CGATATAGAT 594 0 

50 

ATCCACTAAA AACATACGTA ATCATAACCA TCATTGTTAG AAACAAAATT ATTTCCATGA 6 00 0 
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AATTGTTGAC ATCATTAGTG CAGGTAAAAT TGCAAAGTAT TTTGCTACGT CATTAGCCAA 2 58 0 

ACTAAATGTC GTTAATGCAC CTCTCGTCAT TAATAATTGT TTGCCTATTT TTACAACCTC 2 64 0 

5 

TATTAACTTT GTAGGATTCG AATCTAAATC AATTAGATTA GCTGCCTCTT TAGCACTAAT 27 00 

TGTCCCTGAG TTCATAGCTA ATCCTATATT CGCTTtGTGc tAGCGCAGGT GCATCATTTG 27 6 0 

TACCATCTCC TGTCATCGCA ACAATATGGC CTTTCGCTTG TTCATCTTTG ATGACTTTAA 2820 

w 

TTTTATCTTC GGGTTTACAC TCTGCAACAA ATCTATCAAC CCCGGCTTCT TTTGCAATTG 2 88 0 

TAGCTGCTGT TAAAGCATTA TCACCTGTAC ACATAACTGT TTCAATCCCC ATTTTTCTCA 294 0 

i 5 ATTCAGTAAA TCGTTCTACA AGACCATCTT TAATCACATC TTTTAAATAA ATCACGCCAA 3 000 

GCATGACATT GTTTTCAATG ACTATTAAtG GnGTGCCACC TTTACTCGAT ACATCCATAC 3 06 0 

AGAGAGACTC AATATTAAGA GGAATATTGC CTTGTTGTTG TTTGACAAGA TTTATCATAC 312 0 

20 TATTAGGTGC ACCTTTGAAT ACCGATATTT CATTTGTAAT GATTCCGCTC ATTCTAGTTT 318 0 

CAGCTGTAAA AGGCTTATAT GTGCCATCAA TGTCTTTAGG CAGCTCATTT AT ATACAT C T 324 0 

GcttCGCTAA TCGTACAATA CTTTTTCCTT CTGGCGT AT C ATCGTAGATT GATGACATAT 3300 

25 

AAGCAGCGAC TATCAATTTT TCAAGCATTT GTTGATTCAC TGGTAAAAAT TCACTAGCGA 3360 

TTCGATTGCC ATAAGTGATT GTGCCTGTCT TGTCTAAAAT CATTACATCG ACATCTCCAC 34 2 0 

ATACTTCTAC AGCACGCCCA CTTTTCGCTA ATACATTGAA TTGAGTAACA CGATCCATGC 34 8 0 

30 

CTGCAATACC AATCGCCGAT AACAAACCAC CGATTGTCGT TGGTATTAAA CATACTGTTA 3 54 0 

ACGCAATGAG CATCGCAATA GGTAAAATTA AATGCAGGTA AGATGCTATT GGATATAACG 3 60 0 

35 TTACAATAAC GACTAAAAAT ATAATTGTTA ACGTTGTTAA TAATGTAAAA AGTGCAATTT 3 6 60 

CATTTGGTGT TTTATTTCTT TCCGCCCCTT CAACTAAGGC AATCATTTTA TCTAAAAAAG 3 72 0 

ATGTACnCGC TTCACTCTCA ACACGTATTT CTAACCAATC AGATGTTACA AGTGTACCGC 3 78 0 

40 CAATGACTCC ATCAAAATCG CCACCTGATT CTTTTATCAC AGGTGCAGAC TCACCAGTAA 3 84 0 

TTGCAGATTC ATCAACGGTT GCTAATCCAT TTATTACAAC GCCATCAGCA GGGATTGTTT 3 900 

CTCCATTTTC TACCCGAATA TTTTGTCCGG CTTTTAACTC TGTGGCGTTC ACTATCCGAT 3 960 

45 

ACGCACCATT TTCTTCTATC AATCGAGCAG TTAAATTTGA TTGTGCTTGT CTTAAACTAT 4 020 

CAGCTTGCGC TTTTCCACGA CCTTCAGCAA AGGCTTCTGA AAAATTAGCA AACAATATAG 4 0 80 

TTATTAATAA TATGATAAAA ATTGTAATCA AATAACCTCG CGATAGATAG CTAGTTCCAA 4140 

50 

AT ATGT CAGG AAAACATATT AATATCAACG TTAAAATCAT TCCAACCTCA ACGACAAACA 420 0 

TTATCGGATT TTTTATTAAT TGTTTAAGAT TCAGCTTATA AAAACTCATT TTCAAAGCTT 4260 
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AATACGTTTT ATATTTAATT CTTTACGCTT TTTATTAAAA ATACCTGTTG TTAAAATGAA 7 80 

ATAATTATCC tCAATCCAAT ATCGCGTGTT CATAATT CCG ACAATTTGAG AAATGTATGA 84 0 

5 

TATTAAAAAG AATACAAATA CAATACCTAT CCATAAATAT GATTCGGGAT TCGTATAATC 9 00 

AAAATCTTTC AATTGAAAGA TAATGAAAAT AAAAAAGACG ACTATGTTTT GTTTGATAGC 96 0 

w ATTGATTATG CCATTAAAAT ATGAAATCGG ATGTAATTTT TGAGGTTCAG ACATCACTTT 10 20 

CAACCCCTCT CAAATTCGAC ATAGTTCTCT CTTCGATTAT TTTAACATCG TCATGAGACA 10 30 

TCATCGGTAA ATAAATAGTA TGACCTGCAG TCATAAATCC AACTTTATAC AAATTAAGCA 114 0 

15 CTTTACTAAT TGGATTAGAT TTAATCGACA AGTATTGTAA ACGTTCAATT CGACTCGTTT 12 00 

CTTCTTTATA TATAAAAAAT GATGTACGAT ATTGTACACT TAGTTGATCA ACTTTATAAA 126 0 

AGCGACAATG ATATTGCCAT AAAGGCTTAA TAAATAATTT TAATGTACTC AGAGCACCTA 13 2 0 

20 AAACCAACAA AATA7AAAGT AAGTAATGTG GC CATTCAAA TCTTAACCAT ATAAAATAAA 13 80 

AAATGACATA CACAGCTACA CTCAATATAA ATTCTAAGCC ATTCGTAATG TAGTAATACA 14 4 0 

ACAATGCTGA CTTAGGACTC TTAGTCAACT TAGTATAATC TGACATATAC CCCTCTCCCC 15 0 0 

25 

AAATAAAAAA TTATACGGAT TTATAATCTA TTTCATTTTA TTTTTATATG ATGATAATTA 1560 

TAGCATATGG AATATTTCAT GCTAATTTAT TCTTCCTAAA GGTACATCTA AAAATTTAAT 1620 

TAAGCAGAAA GTGCTTGAAT TGCTAAAAAG ACACCATGTT ATAATTTTAT CAACATGATG 16 8 0 

30 

CCTTTCATCT ATAATCAATC TTTCATCTTA TCAAGAGCGA TATTTAGTTC AAGCACATTC 174 0 

ACATAATCAT TTGTTAACAC ACCACGCTGC TTACGATGTT GAATCAAGTC GGCCACTCTT 18 00 

35 GAAGTAGATA CATGACGAGC ATCAGCAATA CGAGGTGCTT GCTTCAATGC ATTTTCGACC 186 0 

GTAATATGCG GATCTAAGCC CGACCCAGAA CTTGTTGCAG CATCTATTGT TACATTTGAA 192 0 

TTCCCAAATT TAACATGATG TTTCATGCGT GCTATTAATT CGGTGTTTCC ATTCGATTCA 198 0 

40 TTACTTCCAC CTGAAGATAC GCCGTTTTTA TATAATTTTT CAGGATTCAT ATTATAATCA 2 04 0 

ACTGCACTCG GTCTCCCGTG AAAATATCGT GTCTCTGTCC AGTGCTGTCC AATCAATTTT 2100 

GATCCAACTA TACGATTGTC ATACGTAATT AAACTGCCAT TTGCTTGTTG ATAAAAAAAT 216 0 

ATTTGACCAA TTAACGTGAT AGCTAACGGG AATAAAAATC CACATAATAC CATAGTTATT 2 22 0 

ATCGTTAAAC AAATACTATT TCTTATCGTA TTCATGGTAC AGGCTCCTTC CTCTTTACAC 22 8 0 

AAAAAATTGT ACAATCATAT CTATTAATTT AATGCCTAAA AACGGGACGA TTAATCCACC 2 34 0 

50 

TAATCCATAA ATCAACATAT TATTTATAAA GATTCTATCA ATGCTGTAAC CCTTTACTTT 24 0 0 
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GATCTTATGC CTCGAAATTT CTCGCTAATA GGTGACTTTG GTTGTTCATG GACAATTAAA 13 80 

CTTGATGTAC TTCyTCGTGT ATTCGTCATG GTAATTCCTC GTAAATTAAA ATTTTTGTAT 144 0 

5 TGAACCTAAA ATAGGTAATC CTAGTTGCGA TTCAACATCT TCTTCTGTCT TAATACGCTT 1500 

ATCTAATAAT TCTTTTAAGA AAATAATCAA TATTGCTAAA ACAATACCAA CAATAATGCT 1560 

GATAACTAAG TTGACAGATA CTATTGGAGA TACTTTTACA GCATTATCAT GTGCTGAGGA 162 0 

10 

AAGTATCGTA ACATTATCAA CACTCATAAT TTTAGGCATG TCATGAGCAA AAACTTTAGA 16 8 0 

TATTTTATTA ACAATTTTGT CAGATTCAGA TTTATTCCCA GTGGTAACTG ATACAGTAAT 174 0 

AATTTGAGAG TTTGTTTGAT TGGTTACTTT TAAAAATGAA TTCAACTCAG CTGTTGAATA 18 00 

15 

CTGACCATCA AnTTCTCTAG ATACTTTATC TAGAATTCTA GGACTTTTGA TAATTTCCGT 18 6 0 

ATATGTATTA ACAGACTGCA AACTACTTTG AACATTTTGG AAAGCTAAAT CACTTGAGGA 192 0 

CTTTTTCATG TTCACTAATA TTTGAGTAGA AGCAGTATAT TTGTCAGGCA TAACAAAAAA 198 0 

20 

GGTT 1984 
(2) INFORMATION FOR SEQ ID NO: 140: 

■ <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

30 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 

CAAATCCCTT GGTGATGAtA AAtGtATTGC TGTGTAGCCA AATAATCTTC GTATATATGA 60 

35 CTGACGTTCA ACAACAGCTT GCAATCGTTT CGTTGGTACA GTTACTTTCT TCTTGTTAAA 120 

GAGACCATAT TCAATTTTAA GTTGCTCATT TTCAAGCATC ACCGAAAAGC CATAAAATCT 180 

TATCATTGTT ATAATCGTTC CAATAATATA TGCCACTATT AATACTAGTA AAATGATGAT 24 0 

40 TAATACTGAA ATACTTACAA TTTGAACCCA TTGACTAATT TCATGATTTA GCTTCGACCA 300 

TGGGATCAAC TCTCTTACAG CCCCGTAAAT CGGTACTAAA GCTGCTAACG TTACACCAAT 36 0 

GGCGCCACTG GTCATTGCCA TAAATAGTGA TTCTTTAAAA TTCATCTGAT ATATAGGAAT 420 

45 

GCGTTTATTT TTCTGATTAA G CAT ACT AT C AGTGTTCTGC ACTTCATCTA AGCGACCTTC 4 80 

TGCGATGTCT TCCACATTAC CTTCAATGTC ATGATTACAG TTGTCATTCT TCTCAGCACT 54 0 

AGACTTTTGC GCCACTTCTG TCTTCAACTC TGTTTGCAAT TGATCAATAT ATCGTTCAAG 600 

50 

ATATTCACCT TGTTTTTTCG AAATAACACT TAAGACAATA CCATCACTTG GTGTTTTGAT 660 
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10 



CCAGATGCTT TGAACTTATC GCAATTGTTG AGTGCGAAAG GTATCGAACA TGACTTTATA 7 8 60 

CCTGGATATT ACCAATTCCA TATTTATCCA GTATTTCCGA 7 9 00 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

15 

GTCTAAATAA ACAAAATTAT CATTGATTaC TGAACTGGCA TTTCGAAGTA ATGCTTCAAT 60 

ATCATTCGAA TATTTCTTCA ATTTATGATT GTGAAATAAT TCTTGCATCA AAAATGGTCT 120 

TTGGTCACAT GAATGTGCAT CTGAAGCTAC AAAATGAGCC AAATTACATT CTATAAATTG 180 

20 

TAATGATAAC TTTTGAATGT TTTTACCAAA TCCACCAACT AAAGAACTCG ATGTTAATTG 240 

ACTCAGTGCC CCATTTGCAA CCAATTCATA TAATATTTCC GGATTTTTGG CGATACTTCT 300 

ATTTCTTTCA GGATGTGCAA TGATTGGTAT GTAACCTCTC GATTGTATTT CAAAAAACAA 36 0 

25 

TTGTTTTGTA TAATGTGGTA CTTCGCCCGT TGGAAATTCA ATTAATAAAT ATTTCGAACG 42 0 

ATTAATACCT TGAATACTAC CATTATCTAA GCCTTTCAGA ATCGAATCTG TAATTCTAAT 4 80 

3Q TTCTTGCCCG GGAAATAATT TAATATCCAA TGCTTGAACT TCTGGATGCG TTCTTAACTC 54 0 

CGCCAATTTC ACAAGCACTT GTTGAAATGT ATTATCATAT CTCGGATGCA AATGATGAGG 6 00 

TGTCGCTACA ATACTTGTTA CACCTTCATC CTTAGCTTGC TTTAATAGTG CAATACTCTT 66 0 

3S TTCAATTGTT TTAGGACCAT CATCTATATC AACTAATATA TGGTTATGAA TATCAATCAT 72 0 

GATTCATCAG TCCCATAATA TGCATAGTAA CTAGCACTTT TATCTTTAGG CATTCTATTT 78 0 

AAGACTACAC CTAATAATTT AGCACCTGTT GCTTCAATAA GTTCTTTTCC TTTTTTAACT 84 0 

40 TCATCTCTAT TATTATTTTC CGAATTAACT ACGTAGACAA CATTGCCGGT AAACTTTGAA 90 0 

AATAATTG CG CATCTGTAAC TGTGTTCACT GGTGGCGTAT CGATAATTAC AAAGTTATAA 96 0 

TTCATCAATA ATGTGTCATA CAAATTTGCA AATGCCCTTG ATGTAATTAA CTCTGACGGA 102 0 

45 TTCGGTGGGA TTGGCCCAGA CGTCAAGACG TCTAAATCTT GAATTTCAGT TGAGATAATA 108 0 

CTGTCTTGAT AAGTTGACCA ATTTAGCAAT AAACTTGATA GGCCTTCATT GTTTGGCAAA 114 0 

TTAAAAATAT AATGCTGCGT AGGTTTACGC ATATCCCCGT CTACGATTAG TGTTTTATAA 12 0 0 

50 

CCTGCTTGCG CATATGCAAC TGCTAAATTT GCTGCAATTG TAGACTTACC TGCGCCTGGT 12 6 0 



10 



15 



20 



25 



EP0 786 519 A2 

AATAAATTAA CATGATTTTA AATCTATTTG TAAGATAAGG AGATTTGTCA TTATGACAAC 6060 

AGAAGGTCTA TTAGTTGCAG AGAAAGAAAT CGAAGTGAAT GGTTACGACA TTGATGCGAT 6120 

GGGTGTCGTT AGTAATATCG TTTATATTAG ATGGTTCGAA GATTTGAGAA CAGCGTTTAT 618 0 

TAATCAGCAC ATGAATTACT CAACAATGAT CAATCAAGGC ATTTCACCTA TACTTATGAA 624 0 

AACGGAAGCA GAGTATAAAG TACCTGTCAC AATACATGAC AAACCAGTAG GTCGTATTTA 6300 

CTTAGTTAAA GCAAGCAAGA TGAAATGGGT GTTTCAGTTT GAAATTGTGT CCGCACATGG 6360 

CGTGCATTGT ATTGGTACAC AGACAGGCGG TTTTTACAGA TTGAGTGATA AGAAGATAAC 6420 

CTCTGTGCCA CAAGTGTTTC AAGACATTTT AGCAACAAAA TAATGACTTC ATTTTAAAAT 6480 

ATAAAAAGTA AGAAGGTGTT CGAAATGGTT AAGCAATTAA ATAGTGTCGA AGCATTCCGT 654 0 

GAATTTATTC ATCAATATCC GTTAGCAGTT GTACATGTCA TGCGCGATCA GTGTAGCGTG 6600 

TGTCATGCCG TTTTACCACA AATTGAAGAC TTGATGCAAT CATATCCCAA TGTGCCATTA 6660 

GCTGTGATTA ATCAAAGTCA GGTGGAAGCT ATTGCTGGAG AATTAAATAT TTTCaCTGTA 6720 

CCTGTGGATT TAATTTTTAT GAATGGAAAA GAAATGCATC GTCAAGGGCG TTTTATCGAT 6 780 

ATGCAACGTT TTGAACATCA TCTTAAGCAA ATGAATGATA GTGTAAATAA CGATGTCGAT 6840 

GAGCATTAAT ATCGCAAATG ATTAGCATTG CTAAGATTAT GTAGACATCA TAACTTATTT 6900 

CCCAGTAAAT ATTGGTAGTA ATTAGAATCA GCATGGTACA GTAGAACTAT AGTAGAAATC 696 0 

ATCAAAGAGG AGTGACGACA AATGCGTAAA AAATGGTCTA CACTTGCGTT TGGATTTTTA 7020 

GTTGCAGCAT ACGCACATAT TAGAATTAAA GAAAAACGCA GTGTGAAAAG TTATATGTTA 7 080 

GAACAAGGTA TACGATTATC TAGAGCTAAG CGTCGTTTTA TGTATAAAGA AGAAGCGATG 714 0 

AAAGCATTAG AAAAAATGGC GCCACAGACA GCAGGCGAAT ATGAGGGAAC CAATTATCAG 7200 

TTTAAGATGC CAGTAAAAGT GGATAAGCAC TTCGGTTCAA CCGTTTATAC CGTTAACGAT 7260 

AAACAAGATA AGCATCAACG CGTTGTATTA TATGCACATG GAGGCGCATG GTTCCAAGAC 7320 

CCACTCAAAA TTCATTTCGA ATTTATTGAT GAACTTGCAG AAACACTCAA TGCTAAAGTC 7380 

ATCATGCCAG TATATCCGAA GATTCCGCAT CAAGATTATC AAGCGACGTA TGTGCTTTTT 744 0 

GAAAAGTTGT ACCATGATTT ATTGAATCAA GT AG CAG ATT CTAAACAAAT CGTTGTAATG 7 500 

45 GGTGACTCTG CGGGCGGTCA AATTGCTTTA TCATTTGCTC AATTGTTAAA AGAAAAACAT 7560 

ATTGTGCAAC CAGGACATAT TGTATTAATT TCACCAGTTT TAGATGCAAC GATGCAGCAT 7620 

CCTGAAATTC CTGACTACTT AAAGAAAGAC CCAATGGTAG GTGTGGATGG CaGTGTGTTC 7680 

50 TTAGCTGAAC AATGGGCAGG GGACACACCT TTAGATAACT ACAAAGTATC ACCAATTAAT 774 0 
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AGCTTTGCGT GGCCACCAAT TCAAGATGGA TTAAATAGTT TATCGAATTT CTTATTAAAT 42 6 0 

AAAAATTTAA CATTAACAAC GTTTATATTC GGTATTATTG AACGCTCATT AATTCCATTT 432 0 

GGTTTACATC ATATTTTCTA TTCACCGTTC TGGTTTGAAT TCGGAAGTTA TACAAATCAC 4 38 0 

GCAGGTGAAT TGGTTCGTGG TGACCAACGT ATTTGGATGG CACAATTGAA AGATGGCGTA 44 4 0 

CCATTTACTG CTGGTGCATT TACTACTGGT AAATATCCAT TTATGATGTT TGGTTTACCA 4 500 

GCGGCGGCAT TTGCTATTTA TAAAAATGCA CGACCAGAAC GTAAAAAAGT CGTGGGTGGT 4 56 0 

TTAATGTTAT CAGCAGGATT AACTGCATTT TTAACTGGTA TCACTGAGCC ATTAGAATTT 4 62 0 

TCATTCTTAT TTGTAGCACC AGTACTTTAT GGAATTCACG TATTATTAGC TGGTACATCA 4680 

TTCTTAGTAA TGCATTTATT AGGCGTTAAA ATTGGTATGA CATTCTCAGG TGGTTTCATA 4 74 0 

GATTATATTT TATATGGTTT ATTAAACTGG GATCGTTCAC ACGCATTATT AGTTATTCCA 4 800 

GTCGGTATTG TATATGCTAT CGTGTATTAC TTCTTATTCG ACTTTGCAAT TCGTAAGTTT 4 860 

AAATTUAAAA CACCAGGTCG TGAAGATGAA GAAACTGAAA TTCGTAACTC TAGTGTCGCA 4 920 

AAATTACCAT TTGATGTCTT AGATGCAATG GGTGGAAAAG AAAACATTAA ACATTTAGAT 4 98 0 

GCATGTATTA CACGTCTACG CGTAGAAGTG GTTGATAAAT CAAAAGTAGA TGTAGCAGGT 504 0 

ATTAAAGCTT TAGGCGCATC AGGTGTATTA GAAGTTGGAA ACAATATGCA AGCTATCTTT 5100 

GGTCCAAAAT CAGATCAAAT TAAACATGAT ATGGCCAAGA TTATGAGTGG TGAAATTACG 516 0 

AAACCAAGTG AAACGACAGT GACTGAAGAA ATGTCAGATG AACCAGTTCA CGTAGAAGCA 52 20 

CCTGATCAAG TATTCGCTGG TAAAATGATG GGTGATGGTG TTGGCTTTAT CCCTGAAAAA 534 0 

GGTGAAATTG TAGCACCGTT TGATGGTACA GTGAAAACAA TCTTCCCTAC GAAACATGCG 54 0 0 

ATAGGATTAG AATCTGAAAG TGGCGTCGAA GTACTTATTC ATATTGGTAT CGATACAGTG 54 6 0 

AAACTGAATG GTGAAGGATT CGAAAGTCTG ATTAACGTTG ATGAAAAAGT AACACAAGGT 552 0 

40 CAACCATTAA TGAAAGTGAA TTTAGCATAC TTGAAAGCAC ACGCACCAAG CATCGTTACA 55 8 0 

CCAATGATTA TTACAAATCT TGAAAATAAA GAACTTGTCA TTGAAGATGT ACAAGATGCT 564 C 

GATCCAGGTA AGCTAATTAT GACAGTCAAA TAATGATTAA AAATGAAACA GCATATCAAA 5^00 

45 TGAATGAACT TTTAGTCATT CGTAGTGCGT ATGCGAAGTA GCGAGTTGAA AGAGAATACG 5 7 60 

TTACAAAAGG CAGTAGCTTA AAATGAAGCT ACTGCCTTTT TAGTGCGCAA TGATGTATAG 53 2 0 

CAGGTGTGTT GATGrTAATA AGTTAAATAT TAGTGTTAGA TATAGAAAAC ATTGCTTATG 58 80 

50 TTTTTGTCAC ATTTTAGAAA AATGCATCTT CGCGACTAGC CAAATTAATA GTCTCATTGA 5 94 0 
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CCATATGTGG ATTACTTACC TAAGAAAAAT ATTAAAGCCA TTCAAATTGA CACAAATCCT 24 60 

AAAAATATCG GACATCGTTT CAATATTAAT GTAGGAATTG TTGGAGATAG TAAAATTGCG 2520 

5 TTGCATCAGT TAACTGAAAA TATTAAACAT GTTGCTGAAA GACCATTCTT AAACAAAACG 2580 

TTAGAACGTA AAGCGGTTTG GGATAAATGG ATGGAACAAG ATAAAAATAA TAATAGTAAA 2 640 

CCATTACGTC CAGAACGATT AATGGCATCA ATCAATAAAT TTATTAAAGA TGATGCAGTG 2700 

10 ATTTCAGCAG ATGTAGGTAC AGCAACAGTT TGGTCAACTC GATACTTAAA C CTTGGTGT A 2760 

AATAACAAGT TCATCATTTC AAGTTGGTTA GGTACAATGG GTTGCGGTCT TCCAGGTGCA 282 0 

ATTGCATCAA AAATTGCATA TCCAAATAGA CAAGCCATCG CAATTGCTGG TGACGGTGCA 2880 

TTCCAAATGG TAATGCAAGA CTTCGCTACA GCAGTACAAT ATGATTTACC TTTAACTGTA 2940 

TTTGTACTTA ATAACAAACA GTTAGCATTT ATTAAATATG AACAACAAGC AGCTGGTGAA 3000 

TTAGAATATG CAGTTGATTT TTCTGATATG GATCATGCAA AATTTGCTGA GGCAGCAGGT 3060 

GGTAAAGGTT ATACAATTAA GAGTGCTAGC GAAGTAGATG CTATAGTCGA AGAGGCATTA 3120 

GCACAAGATG TACCAACGAT TGTAGATGTA TATGTTGATC CTAATGCTGC GCCATTACCA 3180 

GGTAAAATTG TAAATGAAGA AGCGCTTGGT TATGGTAAGT GGGCATTTAG ATCAATTACT 3240 

GAAGATAAAC ATTTAGATTT AGATCAAATT CCACCAATTT CAGTGGCAGC AAAACGTTTC 3300 

TTATAACTGA TTTAAAGGTT ATCACAATTG AATTGAACTA TAAAAACGGT AATTTCTATT 3 360 

TCAACAAAAT GGGAATTGCC GTTTTGTTTA TTTATCACAA ATGATCGTAC TGAATTGATG 3420 

ATAAAATTGT GAAAAAGTTG TTGAAAACGC TTTTACAAAT ATGTATAATA GCTATGAATT 3480 

AGATATCACT TGCGTGTTAC TGGTAATGCA GGCATGAGCA AACAACCGCA CTATGAGAAT 3 540 

AGTCTTGTTT GTTCATGCCT GCTTTTTTTG TACATGGAAG CGGAAATTGA GATAGGGGAT 3 6 00 

GTTTATATGT TTAAGAAATT GTTTGGACAA TTGCAACGTA TCGGTAAAGC ATTAATGTTA 3660 

CCTGTTGCGA TTTTACCAGC AGCTGGTATT TTATTAGCGT TTGGTAACGC AATGCACAAC 3720 

GAACAATTAG TAGAAATTGC ACCATGGTTA AAAAACGATA TCATTGTAAT GATTTCGTCG 37 80 

GTCATGGAAG CAGCAGGACA AGTTGTATTT GATAACTTGC CATTATTATT TGCAGTTGGT 3 84 0 

ACAGCACTTG GATTAGCAGG AGGAGACGGT GTTGCAGCAT TAGCAGCGCT AGTAGGTTAC 3 900 

45 TTAATTATGA ATGCAACAAT GGGGAAAGTG TTGCACATTA CAATTGATGA CATTTTCTCA 3 96 0 

TATGCCAAAG GGGCAAAAGA ATTAAGTCAA GCAGCGAAAG AACCAGCACA TGCTTTAGTA 4 020 

TTAGGTATTC CAACGTTACA AACGGGTGTG TTTGGTGGTA TTATCATGGG TGCTTTAGCC 4 080 

GCATGGTGTT ACAACAAATT TTATAATATT ACACTACCAC CATTTTTAGG ATTCTTTGCA 414 0 
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ATTGTACCGC TAACTTGGCT AGAAGACGGT GCAAACTTTT TATTAAAGAC GATGGTCTTT 66 0 

TTCTTCATAC CGTCAGTTGT AGGtATTATG GaTGtgCTTC CGAAATTACG CTAAATTATA 72 0 

TACTCTTTTT CGCAGTCATT ATCATAGGAA CATGTATCGT TGCATTATCT TCAGGTTATA 78 0 

TTGCTGAAAA AATGTCyGtT AAACwTAAAC ATCGTAAAGG TGTAGACGCt TATGAATGAT 84 0 

TACGTGCAAG CCTTATTAAT GATTTTGTTG ACTGTCGTTT TATATTATTT CGCTAAAAGG 90 0 

TTACAACAAA AATATCCGAA CCCATTTTTG AATCCAGCAT TAATTGCATC TTTAGGAATT 96 0 

ATTTTTGTCT TACTTATCTT TGGAATTAGT TATAACGGGT ATATGAAAGG TGGCAGTTGG 102 0 

ATCAACCATA TTTTAAACGC AACGGTCGTA TGTTTAGCGT ACCCACTTTA TAAAAATAGA 10 80 

GAGAAAATTA AAGACAATGT CTCTATCATT TTTGCAAGTG TATTAAcTGG CGTCATGCTG 114 0 

AATTTCATGT TAGTGTTCTT AACACTTAAA GCATTTGGCT ATTCTAAAGA CGTCATTGTA 120 0 

ACGTTATTGC CCCGATCTAT AACAGCCGCA GTAGGTATCG AAGTGTCACA TGAACTAGGT 126 0 

GGTACAUATA CGATGACCGT ACTTTTTATT ATCACAACGG GTTTAATCGG TAGTATTTTA 1320 

GGTTCGATGT TATTAAGATT TGGAAGATTT GAATCTTCTA TCGCCAAAGG ATTAACGTAT 1380 

GGGAATGCGT CACATGCATT TGGCACAGCT AAAGCACTAG AAATGGATAT TGAATCCGGT 144 0 

GCATTTAGTT CAATTGGGAT GATTTTAACT GCAGTTATTA GTTCAGTGTT AATACCTGTT 150 0 

CTAATTTTAT TATTCTATTA ATTTAGATAT TTAAAATGAT AGACAGAAAG GGAGGCTATT 156 0 

AGTAATAATG GCAAAAATAA AAGCAAATGA AGCATTAGTT AAAG CATTAC AAGCaTGGGA 162 0 

TATAGATCAC TTGTATGGTA TTCCAGGAGA CTCAATCGAC GCATAGTCGA TAgTTTACGT 16 8 0 

ACAGTGAGAG ATCAATTTAA ATTTTATCAT GTACGTCATG AAGAAGTAGC AAGCTTAGCG 174 0 

35 GCTGCTGGTT ACACAAAATT AACTGGTAAA ATCGGTGTGG CATTAAGTAT CGGTGGCCCT 1800 

GGTTCAATTC ATTTATTAAA TGGTATGTAT GATGCCAAAA TGGATAATGT ACCGCAATTA 1860 

ATATTATCTG GACAAACGAA TAGTACAGCA CTTGGAACGA AAGCATTCCA AGAAACAAAT 1920 

40 TTACAAAAAT TATGTGAAGA TGTAGCCGTT TATAATCACC AAATTGAAAA AGGTGACAAT 19 80 

GTGTTTGAAA TCGTTAACGA AGCAATTCGT ACGGCATATG AACAAAAAGG TGTAGCTGTT 2 04 0 

GTTATTTGTC CTAACGACTT ATTAACTGAA AAAATTAAAG ATACAACGAA TAAACCAGTA 2100 

GATACATCAA GACCAACAGT AGTATCACCA AAATATAAAG ACATCAAAAA AGCGGTTAAA 216 0 

CTAATTAATA AAAGTAAAAA GCCTGTCATG TTAATTGGTG TAGGTGCGAA ACATGCGAAA 2 2 20 

GATGAGCTAC GTGAATTTAT TGAAATGGCT AAAATTCCTG TCATTCATTC ATT AC CAG CT 2 2 80 

AAAACAATCT TGCCGGATGA TCATCCATAT AGTATCGGtA ACTTAGGTAA AATCGGTACC 2 34 0 
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(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



ATAATTATTA 


ACATGGTGTG 


TTTAGAAGTT 


ATCCACGGCT GTTATTTTTG TGTATAACTT 


60 


AAAAATTTAA 


GAAAGATGGA 


GTAAATTTAT 


GTCGGAAAAA GAAATTTGGG AAAAAGTGCT 


120 


TGAAATTGCT 


CAAGAAAAAT 


TATCAGCTGT AAGTTACTCA ACTTTCCTAA AAGATACTGA 


180 


GCTTTACACG 


ATTAAAGATG 


GTGAAGCTAT 


CGTATTATCG AGTATTCCTT TTAATGCAAA 


240 


TTGGTTAAAT 


CAACAATATG 


CTGAAATTAT 


CCAAGCAATC TTATTTGATG TTGTAGGCTA 


300 


TGAAGTTAAA 


CCTCACTTTA 


TTACTACTGA AGAATTAGCA AATTATAGTA ATAATGAAAC 


360 


TGCTACTCCA 


AAAGAAACAA 


CAAAACCTTC 


TACTGAAACA ACTGAGGATA ATCATGTGCT 


420 






CCCATAACAC 


ATTTGACACT TTTGTAATCG GACCCGGTAA 


480 


CCGCTTTCCA 


CATGCAGCGA 


GTTTAGCTGT 


GGCCGAAGCA CCAGCCAAAG CGTACAATCC 


540 


mTTATTTATC 


TATGGAGGTG 


TTGGtTTAGG 


aAAAACCCAT TTAATGCATG CCATTGGTCA 


600 


TCATGTTTTA 


GATAATAATC 


CAGATGCCAA 


AGTGATTTAC ACATCAAGTG AAAAATTCAC 


660 


AAATGAATTT 


ATTAAATCAA 


TTCGTGATAA 


nA 


692 



(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

ATACTGTAGC GCAAATTTCA CAATGGCATG TTATAGAAGA TTTAGTTACG AATGAATTAG 6 0 

GTATTAGTAT TTTACCAACA TCAATTTCAG AgCAACTAAA TGGAGATGTG AAGCTGtACG 120 

CATTGAAGAT GCTCATGTAC ATTGGGAATT AGGTGTTGTT TGGAAGAAGG ATAAACAATT 180 

AAGTCATGCC ACAACGAAAT GGATAGAATT TTTGAAAGAC CGTTTAGGCT AACATATTAA 24 0 

TAAAGCACTC ATTATTTAAG GCGCATCATT ACGTGGGTCA TTGAAATAAT GAGTGTTTTT 300 

TTGTGAAAAT GAAGTGAAAT TT AG AG AG CG TTTCCATAGA AAATAGTAAT ACAAACTATA 36 0 

AAAAAAGAGT ATTTTTATAT TGTGTACGCC ATCTTTATAA TAGTTATTGT AACAATTTAG 420 

ACATATTTAG AAAGGGATGG CGCCATGCAC AAAGTCCAAT TAATAATCAA ACTACTACTA 480 

CAACTAGGAA TCATCATTGT GATTACTTAT ATTGGCACAG AAATTCAAAA GATTTTTCAT 54 0 
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GCAAGCTCTC 


TAATACGCTG 


CTGTAATGTT 


TCATAGTTGT 


ATACAGTTGT 


CTCTGTGAAG 


10380 


ATTTCTCCAT 


CTGCTTTAAA 


ACGAAtGaCA 


GTACCTGTCT 


TAtCAGTnGT 


GCCAACTTCT 


10440 


TTTAAGTCAA 


ATTGAGGTAC 


ACCTTTTTTA 


TATGCTTGAT 


GATATATAGT 


CTCATTTCTG 


10500 


TGTACATATA 


CTTCTAAGTC 


TTGTGACAAT 


GCGTTTACAA 


CTGATGAACC 


AACACCATGT 


10560 


AAACCACCAG 


ATACTTTGTA 


TCCGCCACCG 


CCAAATTTAC 


CACCAGCATG 


TAAAACAGTT 


10620 


AAAATAACTT 


CGACAGCTGG 


ACGTCCCATT 


TTTTCTTGAA 


TATCAACTGG 


GATACCACGT 


10680 


CCGTTATCCG 


TTACTTTAAT 


CCAGTTATCT 


TTTTCAATAA 


CAACTTCAAT 


TTGATTTGCA 


10740 


TAACCaGCTA 


ATGCTTCATC 


GATACTATTA 


TCGACAATTT 


CCCACACTAA 


ATGGTGCAAA 


10800 


CCTCTCTCTG 


AAGTCGATCC 


TATATACATA 


CCTGGTCTTT 


TACGTACTGC 


TTCTAAACCT 


10860 


TCTAATACTT 


GTATTTGCCC 


AGCACCATAA 


TTATCCGTGT 


TGTTTACATC 


TGACAATGCA 


10920 


GTCACCATCG 


CTTTCTGTTA 


CTTTATAATT 


TCACCTTGAT 


TAATACGATA 


CAATTTAGCG 


10980 


TTATTCATGA 


TTTCATGATC 


AATACCATCT 


ACAGATGTCG 


TAGTGACAAA 


TGTTTGTACT 


11040 


TTATGCTGAA 


TCGTACTTAA 


TAAATGCGTT 


TGACGCGAAT 


CATCTAATTC 


ACTGAGTACA 


11100 


TCGTCTAATA ATAAGATGGG ATATTCCCCA ACTTCGATAT 


TCATTAACTC 


AATTTCAGCT 


11160 


AATTTAATGG 


ACAAAG CCGT 


TGTACGTTGC 


TGTCCTTGAG 


AACCATATGT 


TTGAGCATCC 


11220 


ATGCCATTCA 


CATCAAAACT 


TATATCATCT 


CGATGTGGTC 


CGAATAAGCT 


AATGCCTCGT 


11280 


TCTTTTTCTC 


TTTGCATATT 


ATCGCTAAGA 


ATAGACATAA 


TTTCTTCAAG 


TCGTGCCGCT 


11340 


TCATTTTGAG 


CATAATCAAA 


TTTAAGACTA 


GGTAAATAAT 


TCAGCGACAA 


CGCTTCTTTA 


11400 


TCATTTGTGA 


TACCAGCATG 


AATCGGTTTA 


GCTAACGACT 


CTAGCTCTTG 


AATAAAATGT 


11460 


GCACGTTTAT 


CAGTTACTTT 


CATTGCATAT 


TCAGCAAACT 


GCTGATTTAA 


TACTTCCAAC 


11520 


ATTGTTAAGT 


CCTTTTTTTG 


GCCTAATTGT 


AACTGCTTTA 


AGTAATTATT 


CTTTTGCTTT 


11580 


AAAATACGTT 


GGTATTGAGC 


TAAATCATTT 


AAGTAAACAG 


CAGAAATTTG 


GCCCAACTCC 


11640 


ATATCTATAA 


AGCGTCGTCT 


TATTtGrGGr 


GAGCCTTTTA 


CAATATTCAA 


ATCTTCTGGC 


11700 


GCAAATAGAA 


CCACATTGAG 


GTGTCCAATA 


TATTGAGTTA 


GACGACTTTG 


CTCTAAGTGn 


11760 


ATTCACTTTG 


GACTTGTTTA 


CCTTTnTTAG 


TTATAAACAT 


TGTTAATGGG 


CATCGTGCCG 


11820 


TGT 












11823 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

^TRANPEDNESS - i ^ : M 
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AAATCTATTG TATCTTTATT AATATCACGT AACAGTTCAA GTGTGATTTT AGTCATACGC 85B0 

GCTTCAGTAT AACGCATTGC TGCTGCGCCA TCTCCATCCA TTGAACCAAA GTTACCTTGG 864 0 

CCATCAACAA GCGGATAACG ATAACTGAAA TCTTGAGCCA TACGTACCAT TGCTTCATAA 870 0 

ATAGATGAGT CACCATGAGG GTGATATTTA CCCATTACGT CACCAACGAT ACGTGCTGAT 8 76 0 

TTTTTATATG ATTTATCCGG TGTCATACCT TGTTCATTTA ATCCATATAG TATACGACGA 882 0 

TGTACTGGTT TTAAACCGTC ACGAACATCT GGCAATGCAC GAGCAACGAT AACACTCATC 88 8 0 

GCATAATCTA AAAATGATTC ACGCATTTCA CTGGTAATAT TTCGTTCATT TATTCTTGAT 894 0 

TGAGGTAATT CAGCCATCAA GAGTTCCTCC TTCAAAAGTT CAGTTCACAG CGCTTAGAAG 9000 

TCTAAGTTTG CATAAACTGC ATTATCTTCT ATAAATTGTC TACGGTTTTC TACAACGTCA 9060 

CCCATTAACA TTTCAAATGT TTGGTCCGCT TCAATCGCAT CTTCAAGTTT TACTTGTAAA 9120 

AGAGCGCGGT GCTCAGGGTT CATTGTTGTT TCCCAtAATT GATCTGCATT CATTTCTCCA 9180 

20 

AGACCTTTGT ATCGTGCAAT AGACCATTTT GGTGTTGGAT TCAATTCAGA TTTAAGTTTA 9240 

TCAAGTTCCC TATCATTGTA TACATAATAC TTTTGTTTAC CTTGTGTCAG TTTATACAAC 93 00 

25 GGTGGCTGTG CAATATACAC ATAGCCTGCT TCAATTAACG GTCTCATAAA TCGATAGAAG 9360 

AATGTTAATA ACAATGTTCT AATATGCGCT CCATCCACAT CGGCATCAGT CATAATGACG 9420 

ATTTTGTGAT ATCTTGCTTT CGCTAGATCA AAGTCGCCAC CGATTCCTGT ACCAAATGCT 94 80 

30 GTGATCATTT GACGAATTTC ATTGTTATTC AAAATTCTAT CTAATCGTGC TTTTTCAACA 9540 

TTTAATATCT TACCTCGTAA TGGTAAAATC GCCTGCGTTC TAGAGTCACG ACCAGATTTT 96 00 

GTAGACCCCC CGGCAGAGTC CCCTTCGACT AAGAAAATCT CACATTCTTC AGGACTTTTA 96 6 0 

35 CTAGAGCAAT CGGCTAATTT ACCTGGAAGG CTTGCTACAT CTAACGCTGA TTTACGACGT 9720 

GTTACTTCAC GCGCTTTTTT CGCAGCAACA CGTGCACGTG CCGCCATAAT ACCTTTTTCA 9780 

ACCACTGTAC GTGCGACTTG TGGATTTTCA TATAAAAATC GTTCAAAGTG CTCTGAGAAT 984 0 

AATTTATCTA CAACTTGACG CACTTCAGAA TTACCTAATT TTGTCTTCGT TTGACCTTCG 9900 

AATTGAGGAT CACCATGTTT GATAGATATA ATTGCTGTCA TACCTTCACG TGTATCTTCA 9960 

C CAG AAAGTC TATCTTTTTC TTCTTTCATA ATCTTGCTAC TTAAACCATA ACTATTTAAG 10 020 

ACACGCGTTA ATGCACGTTT GAATCCGTCT TCATGCGTAC CACCTTCATA CGTATGAATG 100 80 

TTATTTGCGT AAGTTAAAAG ATTTGTGGCA TATCCTGAGT TATATTGAAT CGCAATTTCT 1014 0 

ACTTCAATAT CATCTTTAGA TTGATGAATA TAAATTGGCT CATCATGAAT AGGTTCTTTA 10200 

TTTTCGTTCA ATAACTCAAC GTACGATTTA ATACCGCCCT CATAGTGATA GGAGTCTTCT 10260 
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GTAGCTGTTT TAATACCTTT ACCACCACGA TTTGATAAGC GATAGTCATT AACTGGCGTA 6 78 0 

CGTTTACCAT AACCATTTTC AGTAACTACT AATACTTCAT CAACACTGTT TGCATGAGCT 6 84 0 

5 

ACATCAAGCC CTACAACTTC GTCACCTTCA CGAAGTGTAA TACCTTTCAC ACCCGTTGCT 6 900 

GTACGGCCTA AAGGACGTAA TGTTGATTCA GGGAATCGAA TTAATGATGC ATGTGATGTA 6 96 0 

CCAATCAAGA TATCTTCTTG ACCACTTGTT AAGCGAACTG CAATTAACTC ATCAT CTTCT 702 0 

70 

CTGAACGAAA TCGCAATCTT ACCATTTCTA TTTATTCTTG AGAAGTTACT TAATGCTGAA 7 0 80 

CGTTTAACGA CACCACGTTT AGTTGCAAAC ACTAAGAAGT TGTCTTCACT TTCAAGGTCT 714 0 

^ TTAACAGCAA TCATTGTACT AATGACTTCA TCATTTTCAA GTTCAATAGC ATTCACTACA 72 00 

GGAATACCTT TAGACTGTCT TGATAACTCA GG CACTTCGT AACCTTTAAG TTTGTATACA 72 60 

CGACCTTTGT TAGTAAAGAA CAATACATGG TCATGTGTAC TTAAAGTTAC CAATTGACTG 7320 

20 ACAAAATCTT CTTCCAATGT ATT CAT AC CT TGAACACCAC GACCACCACG GTTTTG AG CA 73 80 

CGATATGTAG ATACCGGCAA ACGTTTAATG TAGTTATTAT GGCTTAGTGT AATTACTATT 7440 

TGTTCTTCTG GAATTAAGTC TTCGTCCTCT AAGTCTTCAA ATCCACCTAA TTGAATTTCT 7500 

25 GTACGACGAT CATCACCGAA ACGATCTCTA ATTTCAGTCA ATTCATCTCT AACTAACTGT 756 0 

AATAACACTT CTTCATCAGC TAAGATTGCT TCTAATTCAC TAATATAATT TAATAACTCA 76 2 0 

TTATATTCAG CTTCAATTTT GTCTCTCTCT AAACCTGTTA GACGTCTTAA ACGCATGTCT 76 8 0 

30 AAAATAGCTT GAGCTTGTTT TTCAGAAAGT TTGAAGCGTT GTTGCAAGCT TTCCATTGCA 774 0 

ACTTTATCTG TATCTGACTC ACGAATCGTT GAAATAATTT CATCGATATG GTCAAGTGCG 78 0 0 

ATACGTAATC CTTCTAAAAT GTGGGCACGA TCTTTAGCTT TACGTAAgTT GTATTGCGTA 7 86 0 

35 CGTCTTCTAA CAACTGTCTT TTGATGCTCT AAATAATGTA CCAACGCTTC TTTTAAATTA 7 92 0 

ATAAGCTTCG GTCTACCATT TACAAGTGCA ATCATATTCA CACCAAATGA TGTTTGAAGA 7 98 0 

GGTGTTTGTT TGTATAAGTT ATTTAAAATG ACACTAGCAT TTGCATCCTT ACGCACATCA 8 04 0 

40 

ATAACGACAC GCACACCAGT ACGTAAACTT GTTTC^TCAC GTAAATCAGT GATACCGTCA 8100 

ATTTTCTTGT CACGAACGAG CTCTGCAATT TTTTCAATCA TACGAGCCTT ATTCACTTGG 816 0 

AAA GG AATTT CAGTGACAAC AATACGTTGA CGTCCGCCTC CACGTTCTTC AATAACTGCA 8 22 0 

45 

CGAGAACGCA TTTGAATTGA ACCACGACCT GTTTCATATG CACGTCTAAT ACCACTCTTA 82 80 

CCTAAAATAA GTCCAGCAGT TGGGAAATCA GGACCTTCAA TATCCTCCAT TAACTCAGCA 8 34 0 

5o ATTGAAATAT CAGGGTTCTT ACTTAAGCTA AGTACACCAT TGATTAATTC TGTTAAGTTA 84 0 0 

TGTGGTGGAA TATTCGTTGC CATACCTACC GCGATACCTG ATGCACCATT GGCTAATAAG 84 6 0 
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TCCTCATTTT CAATAATACG TTCAACTACC GCTCTACTTT TTTTGACACG TTCTAACGCA 4 98 0 

TCATCAATAA TTTCAATCTT TGATTGTTGT TGTAAAAATG ATTTAATATC CTCAATTGTT 504 0 

5 AGTGTTTCAC CATCTAAATA TAAAGTCATA TATGTTACCC CCTTGTTTAT ATTAAGTAAC 5100 

CCATCCTTCT TGAAGTATAC GTTTTCATTT TTATTGAAAC AATGGTTTTA CGTACATTTA 5160 

TAACCTATTA TCAGAGCACT ATTGTAGTGC GTTAAAGGAT ATTAAGATTG TTGTAAGCAT 5220 

10 

ATTTAATAAT TTATCTATTG ACGAATTGCA TATACAGGTA TAGTATTTTC TATTGTATTT 5280 

AACGACAAAT AATAATGAAT TCAGAAATTT ATAATACATT TTGTTAAAAG TTACTATATA 5340 

TTTTTAAAAT TGAATAAATT CGGAAAAGGC TTTTACATGG GAGGTTATAT CACTATGGAA 5400 

15 

ACGTTAAATT CTATTAACAT TCCTAAGCGT AAAGAAGATT CACATAAAGG TGATTATGGC 5460 

AAAATTTTAT TAATTGGTGG ATCTGCTAAC TTAGGTGGTG CCATTATGTT AGCGGCTCGT 5520 

GCATGTGTAT TTAGCGGTAG TGGTTTAATC ACTGTAGCTA CACATCCAAC AAATCATTCA 5580 

20 

GCATTACATT CTCGTTGCCC AGAAGCGATG GTTATTGATA TTAATGATAC GAAAATGTTG 5640 

ACGAAAATGA TTGAAATGAC TGACAGTATA CTAATTGGTC CAGGTCTTGG CGTTGATTTC 5700 

25 AAAGGAAATA ATGCCATTAC ATTCCTACTA CAAAATATAC AACCGCATCA AAATTTAATC 5760 

GTAGACGGCG ATGCGATTAC AATCTTTAGT AAACTGAAAC CGCAATTACC TACATGTCGT 5 820 

GTGATCTTTA CACCACACCT CAAAGAATGG GAACGATTAA GTGGTATTCC TATTGAGGAA 58 80 

30 CAGACATATG AGCGTAATCG TGAAGCAGTT GATCGTTTAG GTGCAACTGT TGTACTTAAA 5 940 

AAACATGGTA CTGAAATTTT CTTTAAAGAT GAAGACTTTA AATTGACAAT CGGTAGCCCA 6000 

GCAATGGCGA CTGGTGGTAT GGGCGATACA CTTGCTGGTA TGATTACAAG CTTTGTCGGT 6060 

35 CAATTTGATA ACTTAAAAGA AGCGGTTATG AGTGCCACAT ATACACATAG TTTTATTGGC 612 0 

GAAASCCTTG CAAAAGATAT GTATGTGGTG CCACCATCAA GACTTATCAA TGAAATACCT 6180 

TACGCAATGA AACAATTAGA AAGTTAGTCA TTACTAATCA TTGAATATAG TAAAGCATTA 624 0 

40 CTTTCTAGCA TAAAAATAAG ACTCCCCTAC ATATAGGGAA GTCTTATTTT TTATTATTCT 63 00 

TCATCTGATG ATTGTTGTAT ATCTTCTTCA ACACGATCCA TGAAATCTTG TCTTACTTCA 6360 

ATACGTCCAT CTTCATCATT TTCTTCTGAA TCAATCACTT CAGTATGAAT TGCATTTCCT 6420 

45 

GGTGTTTCAT CATTTaCAAC CGCTTCACGT TGTTGTTCAG TACCATCTTC AGATACAGTT 64 8 0 

GAAGTAGATT GCTCATCTTC ATTCGTTTCA TCTTCTGCAT CTTCTTTTAC TTTAGCAACC 654 0 

GTTGAAACAA ATTGATCATC ACCTAAGCGA ATTAAGCGAA CACCTTGTGC TGCACGACCA 6600 

50 

TTTTGAGAAA TATCTGCAAC ATCTAGTCGA ATAATGACAC CTGCATTAGT AACAATCATT 6660 
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TCAATTTTGC TCTTAACTGT GTCAGGCTCA TTTCTGAATA ATCTAATGTC TAACATTAAC 31B0 

CTTCATCCTT TCCCAAATAA TTATCATTTA TTATGGAATG ACGTACGTCT TTATTTTTTA 324 0 

GAAAATAAAA AAAGACCACA TCCCTACAAG GGACGTGGTC TACGCGTTGC CACCCTATTT 33 00 

AACAATTTAA GTTATAAAGA TACACTAAAC CTAAATTGCA CTTCACTAAA ATAACGGTTA 33 6 0 

TCACCGATTG TTCTTTTAAA TTAAGTAGGT AGATTCATAT ATATGTTGAT TCTTGTTCAC 34 20 

ACTAACCACA AGCTCTCTGA TATCGAACAC TATATATTAC TTGTCCTACG AACAATGTCT 34 8 0 

TATTAAGTTA TTTTTAATAT AGCAAACTAT ATTTGCTTTT TCAAGTAACG ATTTCAAACA 3 54 0 

TCACTCATGT CGATTTAGTG ACATGCAGTC GTTTGATAAA TTGATTGCTT TAAATACTGT 3600 

GCAACCGCTT CAATATCTTT ATGAAATTGA CGATCATGTG TAATGGATGG CACGATACTT 3 660 

CGAAACTCAT CATACTTGCG ACGTGTTTTT GGTGATAATC CTTCAACACC TTTTAACTCT 3 72 0 

GCTGCTTGTA ATGCAATAAC ACATTCGATT GCCAGCACAC GTCTTGCATT TTCAATAATT 3 7 80 

TGATAACCAT GTCTAGCAGC TGTAGTTCCC ATAGATACGT GATCTTCTTG GTTCGCAGAT 3 84 0 

GAAGTGATAG AATCAACACT CGCTGGATGC GCTAAAGTTT TATTTTCAGA AACGAGACTT 3900 

25 GCAGCAGCAT ATTGCATAAT CATCGCGCCA CTTTGCAATC CTGGCTCTGG ACTAAGAAAT 3960 

GCTGGTAAAT CACCATTTAA TTGAGGATTT ACTAGTCGCT CTAGACGACG TTCCGATACG 4 02 0 

TTTGCTAATT CACTTACACC TAATTTAAGA TGATCTAATG CAAAAGCAAT AGGTTGTCCA 4 08 0 

30 TGGAAGTTAC CACCTGAAAT AACAAACGTT TCATTTGCTT CCTCAAATAT AAGTGGATTA 414 0 

TCATTAGCCG CATTCATTTC AAATTCTAAT TGCTGTTTAA CATAATTGAA TACTTGAAAA 4 2 00 

CTCGCGCCAT GGATTTGTGG TATACAACGC AACGTATATG CATCTTGTAC ACGTATTTCT 4 260 

GATTGTCGCG TCGTTAATGT TGATCCTTCT AACCAATCAC GCATACGCGC TGCCACATTA 4 320 

ATCTGTTCTT GAAAATTACG AACTGCGTGC ACATCATGTC GATATGCATC TATAATGCCA 43 80 

TTAAGAGACT GATGCGTTAA TGCAGCAATC CATTCAGATT GGTAACCTAA ATCTTCTGCT 44 4 0 

TCTATATAAC TAATGACACC TTGAGCTGTC ATAGCTTGCG TACCATTAAT CAATGCTAAA 4 500 

CCTTCTTTAG CCTGAAGGTT CAAAGGTTGT CTATTTAATT CTCTTAATAC ATCGTCACTA 4 56 0 

TCCTTTTCTT CCCCTCTGTA CAATACTTTC CCTTCACCAA TTAATG CTAA TGCTAAATGT 4 62 0 

GATAATGGCG CTAAATCTCC TGATGCACCG AGAGAGCCTT GCTGTGGGAT TATCGGTATA 4 68 0 

ATACGTTCAT TTATAAAAAA TTGTAATTGT CTCACTAATT CTAAAGTGGC ACCTGAATGA 4 74 0 

CCTTTTAATA ATGTATTCAA TCGTAAAATC ATCATGACTA ATGCTACTTC TTTTGAAAAT 4 800 

GGCTCACCTA GTCCACAGGC ATGTGAGCGT ATCAGATTCA CTTGTAATTC ATTATATTGC 4 86 0 
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TAAGCAATAA CATTAGACAT CAGTTTGTCT GAGGTTAGAC ATTCCGGAGT CTTTAGTCAG 13 80 

CTTCATATTA ACTTTTTATT TTTGAGAATT TTCAATTTTT TATTTAAGAC TACCTCCATA 144 0 

TTTTCTATGG aTTTGTAGTT GTTTTTAAGT ATCAATTTTA TAAATTTTTA TATCTGATGA 1500 

TGAGTCTGGG aTATTGaTTC ATGTACCACT CCCTTaTaAT CATCCCCTCC CCCTaCCCTA 1560 

CTCCATCGAT ATAACTCATA CTACATATCA ACGAAATCAG TATTTTATCG CTTCCTTTCC 1620 

TATATTAGTG ATGCTCAAAC TTGTTACGTT TTAGATTGTT TTAGTTCATC ATAATTATCC 16 80 

CGTATTGTTG CTATAATGAA ATGCGTTCAC CCCATTAAAC CACAAACTTA ATTTATTGTT 174 0 

GTTATGTGCA TTGGCTCACT ATTATATTTT TACAGCACAA AAAAAGTGGC GACAGTTCGT 1800 

CACCACTTTT TAAAATATTA TTTAAAGTAT CTTGCCCTTG CTTTAAGTAT ACGTAGATAT 1860 

ATACTTTTTA AAGCTTGTAG CTAAAGCCTT TATTTAACTG GTTTTGAAAT TTGTGTTTTA 1920 

CCACCCATAA ATGGTACTAA TGCTTCTGGA ATTGTTACTG TTCCATCTTC ATTTTGGTAA 1980 

TTTTCAACAA TAGCAGCAAA TGTACGTCCA ACTGCTAAAC CACTACCATT TAATGTATGT 204 0 

GCTAATTCTG GTTTAGCTGC TTTGTCACGC TTGAAGCGGA TGTTAGCACG ACGCGCTTGG 2100 

AAATCCGTAC AGTTTGAGCA TGAACTAATT TCTTTATAAT CATTGTAGCT TGGTAACCAA 2160 

ACTTCTAAAT CATATGTTTT GCTTGCACTA AATCCAATAT CACCTGTACA TAAAATAACA 2220 

CGACGGTATG GTAAACCTAA CTCTTCTAGA ATTGCTTCTG CGTTTGTTGT CATTTCTTCT 228 0 

30 AAAGCATTCC ATGAATCTTC AGGTTGTTCA AAACGTACCA TTTCCACTTT ATCGAATTGA 234 0 

TGTAAACGAA TTAATCCTCT TGTATCTCTA CCTGCTGATC CTGCTTCACT ACGGAAACAT 24 00 

GCAGATTGAC CAGTGAATTT TTCAGGAAGT ACACCTGGTT GAATAATTTC ATTACGGTAG 2460 

AAATTCGTTA ATGGTACTTC AGCAGTTGGA ATTGTATATA ATCCTTCTTT TTCTACTTTA 2 520 

AATJtfVATCTT CTTCAAATTT AGGTAATTGA CCTGTACCAT ACATTGTATC TGCGTTCACA 2580 

AGCTGTGGTA CCATCATTTC TGTATAACCA TGTTGTGTTG TATGTTTTGT AATCATATAG 264 0 

TTCATTAAAG CACGCTCTAA TTGCGCACCT TCATTTGTTA AATATACAAA ACGCGCACCT 2700 

GAAACTTTTG CTGCACGATC AAAATCAGCC ATTTTCAATT CTTCTACAAT ATCCCAATGT 2760 

GCTTTGGGTT CAAATGAAAA CTCaCGTGGT GTACCCCACT TTTTAACTTC AACGTTATCT 2 820 

TCATCAGATT CACCTTGAGG TACATCATCA CTTATTAAAT TTGGAATACG ACAAAGGATA 2880 

CCTGTCATTT TATTATCAAT TTCATTTAAT TGACTATCTT TTTCTTTAAT ATCGTCACCT 2 940 

AATGTGCGCA TTTCAGCAAT CACATCATCA GCATTTTCTT TATTACGTTT TTTTAATGCG 3 000 

ATTTCTTCGC TTACTTTATT ACGACGTGCT TTCATTTCTT CTGTTGCACT AATTAATTTA 3 06 0 
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AGATTTCTTG AAGGCAGGTA TACCATTGAC AATTGTAGGG aATAtctAgT GaTAGTTTTT 1500 

AGCATGACTT ATTGGAAATG GGTAAGGTTG CnTTAATTAA 154 0 
5 (2) INFORMATION FOR SEQ ID NO: 136; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11823 base pairs 

(B) TYPE : nucleic acid 

10 (C) STRANDEDNESS : double 

CD) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

15 

ACTTCTCACA ATAAGAAATA TGAAATTGTT ATGTGTTAGT TGAGATTCAG TGATGAATTA 60 

CTTTTATCAT TTAAAATGTT GTTATCATTG TCATGCGTTA CCAAATCGCT TACGTATACA 120 

2Q CGATTCCCAA TCTTAACATA GACGATTTGT ATATCAGAAT TTT CTGATTA CTAACAGTTT 18 0 

ACCTAAGTTT AAATATCTGT TCAATGATTT TCAGTTATTT TTAAAAGAAA AATCGTAATG 240 

CTGCCATGAT AACAATCCCA CTAATAATTG TAATAGTTAA AtACGCGTGA TTATAGATAA 3 00 

2$ AATAACCGTC GGAATGAGCG CGATAATGTA AGGGATGTTT AATGTATACC CCTCACCATG 3 60 

AGGCGTCTGT TGAATAATGC TGTCAATGAC AAGTGCCGTA AATAGTGTGA TTGGGATAAA 4 20 

TGATAGCCAT CGAACCACGA CATCAGGCAA TTGCACTTTT GAAATCATGA TAAAAGGTAT 4 B0 

30 AATTCGAATT AATAGCGTTA CGATACCACA CAATAAAATA AGTATTAACA TGTTCATATG 54 0 

AGTTATCATT GTTCCATCAT CACTCCTAAC GCTGCTGAAA TTGTGGCTGC AATTAATATT 600 

GCTAGATATG AAGGCATAAA CATACTTAGC GATAACATCA TTACTATGAC GGCAATAATG 660 

35 AGTACTATGT AAATTCTTAA TCGCGATTTA GTAATTGATT CAAATTGCGC AATGGCCAAA 72 0 

AAGATAAACA TAGCCGTGAT AGCAAAATCT AACCCTAGCG TTTGCGGATT TGAGATATAT 780 

TCGCCAAATA AAGCCCCAGC TACACATGAA ATTGCCCAAA ATAAATATGC TGTGATGTTA 84 0 

40 

AGACCATGCA TCCAACGATC ATTGATAGCT TCTCCTTTTA AATAAGGTGT AATGGCGACG 9 00 

CCAAACGTTT CGTCAGTTAC TAATGAACCT AATCCAACAC GGTTCCAAAA CCCATATGTC 960 

TTGAAGTTTG GTGCAAGCGA CATACTTAAA AGGAACATTC TTGAATTTAC GATAAATACA 102 0 

GTTAGTACAA TCGCTGATAT AGGTGTACCT GCTATAAACA ACGCGCACAT AATAAATTGC 10 8 0 

GCAgcaCCGG CATATATAAC AAGACATAAC AAGACAATTT CTAAAATACT AAAGTTTTGA 114 0 

so GACGAAGCCA CAATACCAAA TGAAATACCA ACACCGGCAT AACCCAATAA TGTTGGGATA 120 0 

CACTCTTGCA CGCCTTGTCT AAAACTTAAA TGTGTTGTCA TCTCAATTAC CTCCTTTGCC 12 60 
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[2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



w 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TGTAGTTGAA CATGAACAAC AAAAGAAAGA AAAGACAAAA AAGCAATACA AG CCATTTTG 60 

GATTGTCATG AGTTTTATAA TACTTATAGT TGTACTATTA CTCCCGGCAC CTTCAAGTCT 120 

GCCGATAATG GCTAAGGCAG TACTAGCTAT TTwAGCTTTT GCAGTTATTA TGTGGGTAAC 180 

GGAAGCTGTA TCATATCCGG TGTCAGCAAC TTTAATTATT GGCTTAATGA TATTACTTTT 240 

AGGATTTAGC CCTGTTCAAA ATTTAGGGGA GAAGCTAGGT AATCCGAAAA GTGGCAGTGC 3 00 

TATTTTAGCT GGAAGTGACC TTCTAGGAAC TAATCATGCA TTATCATTAG CGTTTAGTGG 360 

ATTTGCAACT TCAGCTGTAG CTCTCGTTGC AGCTGCATTA TTTTTGGCTG CTGCTATGCA 420 

25 AGAAACGAAT TTGCATAAAA GACTAGCTCT TTTAGTGTTA TCAATTGTTG GTAATAAAAC 480 

TAGAAATATA GTTATTGGAG CAATTATCGT TTCAATTGTA CTTGCATTTT TCGTTCCTTC 54 0 

TGCAACAGCT AGAGCAGGGG CAGTTGTACC AATCTTGCTG GGTATGATTG CGGCATTTAA 600 

AGTTTCCAAA GATAGCAAGT TAGCGTCTTT ATTAATAATT ACTTCAGTAC AAGCTGTGTC 6 60 

AATTTGGAAT ATTGGTATCA AAACGGCGGC AGCACAAAAT ATCGTAGCGA TTAATTTTAT 72 0 

AAACCATCAA TTAGGATTTG ATGTTTCATG GGGCGAGTGG TTCTTATATG CAGCGCCTTG 780 

GTCCATAGTT ATGTCCGTAG CTTTATATTT CATCATGATT AAAGTGATGC CTCCAGAAAT 84 0 

TAATSCAATA GAAGGTGGTA AAGATTTAAT AAAAGAAGAA TTGCATAAAC TTGGCCCCGT 900 

TAGCCCACGT GAATGGCGTT TAATTGTTAT ATCGATGTTA TTATTACTGT TTTGGTCAAC 960 

TGAAAAAGTA TTACATCCGA TTGACTCTGC ATCCATTACT ATTATTGCTT TAGGTGTTAT 1020 

GTTAATGCCG AAAATTGGTG TCATGACATG GAAACATGTT GAAAATAAAA TACCATGGGG 1080 

AACAATTATC GTGTTTGGTG TAGGTATTTC ACTAGGTAAC GTTCTTTTGA AAACAGGTGC 114 0 

AGCTCAATGG TTAAGTGATC AAACTTTTGG TGTTTTAGGT TTAAAACATT TACCTATTAT 12 00 

CGCGACAATT GCACTTATCA CGCTTTTTAA TATATTGATT CATTTGGGCT TTGCGAGTGC 12 60 

50 AACAAGTTTA TCATCAGCGT TAATACCTGT TTTTATTTCG CTAACCTCTA CGTTACACTT 1320 

AGGAGACCAG TCTATAGGAT TTGTTTTAAT TCAACAATTT GTTATTAGTT TTGGTTTCTT 13 80 
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TACAGTTGCA ATTTTGGTAT AACCACCTAT CGTTTGTTTA TCATTAAGCA GAATAATAGG 24 6 0 

TTGACCATCA TTTGGTACCT GAACACTACC AAGAGCAACC GGTTCAGAAA TGATATCTGC 252 0 

5 TTGATTAAtT GGTGCAACGC TGTCACCTTC CAAACGATAG CCCATACGGT CTGATTGTTC 25 8 0 

AG T AATTAAA TATGGATGAT TTACAATTTT CGCTCTAGCC TCTTCAGAAA ATGCCTCGAA 2 64 0 

TTGAGGTCCT TGAAGAATGT GTATAATATT ATTTTCTGGC AATAAATCGT CCTGTAAATG 2 70 0 

w 

AATCGTCTTT CCAATGTTTT CTTTAAAGTC ATTATTTATT TTCACTGTTA TTACATCATC 2 76 0 

AGCTAATAAC TTTCTACCTT TGAAT 2 785 

(2) INFORMATION FOR SEQ ID NO: 134: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
on (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



30 



AATGGAAACG 


GTTGAAACAG 


CAATTATTAC 


TATTTCTATG 


GGTGAAGGTA 


TTTCAGAGAT 


60 


ATTTAAATCA 


ATGGGTGCCA 


CACATATCAT 


TAGTGGTGGA 


CAAACGATGA 


ATCCTTCTAC 


120 


AGAAGATATC 


GTTAAAGTCA 


TTGAACAATC 


AAAATGTAAA 


CGTGCAATTA 


TTTTAC CGAA 


180 


TAATAAAAAT 


ATCTTAATGG 


CAAGTGAACA 


AGCAGCGAGT 


ATTGTTGATG 


CAGAAGCTGT 


240 


TGTTATTCCA 


ACGAAATCTA 


TTCCTCAAGG 


TATAAGCGCA 


CTATTC CAAT 


ATGATGTGGA 


300 


CGCAACACTT 


GAAGaAAATA 


AAGCGCAAAT 


GGCTGATTCA 


GTAAATAACG 


TTAAATCTGG 


360 


TTCATTAACG 


TACGCTGTTC 


GTGATACGAA 


AATTGATGGC 


GTTGAGATTA 


AAAAAGACGC 


420 


GTTTATGGGC 


TTGATTGAAG 


ATAAGATTGT 


AAGCAGCCAA 


AGTGATCAAT 


TAACAACGGT 


480 


TACTGAGTTG 


TTAAATGAGA 


TGTTAGCAGA 


AGATAGTGAA 


ATATTGACTG 


TGATTATTGG 


540 


TCAAGATGCA 


GAGCAAGCAG 


TTACAGATAA 


CATGATAAAC 


TGGATCGAAG 


AGCAATATCC 


600 


AGATGTAGAA 


GTGGAAGTTC 


ATGAAGGTGG 


ACAACCAATT 


TATCAATATT 


TCTTTTCAGT 


660 


AGAATAAAAA 


TTTAAAATAA 


AAAACTACCA 


ATGATAAATC 


ATCAGTTGGT 


AGTTTTTTAT 


720 


TTTGCTATTT 


TAGTGATATT 


GCGGG TTAAA 


AGTATCGTTC 


TCGAGTTGCT 


AACAATGTCA 


780 


TGTTCAACTT 


AGTCATGATA 


AAATAAATAA 


CATACTAAAT 


GATACGTAAA 


AT C AAAT AAA 


840 


ACATAGGTGA 


TTTATTTTGG 


CTAAAGTAAA 


CTTAATAGAA 


AGTCCATATT 


CTCTTTTACA 


900 


ATTAAAAGGT 


ATAGGTCCTA 


AGAAAATAGA 


AGTATTGCAA 


CAACTAAATA 


TT CAT A CAGT 


960 
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AACCGTGATA TAGTAAAGAA TCGACTCGCA 
TCACTTTACC TGGTGTTGGT TGAAATTTCT 
5 CTATCACATG ACCATTAAAT TGAATATCGC 

ATAAATACAG TTGTGCTGCA ACCAAATCAC 
CAACTTGTAT TCGAGCATTC ATTTCAATAA 

10 

CAATCGTACC TGCACTTCTA TAATTTGCTG 
TTTGTTGTCG TCTTTCTTCA GTTAATGCTG 
TTTTACGTTG TACAGAACAA TCACGTTCCC 

15 

CCaTAACTTG AACTTCAACA TGTTTTGcAA 
CATCATCAAA GTATTTTTTT CCTTCACTTT 

2Q CAGCTTTCTT TACAATACGT ATACCTTTAC 

CTGGATAACC GATGTCTTTG GCAAGATTCT 
TTGATCCTGG AATCACAGGA ACACCTGCAT 

25 CCCCCATCAT TTCCATCGTT TTTTTAGTAG 

CGGTTTGAGC AAATTTTGTT GATTCTGATA 
CACCAGTGAT TTGTGCAGCA GATATGATGC 

30 TArcwTCCCC AATACATATA GCTTGATCTG 

CTTTTGCATA AACTGCTACA GTTTCAATCC 
CAGCAATTTC ACCTCTGTTC GCAATTAAAC 

35 ATACGTACCA AAACTTGGTC GTATTCAACA 

ATrtCTCCAG CAACATCTGT TGTTACCTCG 
ATAATATCTC CCTTGTTAAC TTTGTCACCG 

40 

CTATCTTGTA AAAAGAATGT ACCTATCATT 
GAAACATCGG AGTTATCATT CGCTTTTGAA 
7TTGATTGAT TACTGTGTGC AGCCAAATGA 

45 

TCTTCAAAAT TTTTATATTT AAATTTCTTA 
TGTTCGATTT nTTCAATATT CATTTTACAA 
50 CGAAGTATGT CGCAAGCTAG ATGTATCAAA 

TTCATTAAAC AGAGACATTT GTTCCCGATT 



CATTAAAGCC TTGAGGTAAA TGTAACGCTG 66 0 

TTTCaGGATT TTCGGCATTT ATTCTCGCTT 720 

TTTGTGAAAA AGGTAAATGA TTATGTTCCA 780 

GTTCTGCTCG CATCTCTGTA ACAGTATGTT 84 0 

AGTAATGTGC GGTATCAGTT ACTAAAAATT 900 

CACGTGCAAC TTTAACAGCA TCGTTACATA 960 

CACAAGGAGA TTCTTCGATT AATTTTTGAT 102 0 

CTAAATGTAC ATAATTATCC TGCCCATCTC 1080 

CAGGTATAAA AGCCTCAACA TAAACACGAT 1140 

TAGCTTCTTT AAATGCCTTT TCTAAATCTT 1200 

CACCACCGCC ACTGGCAGCT TTGATAACAA 1260 

CAATTTCAGA CACATGATTC ACAGCACCAT 1320 

GATGAACTGT TTGTCTTGCT GTTATTTTAT 1380 

GCCCTATAAA CGCTATGCCT TGTTCCTCAA 144 0 

AAAAGCCATA TCCTGGGTGA ATTGCATTAG 150 0 

GGTCAATATT TAAATAACTA TCTAAAgCAT 1560 

CTAAATGTAC ATGCAAGCTT TGCTCGTCCC 1620 

CATATTCTCT GCAAGCTCTT ATAATCCTTA 168 0 

AACGAAGCAT TTACTTACCC CCTTTACTTA 174 0 

TTTGTGCCAT GATCAGCTAC TATTTCAGTA 1800 

TTTAATACTT TCATCGCTTC AACATATCCT 1860 

ACATTCACAA TTGGTTCAGT TAATTCTTTA 1920 

GGTGATTTAA TGTCATGATA ATCATTTGTC 1980 

GCTGTCAAAT CATTATTGTT CATACTTTGA 204 0 

TTCGAGTCAG TGAAGTCAAT TTCTATTTCA 2100 

ACATCATTTT CCTTCACTAA TTTGATTATT 2160 

ATCCCCTTTT AAAATTGTTG CTAATTTTTT 2220 

AATTGGAGTC TTTTGATGAC TCTTAAGAAT 2280 

CTTATCTACA GCTTCTTGGA ATGATATCCA 234 0 
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CAATCATTTT CGCCACAATA CCATATATAA TCATTAAAAT TGGTAAAATG GAGAATGACA 3 900 

ATTTTAATTC TGCACTGTTT AAATTCACAA TAACTAAAGA TGGGAGTGTG ACATTAAGAA 3 960 

CTAATGTAGC AATGACTTGA CTATCTGTTG CTTTTATAAA ATTAATGCGC TTCAAAAAGT 4 020 

AACCAAGCGC AATTAATAAA ATAATCATAG TAAATTGTTC TGTCACTGTT ATCCCTTCTT 40 8 0 

TCAATAATCT TCATAATTTA TAACTTTAAC ATACTCCACA GATATTTTAG AAGTCTACTG 414 0 

TTTCATGCTA TAATCTACAT TAAATGCACT TAATTATATT TCAAAGGAGT GTTATAGTAT 42 00 

GTCTTTAGAA AACCAACTAG CCGAACTTAA ATATGATTAT GTTCGTCTTC AAGGTGACAT 4260 

AGAAAAACGG GAATCTTTGA ATTTAGATAC TTCCGCACTT GTTCGTCAAC TTAAAGATAT 4 32 0 

TGAAAATGAA ATTAGAAACG TTCGTGCTCA AATGCAAGAT TAATAATCTA TCATTCAAGC 4 3 80 

AATAAATGCT TTTTGTTACA TAAATTTGAC TAGCATTGCT CTGAATACGT TATATTGATG 44 4 0 

AATTGCTTCA TTTTTCGCTC AATTACATCT AGAATCACAA GATGTTGTCG TGTTATGATT 4 5 00 

TAGTGTTTCA TTAACAACAT ACACGCATAT CTATCCCAAC ACTGCTATTT ATGTTTTCTA 456 0 

CGCTGnTGTA CTACATGAAC CCTTTGAAAC GGAGAGGAAG TTATCATATG CAATTTTAnC 4 62 0 

25 TGATTTTACT AGCAATACTT TAACnAATTG nTAGTTTAAT AGAATTTTA 4 66 9 

(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2785 base pairs 
30 (B) TYPE: nucleic acid 

{C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTTGCACCCA TCTGaTACAA TGCACCATGC GGTTTAACAT GATTAATTTT AACTTGATGA 6 0 

ATGCGACAAA ACCCTTGTAA TGCACCTAAT TGATAAATCA TCAAATTATA AATCTCGTCG 12 0 

TTAGAGATAT CTATATTTCG TCTGCCAAAG CCTTTCAAAT CAGGTAAACC AGGATGTGCA 180 

CCTACTGCAA CATTATGTGC TTTGGCAAGT TTTACCGTTT CATTCATTAC ATTTTCATCA 24 0 

CCAGCGTGAA AACCACAAGC AACATTCGCA CTTGTAATTA ACGGAATAAT TTGATGATCA 3 00 

CCACCAAAGG AATAATTTCC AAATGCTTCG CCTAAATCAC AATTCAAATC AACTCGCATT 3 60 

ATAATTCCAC CCCTTTAACA ATTTGATGTT TTTCTAAAAA TTTAATATCA ACATCTTTTG 4 20 

CATCTCCATC ACGATATAGT GGATAATTTA AAACTGCATA TAAAAAATCG GCAGTTGTAG 4 80 

AAAATCCATC TATCACCATT TCATCTAAGG TGACTTTCAA CTTATCAATT GCTGAAGCTC 54 0 



EP 0 786 519 A2 





TTCAATAATC 


CTACTGTAAT 


TATTGTCGGT 


CTTATTTCTG 


GTGCATTATG 


GGCGTTTGGA 


2100 




CAAGCGAATC 


AGCTTAAATC 


TATTAGTTTA 


ATCGGTGTAT 


CAAATACTAT 


GCCAGTTTCT 


2160 


5 


ACAGGTATGC 


AATTAGTTGG 


TACAACATTA 


TTCAGCGTTA 


TCTTTTTAGG 


TGAATGGTCT 


2220 




TCAATGACTC 


AAATTATCTT 


TGGTTTAATC 


GCCATGATAT 


TATTAGTTAC 


TGGTGTAGCA 


2280 




CTTACTTCAC 


TTAAAGCTAA 


AAATGAACGT 


CAATCAGATA 


ATCCTGAATT 


TAAAAAAGCA 


2340 


10 


ATGGGTATTT 


TAATTGTATC 


TACAGTTGGA 


TATGTAGGTT 


TCGTTGTACT 


TGGTGACATC 


2400 




TTTGGTGTTG 


GTGGAACTGA 


TGCATTGTTC 


TTCCAATCTG 


TCGGTATGGC 


AATTGGTGGC 


2460 


15 


TTTATCCTAT 


CCATGAATCA 


TAAAACATCA 


CTTAAATCAA 


CAGCACTTAA 


TCTATTGcCA 


2520 


GGTGTGATTT 


GGGGAATTGG 


TAACTTGTTC 


ATGTTCTATT 


CTCAACCAAA 


AGTTGGTGTA 


2580 




GCTACAAGTT 


TCTCATTATC 


ACAGTTACTT 


GTTATCGTTT 


CAACCTTAGG 


CGGTATTTTC 


2640 


20 


ATTTTAGGAG 


AAAGAAAAGA 


TCGTCGTCAG 


ATGACGGGTA 


TTTGGGCAGG 


TATTATTATT 


2700 




ATCGTGATAG 


CTGCTATAAT 


TCTAGGTAAT 


TTGAAATAGA 


AAGTTAAATA 


CTCATGTAAC 


2760 




GTAAAAATGT 


AATCACTTCT 


GAAAATAACC 


ATTCACTTAT 


AGAATGATTA 


AAATTAATTT 


2820 


25 


TCGGGAATTT 


TACGTTGAAT 


GTTCCTCTAT 


ATGTCCTAGG 


AAATACGTGG 


CTCTAAAAAC 


2880 




AAAACGCAAT 


AACACATCAT 


GACATTAATC 


ATGCGTTTTA 


AGACTTTAAA 


ATTAGCGATA 


2940 




CTTTTAAAAT 


CTTGATGATA 


TTCATATATC 


AAGTATGCGC 


CATACATATG 


AAGTGGATAG 


3000 


30 


CTGCATAACG 


CACTGCATTA 


TCAACTTGAA 


TGTATGAGTT 


GAACAACTAT 


GTCATAAATA 


3060 




AAAGCCCCCT 


TTTCACAATA 


TACATTTACA 


TATTGTGGTA 


AAGGGGGCTC 


TCATTTTCTA 


3120 




CGAATACTAA 


AATGGATTTT 


ATTTTCAAAT 


GTGTAAACTA 


GACAAACACT 


GCCTGATACA 


3180 


35 


CGTACAAAAT 


AATGATACTA 


ATAATGATTG 


TCAAATTGGT 


CGTCATACCT 


ATAAATGGCA 


3240 




GTGTTCGATA 


TTTAAACTGA 


ATACCATAAG 


AAATAATTGC 


AACACcTACC 


GGGAACATCC 


3300 


40 


AAGTGACCAA 


CAATGTCGTC 


TTAATCATAT 


CATCTGATAC 


TGGTAACAAG 


ACATATACTA 


3360 


ACAATCCCGC 


AACTAATGCT 


AATCCATAAT 


GCAAACATAA 


ATATTTAATA 


GTAGCAGGTA 


3420 




TATACTTTCT 


TTCCAGAGTA AAATTCAACA TGACACCTAG 


CAAAATCATT 


GATAACGGCA 


3480 


45 


TATTTGCATG 


GGAAAGTATG 


CTAAAGAAAT 


CGATTGCCAC 


ATGTGGTAAA 


TGGATGTGAC 


3540 




TTATATTCAA 


TATAAACATT 


ACAATGTATG 


TAACGAGTGG 


CACTGATTGT 


AATAATTTCT 


3600 




TACCTAAATA 


TTTAAAATCG 


AATTGATCAC 


TACCTTCACT 


AAAGTAGCTA 


CCTACAAAGT 


3660 


50 


AAGTAATTCC 


AAACATCACA 


AAGGCACCAC 


CTATATCAGC 


CATAACAAAA 


TAAATAAGTC 


3720 




CCGTTTTAGG 


CCATATCACT 


TCAATTAGTG 


GATATGCAAA 


CAATCCAATA 


TTCATAGCAC 


3780 
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ACGGATTGGC 


TTTTTGTT A C 

— ' X X X X VhJ X X AVr 


PA A PTTT 7A TP 


A 71 TV TV TV"* A A *P 

LAAAA I LAA I 


U 1 l\j%-Mf\AL- 1 A 


GTTCACCTTC 


300 


TCGTCCaGCA 


TCTGTTGCAA 




i™ TCAC1 1 I A 


TTRTPTR AAA 

1 1A1 L I AAAA 


T*TAACGCTTT 


360 


TACTGTTTTA 


AATTGTTTGC 




A TAT a AfTxAr'TA 
AA1AALAALA 


Vj I 1 1 1 \-J\ 1 A 1 


A TTTJV JV T" 

A I I 1AGGTAT 


420 


AATTGGAAGG 


TCTTCTAATC 




1 AAAI 111 IA 




CAGGTGTCGC 


480 


ATTTGTCACT 


AGATGACCTA 




vjAV^AA I A 1 A 1 


iLHjl 1AI III 


/"* A TV TV r^T 1 TV TV 

CAAACjTAAC c 


54 0 


ATTACGCTTC 


TGATTTATTT 


GTAAAGCATC 


AGCAATATCT 


L.I HjCVjAL.I\j 


ATGGTTTTTC 


600 


AGCTAATATT 


AAAGATTTCA 


TAAATTATCC 


TTTCTCATAC 


ullLlll 1 AI 


1 1 LbxAALb I(j 


660 


CTTCATCTAT 


TCCACTAATC 


TTTGATTTAA ATTCAATGAT 


TV/"* A TV A TV" AT* 

IvjLAAA IvjAT 


GTG TTAAATG 


720 


TATTGTAACA 


TGTTAATATC 

X VJ X X r\n X X V. 


ACTATTAACT 


TTCATTTCAG 


i i ■ I \r> TV TV TA <"PTV 

I IUAAAI AL.1 


7* m IV T1 TV TV T*TV TV TV 

ATATAATAAA 


780 


AGTAACAAAA 


AGTACGGAGG 


TAATGACATG 


AGCATAGTTC 


Ab 1 IAIATVjA 


TA T *» T* TV O TV /•'TV TV 

TATTACACAA 


840 


ATAAAATCGT 


T CATTG AA PA 


TTCGAATTAT 


GAATCAGCAT 


lAIALI 1A1A 


TT TV TV TV nW^OT 

TAAACTTCCT 


900 


CAACAGTACA 


ATGAAATAGA 
n x unnn x nun 


TGTATTAATA 


ACCGATGCGA 


TrY™* TV TV T*^ TV /"»/-*< 

1 IvjAAIC-ACC 


TXtGTGTATTT 


960 


TCGATTAAAG 


AAAA CYS ATTf 

rtrtrtrt Vw\j/\ 1 1 L 


AATCAAAGCA 


ATCATATTGT 


CTTTTGCATA 


CGATAAAaAT 


1020 


AAATTCAAAG 


T CAT AGfi PC P 


TTTCGTGGCT 


GACAATTATG 


TATTATCTG T 


CGATACGTTT 


1080 


GAAACGCTAT 




GACTTCGAAC 


CAACCTGACG 


ATGCCGTCTT 


TAACTTTTCT 


1140 


TTTGAAGAAG 


CJCATTCAAPA 


ATACAAACCA 


TTAATGAAAG 


TTATTCAAG C 


AAGTTATAAC 


1200 


TTCACTGACT 


ATTAPATACA 


AGCCCGTACA 


AGATTAGAAG 


7A TA 7A T'TV TOO A 

AAvjA 1 A I bLA 


TV TV iv TV m 

ccaaccaaat 


126 0 


AT CATTC CTT 


ATP A P AAA Cld 


GTTTTATCGT 


GCTTTCAGCA 


TV TV T V P PV /-^Tv 

AA1 TACACAC 


aactacattt 


1320 


AAATATCAGG 


CACAGTCAPP 


A A A/™* A m»trn 

A LAAOATATC 


ATTGATAGTT 


1 AbAL bAL LJ\ 


I CA1 CZATTTG 


1380 


TTTTTATTTG 


TTAGCGAAGG 


TTTACTTAAA 


GGTTATTTAT 


7v/™ , r* r r v rr , 7i & tat 
AV.L.I 1 UAAAi 


TV 1 TV 'rwi'r 1 TV O A TV 

I\iA 1 1 CACAA 


144 0 


cagtCaatcg 


CC GAG ATT A A 


ATACTTCAGT 


TCTCATGTAG 


TA 'P*!* TV PPPT*!"!' 
All Av-UU 111 


UAAACjCjI Ai C 


150 0 


GCTTTCGAGT 


TGCTTGCGTA 

* wl» X X VVV X #V 


TG CATTG CAA 


TATGCTTTTG 


Klnnl 1 1 lvjA 


T* Tv T'T' A O AAAA 
1A1 lAtjAAAA 


i c c n 


GTTTATTTTA 


AAATTCGTAA 

X X *— W X **** 


TAAAAATAAT 


AAACTCATCG 


■M/iV-Vj/il 1 IMA 


1 LtO I L I AvjVj 1 




TTCCATATCA 


ACTATGAGTA 


CATTAAATTC 


AAATTCGAAT 


rarrTa a c*ct 

CALb I AAL-bj 1 


A AA A f A T r""" A A 

AAAALrAl LAA 




ACAATCCCTG 


AAT AAAA PAP 


CAAGCAAATA 


CCCTACAGTA 


LA 1 LA 1 1 ALrV_ 


Albj 1 A 1 i Cj Itj 


174 0 


GGTTTTTrTA 

VJ\J X X X X A L X ** 


CTTTT~T("3T A A 
%~xlxxxi*jx nn 


ATATTGAAAA 


TTATAAGTAG 


1 I\j1"1 1 1"1~1A 


CT ATT AGGG C 


1800 


AGAATGCTTT 


ACAATAACAT 


GCAAGTGTCA 


ATTAAGGGGA 


GCACTTGCAT 


AAATAGTATA 


1860 


GGAGAGTGAG 


TAGTCTTGCA 


ATTTCTTGAT 


TTCTTAATCG 


CACTTTTACC 


TGCTTTATTC 


1920 


TGGGGAAGTG 


TCGTTCTTAT 


TAATGTGTTC 


GTCGGCGGTG 


GACCTTACAA 


CCAAATTCGT 


1980 
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TGTAATGAAT CAAATCAATA TCATTCATGT TCGATGATTT CTTCGCATTG TTTCTAGCTT 8160 

TAATTTATCA TTATTTAATT TTAATAACCA AGGAGATGAT AACGTCATTC TTTAGTACGC 8220 

TGTAATCCAT TCCCTTTTCA TCAAATTCAA ATTATAATTG TAATGCTTCT TCTACAGATT 82 80 

TATATTCCAT TTCAAATGCC TCTGCAACGC CTTTATTGGT TACGTGACCT TTGTAAGTAT 834 0 

TTAAACCTAA TGATAATGGT TGATTTGATT TAAATGCTTC TCTATACCCT TTATTAGCTA 84 0 0 

GCATGAGCGC ATAAGGTAGC GTAgCATTAT TTAAAGCTAA CGTCGAAGTA CGCGGTACTG 846 0 

CACCTGGCAT ATTTGCAACT GCATAATGAA CCACACCATG CTTAATATAT GTAGGATCAT 8520 

CATGTGTCGT AATTTTATCA GTTGtTTCAA AAATACCGCC TTGATCAATA GCAATGTCAA 858 0 

TAATAACTGA CCCATTTTTC ATTTGTTTAA TCATGTCTTC TGTTACAAGT CTTGGCGCTT 864 0 

TAGCACCTGG AATTAAAACT GCACCTATTA CTAAATCACT TTGTTTAACA TACAACTCAA 8700 

TATTCAACGG ATTTGACATA ATTGTATGTA CACGTCCACC GAATAAATCA TCTAATTGTT 8760 

GTAAACGCTT TGGATTAACA TCTAAAATCG TAACATCTGC ACCTAGTCCT AGTGCAATTT 8 820 

TAGCTGCATT TGTTCCTGCT TGACCACCAC CGATAATAGT TACTTTACCC TTAGGTACTC 88 80 

CTGGGACACC ACCTAGTAGA ATTCCCATAC CACCATTAAG TTTTTGTAGG AACTCTGCGC 894 0 

CAACTTGAGC TGACATTCTT CCTGCTACCT CACTCATTGG TGATAACAAT GGTAAAGATC 9000 

GGTCTGGTAA CTGCACAGTC TCATATGCAA TACTAATTAC TTTTCTATCT ATCAAAGCTT 9060 

30 GTGTTAATTT TTCTTCATTT GCTAAATGAa gatAaGTGAA TAATACAAGC CCTTCTTTAA 9120 

AATATGGATA TTCAGATTCA AGTGGTTCTT TAACTTTAAT AACCATATCC ACATCCCAAA 9180 

CTTTTGCTTG TTCAGCAACA ATCTCAGCAC CTGCTTCTTT GTAATCTACA TCTTCAAAGA 9240 

ATGATCCTGA ACCCGcATTT GTTTCCACTA AAACAGTATG 92 80 
(2) INFORMATION FOR SEQ ID NO: 132; 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

CTGATTAATC TCTTGTTGTC GTGTATTTAC TAATTGAATC GTTGGTGTCT GAACACGTCC 6 0 

CAGGGATAGC TGTGCATCAT ACTTTGTTGT TAGTGCACGC GTTGCATTAA TCCCAACAAT 120 

CCAATCTGCC TCACTTCTCG CTAACGCTGC ATAATACAAA TCGTTATATT GACGACCGTC 180 
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TTP-AAPTTPA 


aapappsTar 


CCATTGGCAT ACCTAAATAA ACTGGgAATA CCATTTTCAT 


6360 




GTGTAAAAPT 


TP A APT A TP A 
1 OAAL. 1 A 1\jA 


AATGCTTGAA 


CAAATTTAAC 


GCTTCCGAAA 


TCAAaGTTTG 


6420 


5 


CTTTAPPAPP 


A a TATTPATA 


CCATGAACAT 


TTTCAACACC 


GTGATATGAA 


GAAAGATAGT 


6480 




CAGP PATTTP 


TP-PAPTTPPB 


ATTACTGTTG 


CTCCTGTTTT 


CTTTGCTAGT 


TCCACAACAT 


6540 


10 


CAPPAAAATf; 


ATP A A A ATP A 
A 1 V-AAAA 1 UA 


CCGTGCGTTA 


AAACGATATA 


GTCTACCTGC 


ACTGTTTCAA 


6600 


TATTPAAATP 


A P A ptts/w 


TTATTTGAAA 


TAAACGGATC 


TACGATAACC 


TTTTTGTTGT 


6660 






ATA a A fppmm 

AIAAAIUjI 1 


GATTGACCAT 


GAAATGATAA 


CTTCATTTGA 


GCATCCTCCT 


6720 


15 


A TP & IV TT A PT 


A 1 A 1 AAA 1 1 1 


AGTACCCTTT 


TGCCACTTAA 


TTATAACAAA 


TTCTCAAATT 


6780 


i. 1 AAAAA1 1\j 


AAA lirrrwp»pm 

AAAAlL-lAtjl 


TAATGTATTA 


GCTCGATTTT 


GAAATCTAAT 


AATAATTGGC 


6840 




ata & a iTry & 


AU 1 AATATTA 


TGTTGAGGAG 


TGTTTATAAA 


ATGACaaAAA 


TATCAAAAAT 


6900 


20 




1 HjAAC-AATC 


AACAAGCTGA 


TGCAGCATGG 


attacaacac 


CGTTGAATGT 


6960 




niAl 1A1 111 


A PTP O A 1^ A /~» 

ACTGGATaC c 


GTAGCGAACC 


CCATGAAAGA 


ttatttgcat 


TATTGATTAA 


7020 




u AAAliA 1 KjKj 1 


aaacaagtac 


TATTTTGTCC 


AAAAATGGAA 


gtcgaagaag 


TCAAAGCATC 


7080 


25 


Av~L 111 LALA 


/-»rf>^ T\ A A rp/^/i 

CjOrGAAATCG 


TTGGATATTT 


AGACACTGAA 


aacccttttt 


CACTTT AT C C 


7140 






A A A A A T**T» A /~» 

AA 1 AAATTAC 


TAATTGAAAG 


CGAGCACTTA 


acagtagcac 


GCCAAAAACA 


7200 




nl 1 AA1 C 1 V- 1 


<j<j 111 L.AA1 G 


TCAATTCATT 


CGGAGATGTT 


gatttaacaa 


TCAAACAATT 


7260 


30 


oAVjAAAIAI 1 


AAA1 LCbAAG 


ATGAAATTAG 


CAAAATACGT 


aaagctgctg 


AGTTAGCAGA 


7320 




lAAtjlljlAlC- 


AAA A POTV^ 

CjAAATAGGTG 


TTTCTTATTT 


AAAAGAAGGT 


gtgactgaat 


GTGAAGTAGT 


7380 




PA A PP AT ATT 


bAb t-AAACT A 


TCAAACAATA 


TGGCGTCAAT 


gaaatgagtt 


TTGATACGAT 


7440 


35 


VjO 111 1 Ai 1 1 




CCGCATCACC 


TCATGGCACA 


ccaggagatc 


GCAGATTAAA 


7500 






T A TPT A r*T A T 

1 A1U1AC1A1 


TTGATTTAGG 


TGTAATTTAT 


gagcattatt 


GTAGCGATAT 


7560 


40 


P. AP A PPT APT 


A ' 1 *" I ' AAA * T w 1 ** T *0 

A J. 1 AAA! 1 Ha 


GTGAACCTAG 


CAAAGAAGCA 


caagaaattt 


ATAATATTGT 


7620 


ATT AP. A A PP A 


p a a ariTPT^ 


CAATCCAAGC 


AATTAAACCT 


ggaataccat 


TAAAAGATAT 


7680 




PP. ATP AT ATP 


/"* /""T 1 A C* A A A T A 

tjC I Atj AAA 1 A 


TTATTTCAGA 


AAAAGGTTAT 


ggtgaatatt 


TCCCTCATCG 


7740 


45 


PTT A Pj^Tr" A T* 




TACAAGAACA 


TGAATATCAA 


gatgtttcaa 


GTACTAATTC 


7 8 00 




T A aTTTPTTR 

1 AMI I ivj 1 1 A 


bAAbL 1 \j<jUA 


TGGTTATTAC 


AATCGAACCA 


GGTATTTATG 


TACCTGGTGT 


7860 




TGCAGGTGTA 


AGAATTGAAG 


ATGACATACT 


TGTCACTAAT 


GAAGGATATG 


AAGTATTAAC 


7920 


50 


ACATTACGAA 


AAATAAGGAG 


TGGGATAAAA 


ATGAAAAGCT 


TGTTACAAGC 


GCATTCTCAT 


7980 




TCAGTCAAAC 


ACTGCCAATA 


TAACATTGTA 


GCGCCTAAGA 


CATAAATTTT 


TATCCAAGTC 


8040 
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ATCTAATGGC GTCATTATAT CTTGAACTAT TAAGATATCT TTTCGTATTT TCTGATTAAA 4 56 0 

AAGTGCTTTG TTGATAATAT TTGCAACTAG GAATGTATCA TAACTTGATG ATAGAACAGG 4620 

TAAATCATGT TCATTCGCAA AATTAATAAC TTTATTAGAT GGCTTAAATC CACCAGTAAT 4 6 80 

TAATATAGCC GTACCTCTTT TTAAAGCTTC AATCTGCACA TCTTCACGAT TTCCGACAAT 474 0 

CAATAATGTC TTTGGACCAA TATACTTTAA AATATCTTTG AGTTCCATTG CTCCAATTGC 4800 

AAATTTAGAT ACCATCTTAG TGATACCTTT GTTGCCACCT AACACTTGGC CATCAATAAT 4 860 

ATTGACAATT TCATTAAAAG TTAAATGTTC AATTTCATTA CGATTACGTT TTTCGATTCG 4 920 

AACCGTACCA ACACGATCTA TCGTTGCGAC CATGCCCATT TTATCAGCAT CTTTmATTGc 4 98 0 

ACGATATGCT GTCCCytCaG ATACGTTTAA AAATTTAGCG ATTTTACGCA CCGAAATTTT 504 0 

AGAG CCTATA GATAACGATT CAATATAATC TAAAATTTGT TCATGTTTTG TCATTCTTTA 5100 

CCTCTTCTTT TCGAACAGTA TTAACTACAT TATAACTTTA TTTTGGATAA AAAGCATTGA 5160 

AGTGAAATGA AATAATGATC GTTtCACCTA TTTTATTTTT TGAAAATATA CAACAAACAC 5220 

AAAGATCACA AAATCTTTAA TTTTAAATGG AAAAATCCAT TATTATTTAT TAGAATGTAA 5280 

GTGAGGAGGG ATGTACTAAT GTATAAAAAT ATATTACTTG GTGTAGACAC TCAGTTAAAA 534 0 

AATGAAAAAG CACTAAAAGA AGTGTCTAAA TTAGCTGGCG AAGGTACAGT CGTAACAGTT 5400 

TTAAACGCAA TCAGCGAACA AGaTGCTCAA GCATCAATTA AAGCAGGTGT TCATTTAAAC 5460 

30 AAACTTACTG AAGAACGAAG CAAGCGATTG GAAAAAACAC GCAAAGCTTT AGAAGATTAT 5520 

GGTATTGATT ATGACCAAAT AATTGTTCGT GGTAATGCAA AAGAAGAACT ATTAAAACAT 5580 

GCTAATAGCG GTAAATATGA AATTGTTGTT TTAAGTAACC GTAAAGCAGA AGACAAAAAG 564 0 

35 AAATTTGTAC TTGGAAGTGT CAGCCACAAA GTAGCAAAAC GTGCGACTAT CCCTGTATTA 5700 

ATCQTTAAAT AAAATTTTTA TCCAGAATCA CAAATAATCT TTCAATCATG ATGCAGTCTC 576 0 

AAACGACTGA GTAAATACAA GAAACGATTA TGACTGTGGT TCTGGATTTT TTATATCGTA 582 0 

GTAAATTTAT AATCAATGTC TAATTGTATA AAACTAAAAT TACGAGAGTA GGTCAGAAAT 5880 

GATAAAGAAC CACTGATGTC CCCCGTCCAC GTCGTAACTG AATCAGTAGA ATATAAAAAC 594 0 

ACCCACTAAA AATATGCAGA CGATAACTTC CACATAGATT AGCGAGGTGT TTTTTAGTGT 6000 

AAAAT CTATA TTCTATTTAA AACTGAACAG ATTCACCTGG TTTTAAAATT TGCACGTCCC 606 0 

CTACATTAAC AGCATCTTTA AATTGTTGTG GATCTTGTTC GATTAATGGG AATGTATCAT 6120 

AATGAATCGG TACAGAAATT TTTGGTTTAA TAAATTCATT AATAGCATAA CTTGCATCAT 6180 

CAATACCCAT CGTAAAATTA TCTCCAATTG GTACAAAACA TACATCAACT GGATGACGTT 624 0 
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CAATATCATT 


AATAATCAAT 


TGCCCTTTAG 


AACGTAATCG 


ACATCTGATT 


TCATTACCTT 


2760 


CATCGACTGC 


AAATACCCAT 


ATTTTCAAGC 


CTTTGATGTC 


AGCAATTGTA 


TTAACAAACT 


2820 


GAGATGCTTC 


ATTTGGCTGA 


ATACCGAATT 


GCTCCAATAC 


ATCTTCAGTT 


ATTTTAACTT 


2880 


GGCAGAATCC 


ATCATCCATA 


AGTTCGAAAT 


GTTGTAAAAC 


ATAACCTTGA 


AACGGCAACA 


2940 


TTTTTGGGTC 


CTTCTCCATC 


ATTTTATTTA 


AAAGCGCATT 


ATGATCAATA 


TCATGCCCAA 


3000 


TTAACTTTCC 


AGCAATTTCC 


ATAGTATGTT 


CTGAGGTATT 


GTTAAAAAGG 


AATCGCCCAG 


3060 


TATCACCGAC 


GATACCAAGA 


TATAAAACGC 


TCGCGATATC 


TTTATTAACA 


ATTGCTTCAT 


3120 


CATTAAAATG 


TGAGATTAAA 


TCGTAAATGA 


TTTCACTTGT 


AGATGACGCG 


TTCGTATTAA 


3180 


CTAAATTAAT 


ATCACCATAC 


TGATCAACTG 


CAGGATGATG 


ATCTATTTTA 


ATAAGTTTAC 


3240 


GACCTGTACT 


ATAACGTTCA 


TCGTCAATTC 


GTGGAGCATT 


GGCAGTATCA 


CATACAATTA 


3300 


CAAGCGCATC 


TTGATATGTT 


TTATCATCAA 


TGTTATCTAA 


CTCTCCAATA 


AAACTTAATG 


3360 


ATGATTCCGC 


TTCACCCACT 


GCAAATACTT 


GCTTTTGCGG 


AAATTTCTGC 


TGAATATAGT 


3420 


ATTTTAAACC 


AAGTTGTGAA 


CCATATGCAT 


CAGGATCTGG 


TCTAACATGT 


CTGTGTATAA 


3480 


TAATTGTATC 


GTTGTCTTCG 


ATACATTTCA 


TAATTTCATT 


CAAAGTACTA 


ATCATTTTCA 


3540 


TACTCCCTTT 


TTTAGAAAAG 


TTGCTTAATT 


TAAGCATTAG 


TCTATATCAA 


AATATCTAAA 


3600 


TTATAAAAAT 


TGTTACTACC 


ATATTAAACT 


ATTTGCCCGT 


TTTAATTATT 


TAGATATATA 


3660 


TATTTTCATA 


CTATTTAGTT 


CAGGGGCCCC 


AACACAGAGA 


AATTGGACCC 


CTAATTTCTA 


3720 


CAAACAATGC 


aAGTTGGGGT 


GGGGC CCCAA 


CGTTTGTGCG 


AAATCTATCT 


TATGCCTATT 


3780 


TTCTCTGCTA 


AGTTCCTATA 


CTTCGTCAAA 


CATTTGGCAT 


ATCACGAGAG 


CGCTCGCTAC 


3840 


TTTGTCGTTT 


TGACTATGCA 


TGTTCACTTC 


TATTTTGG CG 


AAGTTTCTTC 


CGACGTCTAG 


3900 


TATGCCAAAG 


CGCACTGTTA 


TATGTGATTC 


AATAGGTACT 


GTTTTAATAT 


ACACGATATT 


3960 


TAAGTTCTCT 


ATCATGACAT 


TACCTTTTTT 


AAATTTACGC 


ATTTCATATT 


GTATTGTTTC 


4020 


TTCTATAATA 


CTTACAAATG 


CCGCTTTACT 


TACTGTTCCG 


TAATGATTGA 


TTAAAAGTGG 


4080 


TGAAACTTCT 


ACTGTAATTC 


CATCTTGATT 


CATTGTTATA 


TATTTGGCGA 


TTTGATCGTT 


4140 


AATTGTTTCA 


CCCATCTGAG 


GCTGTCTTCC 


TAAAAGTTGC 


ATAGACTTTA 


AAACATCTTG 


4200 


TCTATTAATC 


ACACCCACTG 


TCTTTTTATT 


ACTCGAAACG 


ACAGGAATCA 


ATTCAATACC 


4260 


TTCCCAAATC 


ATCATATGCG 


CACAACTTGC 


TACTGTACTC 


ATAGCATTTA 


CATAAATAGG 


4320 


ATTTCGCGTC 


ATCACTTTAT 


CTATTTCGTC 


GTCGTCCTTT 


GTATTAATCA 


TCTCTCGACT 


4380 


TGTTACAATA 


CCTACTAATT 


TATACGACTC 


ATTGACTACC 


GGAAATCTTG 


TATGGCCAGT 


444 0 
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ACACTTCTTA CACCGTCAGA CTCTAATTGG AATATGCCAG TCGTATCTCC TTGCGACAAC 960 

AATTCAAACA CTTTTTGATC ATCAAACGGA ATCTTTTCGA TATCAATATT AATACCTAAA 102 0 

TCTTTTTTGA CTTGTGTTAA GATTTGATGA ATAATCGATA AGTTTCTCAA CCCTAGAAAA 10 80 

TCTATTTTTA ATAACCCAAT ACGTTCGGCT TCAGTCATTG TCCATTGCGT TAATAATCCT 114 0 

GTATCCCCTT TCGTTAAAGG GGCATATTCA TATAATGGAT GGTCATTAAT AATAATTCCT 1200 

GCCGCATGTG TAGATGTATG TCTTGGTAAA CCTTCTAACT TTTTACAAAT ACTGAACCAG 1260 

CGTTCATGTC GATGGTTTCG ATGTACAAAC TCTTTAAAAT CGTCAATTTG ATATGCTTCA 132 0 

TCAAGTGTAA TTCCTAATTT ATGTGGGATT AAACTTGAAA TTTCATTTAA TGTAACTTCA 1380 

TCAAACCCCA TAATTCTTCC AACATCTCTA GCAACTGCTC TTGCAAGCAG ATGACCGAAA 1440 

GTCACAATTC CAGATACATG TAGCTCGCCA TATTTTTCTT GGACGTACTG AATGACCCTT 1500 

TCTCGGCGTG TATCTTCAAA GTCAATATCA ATATCAGGCA TTGTTACACG TTCTGGGTTT 1560 

AAAAAACGTT CAAATAATAG ATTGAATTTA ATAGGATCAA TCGTTGTAAT TCCCAATAAA 1620 

TAACTGACCA GTGAGCCAGC TGAAGAACCA CGACCAGGAC CTACCATCAC ATCATTCGTT 1680 

TTCGCATAAT GGATTAAATC ACTTACTATT AAGAAATAAT CTTCAAAACC CATATTAGTA 1740 

ATAACTTTAT ACTCATATTT CAATCGCTCT AAATAGACGT CATAATTAAG TTCTAATTTT 1800 

TTCAATTGTG TAACTAAGAC ACGCCACAAA TATTTTT T AG CTG ATT CATC ATT AGGTGT C 1860 

30 TCATATTGAG GAAGTAGAGA TTGATGATAT TTTAATTCTG CATCACACTT TTGAGCTATA 192 0 

ACATCAACCT GCGTTAAATA TTCTTGGTTA ATATCTAATT GATTAATTTC CTTTTCAGTT 1980 

AAAAAATGTG CACCAAAATC TTCTTGATCA TGAATTAAGT CTAATTTTGT ATTGTCTCTA 204 0 

ATAGCTGCTA ATGCAGAAAT CGTATCGGCA TCTTGACGTG TTTGGTAACA AACATtTTGA 2100 

ATCCAAACAT GTTTTCTACC TTGAATCGAA ATACTAAGGT GGT CCATATA TGTGTCATTA 2160 

TGGGTTTCAA ACACTTGTAC AATATCACGA TGTTGATCAC CGACTTTTTT AAAAATGATA 2220 

ATCATATTGT TAGAAAATCG TTTTAATAAT TCAAACGACA CATGTTCTAA TGCATTCATT 2280 

TTTATTTCCG ATGATAGTTG ATACAAATCT TTTAATCCAT CATTATTTTT AGCTAGAACA 2 34 0 

ACTGTTTCGA CTGTATTTAA TCCATTTGTC ACATATATTG TCATACCAAA AATCGGTTTA 24 00 

ATGTTATTTG CTATACATGC ATCATAAAAT TTAGGAAAAC CATACAATAC ATTGGTGTCA 2460 

GTTATGGCAA GTGCATCAAC ATTTTCAGAC ACAGCAAGTC TTACgGCATC TTCTATTTTT 2520 

AAGCTTGAAT TTAACAAATC ATAAGCCGTA TGAATATTTA AATATGCCAG CATGATTGAA 2580 

TGGCCCCTTT CTATTAGTTA AGTTTTGTGC GTAAAGCTGT AGCAAGTTGC TCAAATTCAT 2 64 0 
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GAATCAAACC GGCCATGTCG AAGCAGTACA AATTACGTTT GATCCAGAGG TTACTTCCTT 54 60 

TGAAAATATA TTAGACATAT ATTTCAAAAC ATTTGACCCA ACTGATGATC AAGGGCAATT 5520 

5 

TTTCGATAGA GGCGAAAGCT ATCAACCAGT CATTTTCTAT CATGATGAAC ATCAGAAAAA 55 8 0 

GGCTGCTGAG TTTAAAAAGC AACAATTAAA TGAACAAGGT ATTTTCAAGA AACCAGTGAT 564 0 

TACACCTATT AAA C CAT AT A AAAATTTCTA TCCAGCTGAA GACTACCATC AAGATTATTA 57 00 

10 

CAAAAAGAAC CCGGTACATT ATTACCAATA TCAACGTGGT TCAGGTAGAA AAGCGTTTAT 57 6 0 

AGAAT CACAT TGGGGGAATC AAAATGCTTA AAAAAGATAA AAGTGAACTA ACAGATATAG 582 0 

15 AATATATTGT TACACAAGAn AACGGCACTG AACCACCATT TATGAATGAA TATTGGAATC 58 80 

ATTTTGCTAA AGGATTTATG TAGATAAAnT TCnGGTAAAC CTTG 5924 
(2) INFORMATION FOR SEQ ID NO: 131: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9280 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 131: 

GGCCGTTnAA AATCTCCAAA ATAnAAAAAC CCATCTTGTT CCAATGTTTT AAAATCGCCa 6 0 

30 

TCCaACACTT GaTCaATAGC TTGCAACAAC GTTGAACGTG TTTTaCCAAA AGCATCaAAC 120 

GCTCCCACTA AAATCAGTGC TTCAAGTAAC TTTCTCGTTT TGACTCTCTT CGGTATACGT 18 0 

35 CTAGCAAAAT CAAAGAAATC TTTAAATTTG CCGTTCTGAT AACGTTCATC AACAATCACT 24 0 

TTCACACTTT GATAACCAAC ACCTTTAATT GTACCAATTG ATAAATAAAT GCCTTCTTGG 3 00 

GAAGGTTTAT AAAACCAATG ACTTTCGTTA ATGTTCGGTG GCAATATAGT GATACCTTGT 36 0 

40 TTTTTTGCTT CTTCTATCAT TTGAGCAGTT TTCTTCTCAC TTCCAATAAC ATTACTTAAA 42 0 

ATATTTGCGT AAAAATAATT TGGATAATGG ACTTTTAAAA AGCTCATAAT GTATGCAATT 48 0 

TTAGAATAGC TGACAGCATG TGCTCTAGGA AAACCATAAT CAGCAAATTT CAGAATCAAA 54 0 

*° TCAAATATTT GCTTACTAAT GTCTTCGTGA TAACCATTTT GCTTTGCACC TTCTATAAAA 60 0 

TGTTGACGCT CACTTTCAAG AACAGCTCTA TTTTTTTTAC TCATTGCTCT TCTTAAAATA 66 0 

TCCGCTTCAC CATAACTGAA GTTTGCAAAT GTGCTCGCTA TTTGCATAAT TTGCTCTTGA 72 0 

50 

TAAATAATAA CACCGTAAGT ATTTTTTAAT ATAGGTTCTA AATGCGGATG TAAATATTGA 78 0 
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TGAATCACAT CCAGCAATAA AAGCTCCAAT AGCAGTGTAG TCATTGCATA GTTAGCTAAC 3 66 0 

CATATAGACA TCAAAATGAC ATCATAGTAT TTTCAAGTGC AAAAAAGTAC TTTTTTGTGT 372 0 

TAAACGTTTT CATAAATTAT GCAAAATCAT TATTTCTATC ACACTTTATG ATAAAAATTG 3 78 0 

TGTTAAATTA AAGATAACTT AGTAATAAAA AATGAAATGA TAGAAGAAGG AGGATAATTA 384 0 

TGACTTTATC CATTCTAGTt GCACATGACT TGCAACGAGT AATTGGTTTt GAAAATCAAT 3900 

TACCTTGGcA CCTACCAAAT GATTTGAAGC ATGTTAAAAA ATTATCAACA GGTCATACTT 3960 

TAGTAATGGG TCGTAAGACA TTTGAATCGA TTGGTAAACC ACTACCGAAT CGTCGAAATG 4 02 0 

TTGTACTTAC TTCAGATACA AGTTTCAACG TAGAnGGCGT TGATGTAATT CACTCTATTG 40 8 0 

AAGATATTTA CCAACTACCG GGCCATGTTT TCATATTTGG AGGGCAAACA TTATTTGAAG 414 0 

AAATGATTGA TAAAGTGGAC GACATGTATA TTACTGTTAT TGAAGGTAAA TTCCGTGGTG 420 0 

20 ATACGTTCTT TCCACCTTAT mCATTkGAgr CTGGGAAGTT GCCTCTTCAG TTGAAGGTAA 4260 

ACTAGATGAG AAAAATACAA TTCCACATAC CTTTCTACAT TTAATTCGTA AAAAATAAGG 4 320 

GGG AAAACGA CCATGACAAA ACAGATTATA GTAACAGACT CAACATCCGA TTTAT CTAAA 43 80 

GAATACTTAG AAGCAAACAA CATTCATGTA ATTCCTTTAA GTTTAACTAT TGAAGGAGCT 444 0 

TCATACGTTG ACCAAGTAGA TATTACATCA GAAGAATTTA TTAATCATAT TGAAAATGAT 450 0 

GAAGATGTAA AGACAAGTCA GCCAGCCATA GGTGAATTTA TATCTGCTTA TGAAGAACTA 4 56 0 

GGAAAAGATG GCTCTGAAAT CATAAGTATT CATCTTTCTT CAGGATTAAG TGGTACATAT 4 62 0 

AACACTGCTT ACCAAGCAAG TCAAATGGTA GATGCTAATG TAACTGTTAT TGATTCAAAA 4 680 

TCTATTTCTT TTGGTTTAGG GTATCAAATA CAACACCTAG TAGAGCTTGT AAAAgAaGGT 4 74 0 

GtCTCAACTT CTGAAATAGT TAAAAAGTTA AATCATTTAA GAGAAAACAT TAAATTATTT 4 800 

GTAGTTATAG GGCAATTGAA TCAATTAATT AAAGGTGGCA GAATTAGTAA AACAAAAGGT 4 86 0 

TTGATTGGTA ATCTTATGAA AATTAAACCA ATTGGTACAC TAGATGATGG TCGCTTAGAG 4 920 

CTTGTGCmCA ATGCGAGAAC TCaAAATTCk AGTATCCAAT ACTTGAAAAA GGAAATTGCT 4 980 

GAATTTATAG GAGATCATGA AATCAAATCC ATTGGTGTCG CACATGCTAA CGTCATTGAA 504 0 

45 TATGTTGATA AATTGAAGAA AGTTTTTAAT GAAGCTTTTC ATGTGAATAA TTACGATATA 5100 

AATGTAACTA CACCAGTTAT TTCTGCACAT ACTGGTCAAG GTGCGATTGG CCTCGTAGTC 5160 

CTTAAGAAGT AAATTTAATC TTTTCAGTGT TAATTACTTC CATTTCAATC CTTTATAGAC 522 0 

TAAATTTATA ATTAGATAGA TAGAGGAGGT AATTCATATG ACAAAAGAAT ATGCAACATT 528 0 

AG CAGG AGGA TGTTTCTGGT GCATGGTTAA AC CATTTACA TCATATCCAG GCATCAAGTC 5340 
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AATTTAATTG 


GAGGAATTAA ATATGAATGC 


ATATGATGCT 


TATATGAAAG 


AAATTGCGCA 


1860 


ALAAA IAjCG i 


GGCGAATTAA 


CTCAAAATGG 


TTTTACAAGT 


TTAGAAACGA 


GCGAACAGct 


1920 


A 1 ^-VjVjALj I AI 


ATGAACCAAG 


TAAATGCTGA 


TGACACTACT 


TTTGTAGTTA 


TTAACTCTAC 


1980 


Rfpn r^r~\{~* ot*/"* t 1 
A1ULU<jL 1G 1 


GCAGCTGGAT 


TAGCAAGACC 


AGCTGCAGTA 


GCAGTTGCAA 


CACAAAATGA 


2040 


AtJA lAGACCT 


ACAAATACAG 


TTACAGTTTT 


TGCTGGGCAA 


GATAAAGAAG 


CAACTGCTAC 


2100 


AAI LidaAtjAA 


TTCATTCAGC 


AAGCACCATC 


TAGTCCTTCG 


TATGCTTTAT 


TCAAAGGTCA 


2160 


AUATTTAGTT 


TATTTTATGC 


CTAGAGAATT 


TATCGAAGGT 


AGAGATATTA 


ATGACATTGC 


2220 


AATGGACTTA 


AAGGATGCCT 


TTGACGAAAA 


TTGTAAATAG 


TACACATAAA 


TAAATATAAA 


2280 


GGTTAACACA 


TTTTATAATA 


TTAAAAATGG 


TGTCTGTCAT 


TGAAAATAGA 


GAATATAGTT 


2340 


GTATTCTATT 


TGTTAAATAA 


AGTCCGTTTT 


TACCaACTAT 


ATTTTCTAGA 


AATTTAACTG 


2400 


TTTTAATAGG 


ACATCAAACA 


TAATATT r*aA 


ATCaTGTGTT 


AACCTCTTTT 


TTAAAATTTT 


2460 


TTAGCATTAA 


AGTTATAGAT 

riij x x n x nun x 




AATTACCAAT 


TGGAAACATA 


TATCACGTTA 


2520 


CGATGGGGTA 


GGTACTTAAT 


CAGCATTTTA 


TAAATAAAGT 


AACGGAATTC 


ATGATATTAA 


2580 


TATCATATTC 


CTAAAATGAG 


TGATAACAAA 


ATGCTACATA 


AAGTTAAGTT 


ATATCAAACT 


2640 


AAATATACAT 


ACTATAAATA 


ATGAAAATGA 


GGTGTTATCG 


CATATGTTGA 


ATTCATTTGA 


2700 


TGCAGCATAT 


CACAGTCTTT 


GTGAAGAAGT 


TTTAGAAATA 


GGAAATACAC 


GAAATGATCG 


2760 


CACAAATACA 


GGTACGATTT 


CGAAATTTGG 

V W^W^ XXX V7V7 


TCATCAACTT 


CGCTTTGACT 


TATCTAAAGG 


2820 


ATTTCCACTA 


TTAACGACAA 


AGAAAGTTTC 


TTTTAAATTA 


GTAGCAACCG 


AATTATTATG 


2880 


GTTCATTAAA 


GGAGATACAA 


ACATCCAATA 


CTTATTAAAA 


TATAATAATA 


ATATATGGAA 


2940 


CGAATGGGCT 


TTTGAAAATT 


ATATCAAATC 


AGACGAGTAT 


AAAGGTCCAG 


ATATGACAGA 


3000 


TTTCDGGCAT 


CGTGCATTGA 


GTGATCCTGA 


ATTTAACGAA 


CAATATAAAG 


AACAAATGAA 


3060 


ACAATTTAAG 


CAACGTATTC 


TTGAAGATGA 


TACATTTGCG 


AAGCAATTCG 


GGGATTTAGG 


3120 


AAATGTTTAT 


GGTAAACAAT 


GGCGAGATTG 


GGTTGATAAA 


GATGGTAATC 


ATTTTGATCA 


3180 


ACTTAAAACA 


GTAATTGAAC 


AAATTAAGCA 


TAATCCAGAT 


TCAAGGCGAC 


ACATCGTATC 


3240 


TLjCATGGAAT 


CCAACAGAAA 


TTGATACAAT 


GGCACTTCCG 


CCTTGTCATA 


CCATGTTCCA 


3300 


GTTTTATGTC 


CAAGATGGTA 


AGTTAAGTTG 


CCAGTTATAC 


CAACGTAGCG 


CAGATATCTT 


3360 


TTTAGGTGTG 


CCATTTAATA 


TCcGCagctA 


CGCTTTATTG 


ACACACCTTA 


TTGCCAAAGA 


3420 


ATGTGGACTT 


GAAGTGGGTG 


AATTTGTGCA 


TACATTTGGA 


GATGCACATA 


TTTATTCAAA 


3480 
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TAACCCCATT TTACCTGGAA AAATCgTTTG CGATGCaATm GCaTTtGaAT ATAaATACAT 6 0 

TTTACGTATa GAATTATAAA AgGTTTCATT CaAATCTTAG GGTCAAAAAT GTTATAATAT 120 

TTTTATGTCA AATTTAAAAC AGTAACACTT ATTTACAAGG TTGCAATATT TTGAAGTAAT 180 

AAAGGAAGTG TCGCGTATTT TAACTTTTTC AGAGCAAAAT GCACTCGCGA AAATAGATGA 240 

TTTAATGAAT ACTTATTGCA ATCAATGTCC AATCAAAACT CGTCTGCGTA AATTAGAGGG 3 00 

GAAAACGAAG GCGCATCATT TTTGTATCAA TGAGTGTTCA ATAGGGAAAG AAATAAAACA 360 

ATTAGGAAAT GAACTTCAAT AGGAGGAAGT CAAATGAAAA TTATATCTAT ATCAGAAACA 420 

CCGAACCACA ACACAATGAA GATTACACTT AGTGAAAGCA GAGAAGGTAT GACATCAGAT 4 80 

ACGTATA CTA AAGTTGATGA TTCACAGCCA GCATTTATTA ATGACATCTT AAAGGTTGAA 54 0 

GGCGTTAAAT CAATTTTCCA TGTTATGGAC TTTATTTCAG TAGATAAAGA AAATGACGCA 6 00 

20 AATTGGGAAA CAGTATTG C C AAAAGTAGAG GCTGTATTCG AATAAATTTT TCATCAACTA 660 

GTATTCGGGG GGAATAAAGT ATATGGAAAT TTTACGTATA GAGCCAACAC CAAGTCCAAA 720 

TACAATGAAA GTTGTTTTGT CATATACAAG AGAAGACAAG TTATCTAATA CTTATAAAAA 780 

AGTAGAAGAA ACACAACCAA GATTTATAAA TCAGTTGTTA TCTATAGATG GTATCACTTC 84 0 

CATTTTTCAT GTCATGAACT TCTTAGCTGT TGATAAGGCA CCAAAAGCTG ATTGGGAAGT 900 

CATATTACCT GATATTAAAG CTGCTTTTTC TGATGCGAAT AAGGTTTTAG AATCTGTAAA 960 

TGAACCTCAA ATTGACAATC ATTTTGGTGA AATTAAAGCT GAATTATTAA CTTTTAAGGG 1020 

TATACCGTAT CAAATTAAGC TAACTTCTGC TGACCAAGAA TTAAGAGAAC AATTACCACA 108 0 

AACATATGTT GAC CATATGA CTCAAGCGCA AACAGCACAT GACAATATTG TTTTTATGCG 1140 

TAAATGGCTA GATTTAGGAA ATCGCTATGG AAATATTCAA GAAGTAATGG ATGGTGTCCT 1200 

AGAAGAAGTG CTAGCTACCT ATCCAGAATC ACAGTTACCC GTATTGGTAA AACATGCTTT 1260 

AGAAGAAAAT CACGCAACTA ATAATTATCA TTTCTATCGA CATGTCTCTT TGGATGAATA 1320 

TCATGCAACT GATAATTGGA AGACTCGATT ACGAATGTTA AACCATTTTC CAAAGCCGAC 1380 

TTTTGAAGAT ATACCGCTGC TTGATTTAGC TTTATCTGAT GAAAAAGTAC CGGTTAGACG 144 0 

45 TCAAGCGATT GTATTATTAG GTATGATTGA AAGTAAAGAA ATTTTACCGT ATTTATATAA 1500 

GGGGCTTCGT GATAAAAGTC CTGCTGTAAG AAGAACAGCA GGGGATTGCA TAAGCGATTT 1560 

AGGGTATCCA GAGGCACTAC CAGAAATGGT GCTACTATTA GATGATCCAC AGAAAATCGT 1620 

50 

TAGGTGGCGT GCTGCTATGT TTATCTTTGA TGAAGGTAAT GCAGAGCAGC TTCCCGCACT 1680 

AAAAGCCCAT ATTAATGACA ATGCGTTTGA AGTTAAATTA CAAATTGAAA TGGCCATATC 174 0 
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ACACCTATAA 


A PA A A PP A PCI 


TOO r oop r 
1 LLaLGGACT 


TCTTTAATTG 


ATGG ATG AT C 


AATTTGCTTT 


4260 


AATTGTTCTT 


TAAAATAATP 


TPPT A hWi^t 
1 LL 1AAH LI 


R I\ OOO O 

AAAGAGLGGl 


lTGGTAAATl 


^ «p/*i R TOR R 

CTCAT LAACG 


4320 


ATAACATCTA 


ATGCAGP AA T 


X 1 \j LAGLA 


o r r oo r r ptp 
LAAGLAAG X G 


GAI 1ALLALL 


R R R TVTTO R R 

AAA I Vj 1 1 GAA 


4380 


CCATGTGAGC 


PAGGTG T A A A 

V..TVJVJ 1U1 


PR ORTOTR RT 


ACTTCTTTAT 


lTG lT AATAl 


AACAGAAATT 


4440 


GGG AAGACTn 


PA PP A PPT 21 P 


1GL 1 1 1 ALL I 


R R IV TV IV fT* R H 

AAAATATAGA 


CATCAGGTTT 


TACATTATCC 


4500 


CAATC C GT AG 


PAAATAATTT 


A HC*f*C* A A OO a 


L L x AAT C L IXj 


CTTGGATTTC 


/"«rtt/^ iv it TV rry TV 

GT CAG CAATA 


4560 


AATAAGACAT 


T ATP/FTP ATP 


A P A T A A TTHT 


L i AAI HjL 1 1 


T CAAAT AT C L 


TTCTGGCGGT 


4 620 


ATATTTATAC 


PPGPTTPAPP 


TTP.A UTrnrT 

X X unn X X Ljvj 1 


1 L 1 AL 1 AAAA 


CTGCTGCAGT 


ATTTTCATTA 


4680 


ATTGCAGPTT 


TPAATP.PATP 


TAPATPTPPA 


a a Tv tt* a a (^ r n r r 
AAA1 LAAL 1 I 


TTCTAAATCC 


ATCTAATAAC 


4 74 0 


GGAPPATAAP 


P A rCTTCT.T A 


1 1L XGL I I L 1 


CjAACjA taatg 


AAACTGGCGC 


CATTGTT CGA 


4 800 


C CATGG AAP#T 


tappatta a a 


TPPA BTTBTT 

IGLAAlvjAi I 


TCTG CTTTAT 


Mill y^rp/^ T\ TV 

TTGG L.TCAAT 


TCCTTTAACA 


4860 


TCGTATC,PPP 


A P.PP.TPP.TP.P 
/vLjV^vj x L.Vj 1 


TP.PTTTP AAA 
x IjL ill LftAA 


Gv- 1L*111L1A 


CTG CTT CAG C 


ACCTG TATT c 


4920 


ATTPrHT A A A P. 


til 1 Al Li 1 1 


a cc**vcr*f* ROT 
AL L x G V- Unb T 


TTACAAATTT 


TTTCGTACCA 


TTCACCTAAG 


4 980 


TT ATP APT JIT 


PA A A APPAPO 


i GAAAL 1 AAA 


GTCACTTTAT 


CAGCTTGATC 


TTTTAATG CT 


5040 


TO A ATA ATTT 


TPP-TI ATY2TPT 
X LVjVjAI vj 1 L 1 


A i\sAL LI 1 Ltvj 


1 1 AACACSCGG 


TV TV T 1 TV T 1 /"*' 

AAT ATG CAG A 


m TV TV ^TV rrtTV fT^ /^l 

TAACATAT C C 


5100 


ATATATTTAT 


x GLL x i LAGG 


AIL 1 1 1 AALL 


CATAC CCCTT 


CAGCTTcTGa 


AATGaCAATT 


5160 


CIC, r A A TfST: T A 




1GL 1 LLG1 AA 


TGATTTGTTA 


TV/^f/^TV TV rr* TV TV rn 

ACTCAATAAT 


TTTTT CAGAT 


5220 


TT A P.T P A TP & 
x x rvtS x i**ri X V«A 


T A TPTPPPPT 
1A1L1 L.LLL 1 


1 1 1 LAI LAI 1 


T R T"7\ TV /^PUTT 

1 A rAACT ATT 


TV TV r+ R T^/^ TV TV TV 

ATACaTGAAA 


TV < mn TV m^«/^ TV TV 

CATTAT CCAA 


5280 


ATAATTAPAT 


T A fJTTTTP A A 
x *\\3 X X X X Lnn 


APP AP ATAPT 
AVjLAoA 1 AL 1 


r P r PT 4 ^0 A OO R R 

111 LLALLAA 


R R R R R TO R R 

AAAAGA I G AA 


R **P TV 7V O TV /"TT 1 TV 

AT AATC ACT A 


534 0 


APiTTTPATTA 


A A " [ — i — i 'PTPT A 


111 iLwiftilrtL 


LL 1 I ALA J. 1 1 


R T* R R TP R /"» R T» 

Al AA1GACAT 


AA TTAC TT AA 


54 00 


ATP.aTTAPA A 
r\ x *JOJ. x .r\v_ 


P.PAAAAPAAT 


1 G A 1 AA 1111 


ALAL 1 1 AATC 


R R R R HT*T\ T 1 

AAAAG TATAT 


mrT"T* TV / Mil TV TV /"I IV 

TTTACTAAGA 


5460 


ATA'I*TTTTAT 

AiAi. XXX XnX 


TTATA A AT AT 
x XAXAAA1A1 


TO A A A A OO A O 
1 LtAAAAL LnL 


TA AO A A R 

I AALAAA rTG 


ORTRORORRT 

LATACACAAT 


AT CATTAGTG 


552 0 


GTAACAGTTA 


AACACTTATT 


TATCTTTACG 


GGGTAATGGG 


TTAAAACCCT 


TnC ATT AAAA 


5580 


TTGGATGnCC 


ATAAAATTAG 


GG 








5602 


(2) INFORMATION FOR SEQ ID NO: 130: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TTGTCTGTAG AAATTGAGGA GCTAATTTCT CTGTGTCGGG GCTCCACCCC AACTTGCACA 24 6 0 

CTATTGTAAG CTGACTTTCC GCCAGCCTCT GTGTTGGGGC CCCGCCAACT TGCACACTAT 2 520 

TGTAAGCTGA CTTTCCACCA GCCTCTGTGT TGGGGCCCCG ACTATTTTTG AAAAGAGCGT 25 8 0 

GTTACACGGG CATTGTTTTA CAGTCAACTA CTGCTAAAAT AAAATTAACG AGCTTAGGGC 264 0 

TTTGTTTTCT GTCCCAAGCT CGTTAAATCA CATATGATAA TTAATTATGC CCAACCACGA 2 700 

TATCTAGCTG CTTCTGCTGT ACGTTTAATA CCTATGATAT ATGCTGCAAG TCTCATATCT 2 76 0 

ATTTTTCGGT TTTGAGACAA TTCGTAAATC GTATCAAATG CCGCTTCTAA TTTTTCACGT 282 0 

AGCTTTTCAT TAACTTCTTC TTCAGACCAA TAATAACCTT GATTATTTTG TACCCATTCG 28 8 0 

AAGTAAGAAA CCGTtACACC ACCAGCACTT GCTAATACGT CTGGAACTAA TAATATACCA 294 0 

CGTTCAGTTA AAATACGTGT TGCTTCTGGT GTTGTAGGTC CATTAGCAGC TTCAACAACG 3000 

20 ATACTAGCTT TAATATCATG TGCATTGTCT TCTGTAATTT GGTTTGAAAT AGCCGCTGGT 306 0 

ACTAAAATGT CACAATCTAA TTCAAACAAT TCTTTATTTG AGATTGTTTC TTCAAATAAA 3120 

TTTGTTACCG TACCAAAACT ATCACGACGG TCTAATAAAT AATCTATATC TAAGCCATTT 3180 

GGATCGTGTA ATGCACCGTA AGCATCAGAG ATACCTACAA TTTTTGCACC TAAATCATAT 324 0 

AAGAATTTAG CTAAGAAACT TCCGGCATTA CCGAAACCTT GAATAACAAC CTTGGCACCT 330 0 

TCAATTTGCA TATTACGACG TTTTGCAGCT TGTTCAATTG CAATAACTAC ACCTAGTGCA 336 0 

GTTGATCTGT CGCGTCCATG AGAACCACCC AATACAATTG GTTTACCTGT GATGAAACCT 3420 

GGTGAATTAA ATTTATCTAA TGCACTATAT TCATCCATCA TCCAAGCCAT AATTTGTGAG 348 0 

TTTGTAAATA CATCTGGTGC TGGAATATCT TTGTTCGGAC CTACGAATTG TGAAATTGCT 354 0 

CTTACATATC CGCGTGATAA ACGTTCAACT TCATGAATGC TCATTTGACG TGGATCACAA 3600 

ACGATACCAC CCTTACCACC ACCGTATGGT AAGTTTACAA TGCCACATTT CAAAGTCATC 366 0 

40 CACATTGATA ATGCTTTTAC TTCTTCTTCA TCAACATCTG GGTGGAAACG CACGCCCCCT 372 0 

TTTGTTGGTC CAACAGCATC ATTATGTTGC GCACGGTAAC CTGTGAATGT TTTTACTGTG 3 780 

CCATCATCCA TTCGTACAGG GATACGCACT TGTAACATTC TTAAAGGTTC TTTAATTAAA 384 0 

TCGTACATTC CTCCGTCAAA TCCCAATTTA TGCAATGCTT CTTTAATAAT TCCTTGAGTA 3 900 

GAAGTTACTA AATTATTGTT CTCAGTCATG ATCCTTTTCG CCTCTTCTTT ACCTAATGAT 3 96 0 

TTCGCTTTCA AACATATTGT AACATAACGT ATTCCTTTTT AAAGCCCTTA CAAACTGATT 402 0 

GTTACAACTT TTTGACATTA TTGAAATACA TGTCTTATTT TTTCAAGTGC AAGGTCCAAT 4 080 

TCTTCTTTAG TAATAATTAA TGGTGGTGCA AAACGAATGA CAGTATCATG CGTTTCTTTA 414 0 
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GAAGATAAAG AAGGTTTATT CGATGCTGTC CATACAATTA AAGGTTCTTT ACGTATTTTC 66 0 

GAAGGTATGA TTCAAACGAT GACAATTAAT AAAGAACGAC TCAATCAAAC TGTTAAAGAA 72 0 

GATTTTTCAA ATGCAACGGA ACTAGCAGAT TATTTAGTAA CTAAAAATAT TCCATTTAGA 78 0 

ACTGCACATG AAATTGTAGG AAAAATCGTC TTAGAATGTA TACAACAAGG TCATTATTTA 84 0 

TTAGATGTTC CTTTAGCAAC ATATCAACAA CATCATTCTA GTATTGATGC CGATATTTAC 900 
GATTATTTGC AGCCTGAAAA TTGTTTAAAA CGACGTCAAA GTTACGGTTC AACAGGTCAA 96 0 

TCATCGGTCA AACAACAACT TGATGTTGCT AAACAATTAC TATCACAATA AATACGTTAA 102 0 

TCTACCTACC CACAATGTCT ATTAAAATTA CATTGTGGGT ATTTTAATGC TCTCTTCGTC 1080 

TTGTTGAACA TCACATTTTT AAGATTCCTA AAATGTTTGA TAATTCTTTT AAATTTATAT 114 0 

TACAAAAATG TTATAAATTG TAAAAGAAAT GTGTAAAGCG TTTTCACAAG CAGGTTTTTG 12 00 

TAGTATTTTA AAATTGTTAG ACTACAAATA AAGAGATGAA AGGATAAAGA CTATGACTAA 12 SO 

CTCTTCGAAA AGCTTCACTA AATTTATGGC TGCTTCTGCT GTTTTTACTA TGGGATTTTT 13 2 0 

ATCAGTACCT ACTGCTGGCG CTGAACAAAC AAATCAAATT GCAAATAAAC CTCAGGCTAT 13 30 

TCAATGGCAT ACAAATTTAA CGAATGAGCG ATTCACTACT ATCGCACATC GTGGCGCAAG 14 4 0 

TGGCTATGCA CCCGAGCATA CGTTTCAAGC ATATGATAAG AGTCATAATG AGTTAAAAGC 1500 

ATCTTATATC GAAATTGATT TACAACGTAC CAAAGATGGC CATTTAGTTG CTATGCATGA 1560 

TGAAACTGTT AACCGTACAA CAAATGGACA CGGTAAAGTT GAGGATTATA CCCTTGATGA 1620 

ATTAAAACAG TTAGATGCAG GAAGTTGGTT TAATAAAAAA TATCCAAAAT ACGCAAGAGC 16 80 

AAGTTATAAA AATGCTAAAG TACCCACTTT AGATGAAATT TTAGAACGTT ATGGCCCGAA 174 0 

TGCAAACTAT TATATTGAAA CAAAGTCACC TGATGTATAC CCAGGAATGG AAGAACAATT 18 00 

ATTAGCTTCA TTGAAAAAGC ATCACCTTTT AAATAACAAT AAATTAAAAA ATGGACATGT 1860 

40 AATGATT CAA TCATTTTCTG ACGAAAGTTT AAAGAAAATT CATCGTCAAA ATAAGCATGT 1920 

GCCATTAGTA AAATTAGTTG ATAAAGGTGA ACTACAACAA TTTAACGACC AACGCTTAAA 19 80 

AGAGATACGC TCTTATGCGA TTGGATTAGG TCCTGATTAT ACAGATTTAA CTGAACAAAA 2 04 0 

45 

TAG C CAT CAT TTAAAAGACT TAGGATTTAT AGTACATCCT TATACAGTGA ATGAAAAAGC 2100 

TGATATGTTA CGATTAAATA AATATGGCGT TGATGGTGTC TTTACAAATT TCGCTGATAA 2160 

ATATAAAGAA GTCATTAAGT AGTAATGTTA AACTAGAAAA CATAAATACA AAAAT AT AG C 22 2 0 

TATTACTATA AAAAACAGCA GTAAGATATT TCCAAATTGA AATTATCCTA CTGCTGTCTT 22 8 0 
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CGAACAGAGC ATTCGAAGGC TTTAAAGTTG AACAAGTTAA AGACTTATTC TATGCAACAT 5280 

TCACTAAGTA TAATGACAAT ATCGATGCGG CTATCGTCTT CGAAAAAGTT GATTTAAATA 534 0 

5 

ATACAATTGG TGAAATTGCA CAAAATAACA ATTTAACTCA ATTACGTATT GCAGAAACTG 54 00 

AAAAATACCC TCACGTTACT TACTTTATGA GTGGTGGACG TAACGAGGAA TTTAAAGGTG 546 0 

AACGCCGTCG TTTAATTGAT TCACCTAAAG TTGCAACGTA TGACTTGAAA C CAGAAATG A 552 0 

10 

GTGCTTATGA AGTTAAAGAT GCATTATTAG AAGAGTTAAA TAAAGGTGAC TTGGACTTAA 558 0 

TTATTTTAAA CTTTGCTAAC CCTGATATGG TTGGACATAG TGGTATGCTT GAGCCGACAA 564 0 

15 TCAAAGCAAT CGAAGCGGTT GATGAATGTT TAGGAGAAGT GGTTGATAAG ATTTTAGACA 5700 

TGGACGGTTA TGCAATTATT ACTGCTGACC ATGGTAACTC TGATCAAGTA TTGACGGaTG 5760 

ATGATCAACC AATGACTACG CAwACAACGA ACCCAGTACC AGTGATTGTA ACAAAAGAAG 582 0 

20 GCGTTACACT TAGAGAAACT GGTCGCTTAG GTGACTTAGC ACCTACATTA TTAGATTTAT 58 80 

TAAATGTAGA ACAACCTGAA GATATGACAG GTGAaTCTTT AATTAAACAC TAATATTGTA 5940 

AAAGATGTTA AGTAAACGCT TAATGACACT TATTTTTTGA AAATAATAGT AATATCnTTT 6000 

25 

TGTTAAATGA AAGAATAAAG CTATAATAAT TATAGAATAA CTATTTAn 604 8 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 5602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 

AAAGAAGTGC AAGATATCAT CGCATTAATT AAGTCGTTAC AAAgTGTAAT TGTAGACaTC 60 

40 GCTTCCAATA ATGTTGATAC AATTATGCCT GGTTATACTC ATTTACAGCG TGCACAGCCA 12 0 

ATTTCATTTG CACATCATAT TATGACTTAT TTTTGGATGT TACAACGAGA CCAACAACGA 180 

TTTGAAGATA GTTTAAAACG AATCGATATT AATCCTTTAG GTGCAGCAGC CTTAAGTGGT 24 0 

45 ACCACATACC CTATCGATAG ACACGAGACA ACAGCATTGT TGAACTTTGG CAGTCTCTAT 300 

GAGAATAGCC TAGATGCTGT TAGTGACAGA GACTATATTA TTGAAACATT GCATAATATT 3 60 

TCTTTAACGA TGGTTCACTT ATCACGCTTT GCAGAGGAAA TTATTTTCTG GTCCACAGAC 420 

50 

GAAGCTAAAT TCATTACATT ATCAGATGCA TTTTCAACTG GCTCATCTAT TATGCCACAA 4 80 

AAGAAAAATC CTGATATGGC AGAATTAATT AGAGGTAAAG TTGGTCGAAC GACTGGTCAT 54 0 
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TCTTTAGGTT 


TTGAAAATGA 


CTTCACTCAT 


ATTTCAACTG 


GTGGCGGCGC 


GTCATTAGAG 


3480 


TACCTAGAAG 


GTAAAGAATT 


GCCTGGTATC 


AAAGCAATCA 


ATAATAAATA 


ATAAAGTGAT 


3540 


AGTTTAAAGT 


GATGTGGCAT 


GTTTGTTTAA 


CATTGTTACG 


GGAAAACAGT 


CACAAGATGA 


3600 


CATCGTGTTT 


CATCACTTTT 


CAAAAATATT 


TACAAAACAA 


GGAGTGTCTT 


TAATGAGAAC 


3660 


ACCAATTATA 


GCTGGTAACT 


GGAAAATGAA 


CAAAACAGTA 


CAAGAAGCAA 


AAGatTCGTC 


3720 


AATACATTAC 


CAACACTACC 


AGATTCAAAA 


GAAGTAGAAT 


CAGTAATTTG 


TGCACCAGCA 


3780 


ATTCAATTAG 


ATGCATTAAC 


TACTGCAGTT 


AAAGAAGGAA 


AAGCACAAGG 


TTTAGAAATC 


3840 


GGTGCTCAAA 


ATACGTATTT 


CGAAGATAAT 


GGTGCGTTCA 


CAGGTGAAAC 


GTCTCCAGTT 


3900 


GCATTAGCAG 


ATTTAGGCGT 


TAAATACGTT 


GTTATCGGTC 


ATTCTGAACG 


TCGTGAATTA 


3960 


TTCCACGAAA 


CAGATGAAGA 


AATTAACAAA 


AAAGCGCACG 


CTATTTTCAA 


ACATGGAATG 


4020 


ACTCCAATTA 


TATGTGTTGG 


TGAAACAGAC 


GAAGAGCGTG 


AAAGTGGTAA 


AGCTAACGAT 


4080 


GTTGTAGGTG 


AGCAAGTTAA 


GAAAGCTGTT 


GCAGGTTTAT 


CTGAAGATCA 


ACTTAAATCA 


4140 


GTTGTAATTG 


CTTATGAACC 


AATCTGGGCA 


ATCGGAACTG 


GTAAATCATC 


AACATCTGAA 


4200 


GATGCAAATG 


AAATGTGTGC 


ATTTGTACGT 


CAAACTATTG 


CTGACTTATC 


AAGCAAAGAA 


4260 


GTATCAGAAG 


CAACTCGTAT 


TCAATATGGT 


GGTAGTGTTA 


AACCTAACAA 


CATTAAAGAA 


4320 


TACATGGCAC 


AAACTGATAT 


TGATGGGGCA 


TTAGTAGGTG 


GCGCATCACT 


TAAAGTTGAA 


4380 


GATTTCGTAC 


AATTGTTAGA 


AGGTGCAAAA 


TAATCATGGC 


TAAGAAACCa 


ACTGCGTTAA 


4440 


TTATTTTAGA 


TGGTTTTG CG 


AACCGCGAAA 


GCGAACATGG 


TAATGCGGTA 


AAATTAGCAA 


4500 


ACAAGCCTAA 


TTTTGATCGT 


TATTACAACA 


AATATCCAAC 


GACTCAAATC 


GAAGCGAGTG 


4560 


GCTTAGATGT 


TGGACTACCT GAAGgACAAA TGGGTAACTC AGAAGTTGGT CATATGAATA 


4620 


TCGGTGCAGG 


ACGTATCGTT 


TATCAAAGTT 


TAACTCGAAT 


CAATAAATCA 


ATTGAAGACG 


4680 


GTGATTTCTT 


TGAAAATGAT 


GTTTTAAATA 


ATGCAATTGC 


ACACGTGAAT 


TCACATGATT 


4740 


CAGCGTTACA 


CATCTTTGGT 


TTATTGTCTG 


ACGGTGGTGT 


ACACAGTCAT 


TACAAACATT 


4800 


TATTTGCTTT 


GTTAGAACTT 


GCTAAAAAAC 


AAGGTGTTGA 


AAAAGTTTAC 


GTACACGCAT 


4860 


TTTTAGATGG 


CCGTGACGTA 


GATCAAAAAT 


CCGCTTTGAA 


ATACATCGAA 


GAGACTGAAG 


4920 


CTAAATTCAA 


TGAATTAGGC 


ATTGGTCAAT 


TTGCATCTGT 


GTCTGGTCGT 


TATTATGCAA 


4980 


TGGATCGTGA 


CAAACGTTGG 


GAACGTGAAG 


AAAAAGCTTA 


CAATGCTATT 


CGTAATTTTG 


5040 


ATGCCCCAAC 


TTATGCAACT 


GCCAAAGAAG 


GTGTAGAAGC 


AAGCTATAAT 


GAGGGCTTAA 


5100 
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ACAGTTGTTT CAGGTGCTTC ATGTACTACA AACTCATTAG CACCAGTTGC TAAAGTTTTA 16 80 

AACGATGACT TTGGTTTAGT TGAAGGTTTA ATGACTACAA TTCACGCTTA CACAGGTGAT 1740 

5 

CAAAATACAC AAGACGCACC TCACAGAAAA GGTGACAAAC GTCGTGCTCG TGCAGCGGCA 1800 

GAAAACATCA TCCCTAACTC AACAGGTGCT GCTAAAG CTA TCGGTAAAGT TATTCCTGAA 1860 

ATCGATGGTA AATTAGATGG TGGTGCACAA CGTGTTCCTG TAGCTACAGG TTCATTAACT 192 0 

w 

GAATTAACAG TAGTATTAGA AAAACAAGAC GTAACAGTTG AACAAGTTAA CGAAGCTATG 1980 

AAAAATGCTT CAAACGAATC ATTCGGTtAC ACTGAAGACG AAATCGTTTC TTCAGACGTT 2040 

15 GTAGGTATGA CTTACGGTTC ATTATTCGAC GCTACACAAA CTCGTGTAAT GTCAGTTGGC 2100 

GACCGTCAAT TAGTTAAAGT TGCAGCTTGG TATGATAACG AAATGTCATA TACTGCACAA 2160 

TTAGTTCGTA CATT AG CAT A CTTAGCTGAA CTTTCTAAAT AATTTTAGTA TAGTTTTTAT 2220 

20 TCAAATACGC TAGTGCTCAG AACTATTTAG CATTAATTAA AGCTTATGAG TAAGCGGGGA 22 80 

GCACAAACGC TTCTCCGCTT ATTTTTATAT AAAATTTCCT AATTACAAGG AGGAAACACC 2340 

ATGGCTAAAA AAATTGTTTC TGATTTAGAT CTTAAAGGTA AAACAGTCCT AGTACGTGCT 24 00 

25 GATTTTAACG TACCTTTAAA AGACGGTGAA ATTACTAATG ACAACCGTAT CGTTCAAGCT 24 60 

TTACCTACAA TTCAATACAT CATCGAACAA GGTGGTAAAA TCGTACTATT TTCACATTTA 252 0 

GGTAAAGTGA AAGAAGAAAG TGATAAAGCA AAATTAACTT TACGTCCAGT TGCTGAAGAC 2 580 

30 

TTATCTAAGA AATTAGATAA AGAAGTTGTT TTCGTACCAG AAACACGCGG CGAAAAACTT 2640 

GAAGCTGCTA TTAAAGACCT TAAAGAAGGC GACGTATTAT TAGTTGAAAA TACACGTTAT 2700 

35 GAAGATTTAG ACGGTAAAAA AGAATCTAAA AATGATCCAG AATTAGGTAA ATACTGGGCA 2760 

TCTTTAGGTG ATGTGTTTGT AAATGATGCT TTTGGTACTG CGCATCGTGA GCATGCATCT 2 820 

AATGTTGGTA TTTCTACACA TTTAGAAACT GCAGCTGGAT TCTTAATGGA TAAAGAAATT 2 8 80 

40 AAGTTTATTG GCGGCGTAGT TAACGATCCA CATAAACCAG TTGTTGCTAT TTTAGGTGGA 2 94 0 

GCAAAAGTAT CTGACAAAAT TAATGTCATC AAAAACTTAG TTAACATAGC TGATAAAATT 3 000 

ATCATCGGCG GAGGTATGGC TTATACTTTC TTAAAAGCGC AAGGTAAAGA AATTGGTATT 3060 

45 TCATTATTAG AAGAAGATAA AATCGACTTC GCAAAAGATT TATTAGAAAA AGATGGTGAT 312 0 

AAAATTGTAT TACCAGTAGA CACTAAAGTT GCTAAAGAAT TTTCTAATGA TGCCAAAATC 3180 

ACTGTAGTAC CATCTGATTC AATTCCAGCA GACCAAGAAG GTATGGATAT TGGACCAAAC 324 0 

50 

ACTGTAAAAT TATTTGCAGA TGAATTAGAA GGTGCGCACA CTGTTGTATG GAATGGACCT 33 00 

ATGGGTGTAT TCGAGTTCAG TAACTTTGCA CAAGGTACAA TTGGTGTATG TAAAGCAATT 336 0 

55 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

TGAAATnGAA TAGTACTATT GCAAGTGTAA AGAGGTTAAT TTTTGCCnCA CGCGGGACTT 6 0 

AAAAAGGCAA CCACTGGTTG TGACATATCC TTATTTACAT TTATAAATAT AAGGAGGAGG 120 

TAGTAGTGAA AGACTTATTG CAAGCACAGC AAAAGCTTAT ACCGGATCTC ATAGATAAAA 180 

TGTATAAACG TTTTTCTATT CTTACTACTA TCTCAAAAAA TCAGCCTGTC GGACGTCGAA 24 0 

15 GTTTAAGCGA ACATATGGAT ATGACTGAAC GTGTACTGCG TTCTGAAACA GATATGCTTA 3 00 

AGAAACAAGA TTTGATAAAA GTTAAGCCTA CCGGAATGGA AATTACAGCT GAAGGTGAGC 3 60 

AACTGATTTC GCAATTGAAA GGTTACTTTG ATATCTATGC AGATGATAAT CGTCTGTCAG 4 20 

AAGGTATTAA GAATAAATTT CAAATTAAGG AAGTTCATGT TGTTCCTGGT GATGCTGATA 4 80 

ATAGTCAATC TGTTAAAACA GAATTAGGTA GACAAGCAGG TCAATTACTT GAAGGCATAT 54 0 

TACAAGAAGA CGCGATAGTT GCTGTAACTG GCGGATCCAC GATGGCATGT GTTAGTGAAG 6 00 

CAATTCATTT ATTACCATAT AATGTATTCT TCGTACCAGC CAGAGGTGGA CTAGGCGAAA 66 0 

ATGTTGTCTT TCAGGCAAAC ACAATTGCAG CCAGTATGGc aCAACAAGCT GGCGGTTATT 720 

ATACGACGAT GTATGTACCT GATAATGTCA GTGAAaCAAC ATATAATACA TTGTTGTTAG 7 80 

AGCCATCAGT CATAAACACT TTAGACAAAA TTAAACAAGC AAACGTTATA TTACACGGCA 84 0 

TTGGTGATGC GCTGAAGATG GCGCATCGAC GTCAATCACC TGAAAAGGTC ATTGAACAAC 900 

35 TTCAACATCA TCAAGCTGTC GGAGAGGCAT TTGGTTATTA TTTTGATACA CAAGGTCAAA 96 0 

TTGTCCATAA GGTTAAAACA ATTGGACTTC AATTAGAAGA CCTTGAATCA AAAGACTTTA 1020 

TTTTTGCAGT TGCAGGAGGC AAATCGAAAG GTGAAGCAAT TAAAGCATAC TTGACGATTG 10 8 0 

CACCCAAGAA TACAGTGTTA ATCACTGATG AAGCCGCAGC AAAGATAATA CTTGAATAAG 114 0 

AGATAAAAAG TTTAATACTT TTTAAATATC ATTTTAAAGG AGGCCATTAT AATGGCAGTA 12 0 0 

AAAGTAGCAA TTAATGGTTT TGGTAGAATT GGTCGTTTAG CATTCAGAAG AATTCAAGAA 12 6 0 

GTAGAAGGTC TTGAAGTTGT AGCAGTAAAC GACTTAACAG ATGACGACAT GTTAGCGCAT 13 2 0 

TTATTAAAAT ATGACACTAT GCAAGGTCGT TTCACAGGTG AAGTAGAGGT AGTTGATGGT 13 8 0 

GGTTTCCGCG TAAATGGTAA AGAAGTTAAA TCATTCAGTG AACCAGATGC AAGCAAATTA 14 4 0 

CCTTGGAAAG ACTTAAATAT CGATGTAGTA TTAGAATGTA CTGGTTT CT A CACTGATAAA 15 0 0 
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AAATTCATTA GATTCAGTGG ACCAAGATAC AGAGAAATCA AAATATTATG AGCAAAATTC 11100 

TG AAGCGAC T TTATCAACTA AATCAACCGA TAAAGTAGAA TCAACTGAAA TGAGAAAGCT 11160 

AAGTTCAGAT AAAAACAAAG TTGGTCATGA AGAGCAACAT GTACTTTCTA AACCTTCAGA 1122 0 

ACATGATAAA GAGACTAGAA TTGATTCTGA GTCTTCAAGA ACTGATTCAG ACAGCTCGAT 112 30 

GCAGACAGAG AAAATAAAAA AAGACAGTTC AGATGGAAAT AAAAGTAGTA ATCTGAAATC 1134 0 

TGAAGTAATA TCAGACAAAT CAAATACAGT ACCAAAATTG TCGGAATCTG ATGATGAAGT 114 0 0 

AAATAATCAG AAGCCATTAA CTTTACCGGA AGAACAGAAA TTGAAAAGAC AGCAAAGTCA 114 60 

AAATGAGCAA ACAAAAACCT ATACATATGG TGATAGCGAA CAAAATGACA AGTCTAATCA 11520 

TGAAAATGAT TTAAGTCATC ATATACCATC GATAAGTGAT GATAAAGATA ACGTCATGAG 11580 

AGAAAATCAT ATTGTTGACG ATAATCCTGA TAATGATATC AATACACCAT CATTATCAAA 11640 

20 AACAGATGAC GATCGAAAAC TTGATGAAAA AATTCATGTT GAAGATAAAC ATAAACAAAA 11700 

TGCAGACTCG TCTGAAACGG TGGGATATCA AAGTCAGTCA ACTGCATCTC ATCGTAGCAC 11760 

TGAAAAAAGA AATATTTCTA TTAATGACCA TGATAAATTA AACGGTCAAA AAACAAATAC 11820 

AAAGACATCG GCAAATAATA ATCAAAAAAA GGCTACATCA AAATTGAACA AAGGGCGCGC 11880 

TACGAATAAT AATTATAGTG ACATTTTGAA AAAGTTTTGG ATGATGTATT GGCCTAAATT 11940 

AGTTATTCTA ATGGGTATTA TTATTCTAAT TGTTATTTTG AATGCCATTT TTAATAATGT 120 00 

GAACAAAAAT GATCGCATGA ATGATAATAA TGATGCAGAT GCTCaAAAAT ATACGACAAC 12060 

GATGAAAAAT GCCAATAACA CAGTTAAATC GGTCGTTACA GTTGAAAATG AAACATCAAA 12120 

AGATTCmTCA TTACCTAAAG ATAAAGCATC TCaAGACGAA GTGGGATCAG GTGTTGTATA 12180 

TAAAAAATCT GGAGATACGT TATATATTGT TACGAATGCA CACGTTGTCG GTGATAAAGA 1224 0 

AAATCaAAAA ATAACTTTCT CGAATAATAA AAGTGTTGTT GGGAAAGTGC TTGGTAAAGA 12 300 

TAAATGGTCA GATTTAGCTG TTGTTAAAGC AACTTCTTCA GACAGTTCAG TGAAAGAGAT 12360 

AGCTATTGGA GATTCAAATA ATTTAGTGTT AGGAGAGCCA ATATTAGTCG TAGGTAATCC 12420 

ACTTGGTGTA GACTTTAAAG GCACTGTGAC AGAAGGTATT ATTTCAGGTC TGAACAGAAA 12480 

TGTTCCTATT GATTTCGATA AAGATAATAA ATATGATATG TTGATGAAAG CTTTCCAAAT 12 540 

TGATGCATCA GTAAATCCAG GTAACTCGGG TGGTGCTGTC GTCAATAGAG AAGGAAAATT 12600 

AATAGGTGTA GTTGCAGCTA AAATTAGTAT GCCAAACGTT GAAAnTATGT CATTTGCA 1265 8 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6048 base pairs 
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AAGATGAACA 


PATPAAP.ATP 


UAl 1 1 IAGTG 


AAAGATAGAT 


ATGACGATTT 


AG T ATT CTT A 


9300 


TTTGAAAATG 


AA'1'ITP.PA AP 


AAvjA 1 Vjvj 111 


OIV TVOlvr'TV JVIVf 

GAAGAGAAAT 


TPPPTV'lS A AT* 

ILL LT1 \j AAAT 


T* A A A TTVTH T* 

TAAATTGTAT 


9360 


AGTTTACTTT 


AAPAGPTP A A 


1 1 vjl Al AA1 L 


bAA Jl 1 I\j X I A 


RTT?l A A A A 

LAO AAAAAT 


AATTGTTT C G 


9420 


TTGAAGAAAA 


ATAAATTGTA 


TA'T U J U '^*TA AAA 
Inl ill AAAA 


r* A A TV TV TV^*»/^T*R 

vjAAAAAGvjX A 


lALlAIGAIb 


TAT CAAAT G A 


9480 


ATAACCTATG 


GP ATTTTP.T P 

X IX X 1 I*. 


ap aptipp apt 


A A /^"P'TTV IK TV TV 

AAC-1 1AAGAA 


A TV A OO/^'P 

TCATGACLGT 


A AAA T , ^» ' 1 »' T ' 

ATAAATG a TT 


954 0 


CGACACTTTA 


TCGTPATTAP 


GAtGATATPT 


1 L v_vj\j i AAAvj 


IvjCjCjLAAI I I 


AAA TT'f"' /^'p*T' A 

AAA 1 ILL II A 


9600 


GTGAGACCTT 


TGPTATTTAT 

i VJ v 1/1X X X f\ X 


1*7* Zl P P A T 2i fin 
1 1 AvjV_-Hl AVjVj 




TGTACTTAAC 


TTATTTATTT 


966 0 


AAAGGAGTTG 


TAPATGTTAA 


TPP ATPP A RP 


1 1 ITjAIL 1 1 A 


CCTTATTTAT 


GGGTACTTGT 


9720 


CGTTTTAGTA 


TT*TTTAGAAf2 
x x x x x nunnvj 




APP AP ATA IVP 
Avj UAvjA 1 AAC 


GCGAI ITG1TA 


TGG CTGTAAT 


9 78 0 


GGTTAAGCAP 




AZVPAAPPTA A 
AALAALLi 1 AA 


ft ft ft ft ^^^p^p^p^* 
AAAACjL 1 1 1 Cj 


TTTTACGGTT 


TGTTAGGTG C 


9840 


A TTTPT A TTT 


r\\jJA 1 111 l/ivj 


LA 1 1 A ITCTT 


AATTAGTATT 


ATCGCGAACT 


TTTGGTTTAT 


9900 


TPA AHPTP.P A 


uuAbLw 111 


A PTT A TV T»T"P A 
ALl IAAI I TA 


TATGTCAAT C 


AAAAATCTGT 


GGCAGTTCTT 


9960 


T AAA PAP PP A 


GAAATTP.AA A 


CTrrrp a Apr* 


TPP 7\ O T\ T 1 /^ TV T* 
1 VjvjAvjA 1 ViA I 


CATCATTATG 


ATGAATCTGG 


1002 0 


TP.AAPAPATT 

X VJJvVVjrt.vjrt 1 1 


/IMAvjLAAvj 1 A 


ALAAAI LAI 1 


L 1 viGGGAA LI 


GTGTTGAAAA 


TAGAATTTGC 


10080 


AGAT A T CG P A 


TTTPP P A TTP 

1 X HjULaJ, X Vj 


Al 1 v, 1A1\jL I 


I vjL 1(jL I 1 1 A 


gCTATTGCTG TAACACTTCC 


1014 0 


TAAAGTTGGT 


-T\ X 1 V^rlV_ 1 1 lu 


HTflTT A TPP A 
vj 1 VjVj 1 A 1\jvjA 


/ I 1 * r* ft p^ j^^p^^ ft 

L I I AGGTLAG 


TTCGTAGTCA 


TGTTCCTAGG 


10200 


TGGAATGATT 


vjvj x vj x inl 1L 


T A A lYZCnT^V A 
lrtMI\yV_vjl 1A 


I vjLAvjL-AALA 


TGGTTTGTAG 


AGCTATTAAA 


1026 0 


CAAAT AT P PA 


GC5APTTP.AAP 


PTPP APPPTt- 
vj 1 vj\_/\vjv-v- 1 L 


CC CC* ATrV^TT 

LvjLvjA1L\j1 1 


GGTTGGGTAG 


GTGTTAAATT 


10320 


AGTTGTPATC, 

•"VJ X X VJ X \ X VJ 


GTA TT AG PP.P 


APPPanziPA r r 


PTPT A 
LvjLIVjIAI ILs 


CCTGAGCACT 


TCCCACATGG 


103 80 


CGTATTATGT; 

V— VJ X J» X X f\ X VJVJ 


P A A TPT A TTT 


TP'rnp a p a pt 

1 v- 1 vjvjtAv-^\vj 1 


A PT A A TTVV A 

AL I AAl 1 viGA 


TTAGTAATTA 


TCGGTTGGTT 


1044 0 


AGGTTPAGTT 


GTTAAA A ATA 


A A A ZlZtTPPPTA 


TA A A T A A TTP 
1AAA1AAI lVj 


ATGTGAAGCG 


GACAATCTTA 


10500 


ATTTAGTTTA 

r\ XXX n\J XXX f\ 


AP^iTTPTPPT 


1111 v^Al 1 1 A 


A I IUAGIvjAI 


TTATGAAAAA 


TGGATTTTGA 


1056 0 


AGAATGTGAA 


TPAA A APATP 


PP AT AT APT A 
v_vj/\ 1 A 1 no 1 A 


TT A APA A A AT 

I IAAvjAAAAI 


GTGCCTTTTA 


TATTTAGCAT 


10620 


TTT'iTCAATA 


GAAATTATAT 


A.P. aTTTTfi A A 
r\\jj\ 1111 >vrtM 


pp a a attapp 
vjLAAAI 1Avj\j 


TGTTAATGTG 


TCATAATGAT 


t n c o r\ 
10680 


AAGTGATTTT 


ATTGA ATGGA 


PTPP A P A TT A 
vj 1 vjvj/V^/- 1 * i in 


PTPP ATA TTP 


GTAAAAAACA 


TGT AATT C C T 


1 074 C 


AAAAGTCAGT 


nACCsaCGTA 


AGCGTCGTGA 


ATTCTTCCAC 


AACGAAGACA 


GAGAAGAAAA 


108C0 


TTTAAATCAA 


CATCAAGATA 


AACAAAATAT 


AGATAATACA 


ACATCAAAAA 


AAGCAGATAA 


10860 


G CAAAT A CAT 


AAAGATTCAA 


TTGATAAGCA 


CGAACGTTTT 


AAAAATAGTT 


TATCATCGCA 


10920 
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GTTTTAGCAT CAAAAGGAAG AGAACCATAT CAAATCATGC CAGGGCATAT TAAGGTGCCA 7 500 

CATCGAGATG ATTTAATTGG CCTTGAAGCA GCTTACAAAA AGTTCGGTGG TGGCCCTGTT 7 560 

GATTAATAAA AGATTTATTG ATGAAGGTAA AACTATTGAT GTTTATTTAT TCGAAGCATT 762 0 

AAATAACCAG ATAATCATTG CTATACCAGA TTGGTTTTGG TCATATCAGA TGGCAATGAC 7 6 80 

ATTAGATGAA GAAACTTGTT TTGAAGCAAT ACTCATGCAA TTGTTTGTTT TTAAAGAAGA 7740 

GGAAGAGGCA GAATCGATTG CATCACAACT AACAGATTGG ATAGAAACAT ATAAAAAGGA 78 00 

GAAAGACTAA TGAACTTAAA GCAAGAAGTT GAGTCTAGAA AGACTTTTGC GATTATTTCA 7860 

CATCCCGATG CAGGGAAAAC AACGTTAACT GAAAAACTAT TGTACTTCAG TGGTGCTATT 7920 

CGTGAAGCGG GTACAGTTAA AGGGAAGAAG ACTGGTAAAT TTGCGACAAG TGACTGGATG 7980 

AAAGTTGAAC AAGAGCGTGG TATTTCTGTA ACTAGTTCAG TAATGCAATT TGATTACGAT 8040 

GATTATAAAA TCAATATCTT AGATACACCA GGACATGAAG ACTTTTCAGA AGATACGTAT 8100 

AGAACATTAA TGGCAGTTGA CAGTGCTGTC ATGGTCATAG ACTGTGCAAA AGGTATTGAA 8160 

CCACAAACAT TGAAGTTATT TAAAGTTTGT AAAATGCGTG GTATTCCAAT CTTTACATTC 8220 

ATTAATAAAT TAGACCGAGT AGGTAAAGAA CCATTTGAAT TATTAGATGA AATCGAAGAG 82 80 

ACATTAAATA TTGAAACATA CCCTATGAAT TGGCCAATTG GTATGGGACA AAGTTTCTTT 834 0 

GG CATC ATT G ATAGAAAGTC TAAAACAATT GAACCATTTA GAGATGAAGA AAATATATTA 84 00 

30 CATTTGAATG ATGATTTTGA GTTGGAAGAA GATCATGCAA TTACAAATGA TAGTGATTTT 84 60 

GAACAAGCGA TTGAAGAATT AATGTTGGTT GAAGAAGCGG GTGAAGCCTT TGATAATGAC 8520 

GCGCTGTTGA GTGGAGACTT AACACCTGTA TTTTTCGGTT CAGCTTTAGC TAACTTTGGT 8580 

35 GTACAAAATT TCTTAAATGC ATATGTTGAT TTTGCGCCAA TGCCAAATGC GAGACAAACA 8640 

AAAG&AGACG TTGAAGTAAG CCCGTTTGAT GATTCATTTT CAGGATTTAT CTTTAAAATT 8700 

CAAGCCAACA TGGACCCTAA ACACCGTGAT AGAATTGCCT TTATGCGTGT CGTTAGTGGT 8760 

GCATTTGAAC GTGGTATGGA TGTTACTTTG CAACGTACTA ATAAAAAGCA AAAGATCACA 8820 

CGTTCAACGT CATTTATGGC AGACGATAAA GAAACTGTGA ATCATGCTGT AGCAGGCGAT BB80 

ATCATTGGAC TATATGATAC TGGTAATTAT CAAATTGGAG ATACTTTAGT TGGTGGAAAA 8 94 0 

CAAACCTACA GTTTCCAAGA TTTACCACAA TTTACGCCAG AAATTTTTAT GAAAGTTTCT 9000 

GCTAAAAACG TCATGAAACA GAAGCATTTC CATAAAGGTA TTGAAGAATT AGTACAAGAA 90 60 

GGTGCGATTC AATACTATAA AACATTACAC ACAAACCAAA TTATTTTAGG TGCTGTTGGT 9120 

CAGTTACAAT TTGAAGTTTT CGAACATAGA ATGAAAAACG AATATAATGT TGATGTTGTT 9180 
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TTGAGTAACC ATTAATAGCC ACCCTCCGTT AGTTTGAAAA TTTTATTTAA GTGTAACTTA 5 70 0 

TTTTACGGCA TTATAAAAGA AATAAAGACG CAAAGTCGTT ACATTTATAG CAATTTTAAT 576 0 

CTATAGATGA ATTGATACAA AATAAAACGT TATTTTATAA AGCAATTTAT TGTTCTATGT 5 82 0 

TTTATTTGTA TATTTAAAAT TATCCAGTAT ACAATTATAG CATATTTTTG GAAACAATTA 5 8 80 

TGATATTATA CCATGTTACA AGATGGTTTT AATAATTTAA GATGAGCCAT AATTGTAAAA 5 94 0 

CTAATTCATA ATACCGTATG TTTTATTTTT AATAGTAGAA ATTAGAAAAT GCTGATTAGT 6000 

AGGATATAAC AGTGAAATTA TAAATTTATT AACATCAACA AAACGTGTAT AATAAACATA 6 060 

TTGTAGAAAA AGGAGCGGTT CAGTTTGGAT GCAAGTACGT TGTTTAAGAA AGTAAAAGTA 6120 

AAGCGTGTAT TGGGTTCTTT AGAACAACAA ATAGATGATA TCACTACTGA TTCACGTACA 6180 

GCGAGAGAAG GTAGCATTTT TGTCGCTTCA GTTGGATATA CTGTAGACAG TCATAAGTTC 624 0 

TGTCAAAATG TAGCTGATCA AGGGTGTAAG TTGGTAGTGG TCAATAAAGA ACAATCATTA 6 3 00 

CCAGCTAACG TAACACAAGT GGTTGTGCCG GACACATTAA GAGTAGCTAG TATTCTAGCA 6360 

CACACATTAT ATGATTATCC GAGTCATCAG TTAGTGACAT TTGGTGTAaC GGGTACAAAT 642 0 

GGTAAAACTT CTATTGCGAC GATGATTCAT TTAATTCAAA GAAAGTTACA AAAAAATAGT 64 80 

GCATATTTAG GAACTAATGG TTTCCAAATT AATGAAACAA AGACAAAAGG TGCAAATACG 6 54 0 

ACACCAGAAA CAGTTTCTTT AACTAAGAAA ATTAAAGAAG CAGTTG AT GC AGGCGCTGAA 66 00 

30 TCTATGACAT TAGAAGTATC AAGCCATGGC TTAGTATTAG GACGACTGCG AGGCGTTGAA 66 6 0 

TTTGACGTTG CAATATTTTC AAATTTAACA CAAGACCATT TAGATTTTCA TGGCACAATG 6 72 0 

GAAGCATACG GACACGCGAA GTCTTTATTG TTTAGTCAAT TAGGTGAAGA TTTGTCGAAA 67 8 0 

GAAAAGTATG TCGTGTTAAA CAATGACGAT TCATTTTCTG AGTATTTAAG AACAGTGACG 6 84 0 

CCTTATGAAG TATTTAGTTA TGGAATTGAT GAGGAAGCCC AATTTATGGC TAAAAATATT 6 90 0 

CAAGAATCTT TACAAGGTGT CAGCTTTGAT TTTGTAACGC CTTTTGGAAC TTACCCAGTA 6 96 0 

AAATCGCCTT ATGTTGGTAA GTTTAATATT TCTAATATTA TGGCGGCAAT GATTGCGGTG 7 02 0 

TGGAGTAAAG GTACATCTTT AGAAACGATT ATTAAAGCTG TTGAAAATTT AGAACCTGTT 70 8 0 

GAAGGGCGAT TAGAAGTTTT AGATCCTTCG TTACCTATTG ATTTAATTAT CGATTATGCA 714 0 

CATACAGCTG ATGGTATGAA CAAATTAATC GATGCAGTAC AGCCTTTTGT AAAGCAAAAG 72 0 0 

TTGATATTTT TAGTTGGTAT GGCAGGCGAA CGTGATTTAA CTAAAACGCC TGAAATGGGG 72 6 0 

CGAGTTGCCT GTCGTGCAGA TTATGTCATT TTCACACCGG ATAATCCGGC AAATGATGAC 732 0 

CCGAAAATGT TAACGGCAGA ATTAGCCAAA GGTGCAACAC ATCAAAACTA TATTGAATTT 73 8 0 
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TGTAAAAGAA 


TACCAGGGAA 


CaACAATAAA 


TGGcGCTTTG 


TCACATCAAC 


AATTTGTCTC 


3900 




AATTGAGCTT 


TAACTGGACG 


AGTATTATAA 


TTTGTTAACT 


TTACATCGAC 


AAAATAATAT 


3960 


5 


AATATCCATG 


CAATTAAAAC 


GACTAAAGAC 


ATCATGAAGG 


CAAAGCGTGT 


TGGGTGCACT 


4020 




TTGATAAGTA 


GATTCATAAA 


AACCATACCT 


ACCAATAGGC 


CTAACAACCA 


TGAAAAATAA 


4080 


10 


ACATAGCCCA 


TTTGTTTGCC 


ACGTTTATCT 


TCTTCAACAC 


TGGATAACAT 


AATGACCCAA 


4140 


ATAGGACTAA 


CTGCAATACC 


GAGCATCATA 


GCACTAAATA 


TGATTACAAA 


AGGTGATGCT 


4200 




GG AAA CC AAA 


TAACTAAAAA 


TAAACTTGTA 


AATGCTAAAA 
CCTATAACAA 


TAAATCCAGT 
AGTTTGTAGA 


CGTTAAAACG 
TGCATCAGCA 


4260 
4320 


15 


ATTTTTGTGC 


CGAATTTTTT 


CAGTAAAAAT 




ATAAAATGTA 


TTGAAAATGC 


TAGAGACGTT 


ATTGCTACAG 


CAATGGATGT 


AACTGTTGGC 


4380 




AAGAAATTAA 


TATAGCTTAG 


GATATACATG 


CCTCTCGCAA 


ATTCCATTAA AAATAAGATA 


4440 


20 


ATAAGCaTTA 


AAATGAAATT 


TTTATGATTA 


GCGTAATTAT 


TTAACGAAGA 


ATCTTGCATA 


4500 




TAAAGGAACC 


TTTCCATAAA 


TCTCTTGTGG 


TTGTGATGAA 


TGACCGATTA 


AATCAAGTAA 


4560 




GTCTCGACAT 


ATTGTCTGTG 


TAGCATACTT 


AATTTTATCT 


TGTTCCATTG 


TACTAATCAT 


4620 


25 


GTTAGTTAAT 


TGCTCATTAC 


CGTTAGTTAA 


ACTTGCTACA 


ATTTTTATTG 


CTTCTTCTGG 


4680 




AGTATCAGCG 


ATTTTACCAA 


AACCTTTTTC 


TTCAAAGTAA 


AGGGCATTTT 


CAAGCTCTTG 


4740 




ACCAGGTGCA 


GGATTTAGGA 


AAATCATTGG 


AATACAACGG 


GCGAAACCTT 


CAGTTATTGT 


4800 


30 


GATACCACCA 


GGTTTCGTAA 


TCATAAGTTG 


ACTTGATGCC 


ATCCATTCAT 


TCATGTGTTT 


4860 




GGTATAACCT 


AGAATCAATA 


CATTCTCGTT 


AGATTTAAAC 


TTAGCTGTTA 


AAGAACGCTT 


4920 




TAGCTCTTTG 


CTCTTACCAC 


AAATCATAAC 


TACTTGTGCA 


TTTGCaCTTT 


tCGCTAATAT 


4980 


35 


ATCAGTAATC 


ATCGTGTCAA 


AACCTTTAGA 


TACACCAAAT 


GCACCAGCTG 


aCATTAAAAT 


5040 




AGTTTGCTTA 


TCTGGATCTA 


AGTTGTTGTC 


TATTAACCAC 


TGCTTTTGAT 


TAATAGGCGT 


5100 


40 


TTCAAATTTG 


TTATCAATAG 


GAATACCTGT 


CaCTTTAACT 


GTTGAAGGAT 


CAATACCTAC 


5160 


GTCTATGAAG 


TCTTGTTTCG 


TTTCTTTTGT 


TGCCACATAA 


TATCTTGTTG 


AATACGGCGT 


5220 




AATCCAGTTT 


TTATGTAAGC 


GATAGTCTGT 


CATCACTGTA 


GCAACTGGAA 


TATTAATGTT 


5280 


45 


AAATTGCTCA 


GTTAGTACCG 


ACATAACTGG 


TGTAGGAAAC 


GTTAATAATA 


TTAAATCTGG 


5340 




CTTTTCTTTT 


ATCAATAAAT 


TAATTAACTT 


ATTAAGTCCA 


TAGTATTTGT 


AAAAACATTT 


5400 




GTCTAGTTTA 


TCTGGGCGGC 


TGTAATAAAA 


CCCTTTGTAC 


ATATTTCTAA 


AATATTTAAA 


5460 


50 


GCTATTGATA 


TACCATTTTT 


TACAAATAGA 


AGTCAAAATT 


GGATGAGCTT 


CCATAAATAA 


5520 




ATCGTGCTCA 


ATGACGCTTA 


AATGGTCTAG 


ATTCATATCA 


TTAAGTTGAT 


TAACGATACT 


5580 
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U T CGG T AAAG 


AAAT ATTGC C 


ATTTATTGAC 


TCGACGTTTT 


CTACACTGAA 


AGTAGGTAAT 


2100 




Z""' ft ft /"* f** T»*T» TV rn 

(-AACjCjTTAT 


TAGTAGGGGA 


TAGTTTAGCG 


GGTAGTATTG 


CCTTATTAAC 


GGCGTTGACC 


2160 


5 


ft T 1 ^/^ 7\ is ft 

I ATCCAACGA 


tttttagtcg 


TGTAGCAATG 


TTAAGTCCAC 


ATTCAGATGA 


AAAAGTATTA 


2220 




/"* ft T* ft ft /*• ft 

(j A I AAG CT AA 


ATCAATGTGC 


AAATAAAGAA 


CAATTGACAA 


TTTGGCATGT 


CATTGGTCTA 


2260 


10 


/■* A T'O TV ft ft ft ft /•"» 

UATUAAAAAG 


ATTTTACTTT 


ACCAACAAAT 


GGTAAGCGTG 


CCGATTTCTT 


AACACCGAAT 


2340 


TA O ft O TV ft M'l' ft /""» 

ACjAGAATTAG 


CTGAACAAAT 


TAAGAAATAT 


AATATAACTT 


ATTATTACGA 


TGAATTTGAT 


2400 




GGTGGTCACC 


AATGGAAAGA 


TTGGAAACCA 


TTGCTGTCAG 


ATATATTATT 


GTATTTTTTA 


2460 


15 


ft /*"Tfft ft ft ft ft /"i m 

agtaaaaaca 


CAGATGATCA 


ACTTTATGAA 


TAATTTACAT 


TAGTAGATTT 


AGTATGAATT 


2520 




GTCTTCATAT 


AGTCTGGTCT 


ATAATATAAT 


TTATAAAAGA 


TTTTACTGTT 


TAATTTAATT 


2580 




taaatttgac 


GAAATTGCAA 


AAGATGTATA 


ATGAATTATT 


TTTAATGTAA 


CGGTTTTCAA 


2640 


20 


AGAAATTTGA 


TATAATAGCA 


ATAGGTTAAA 


CAAAGGAGGA 


ATTCAGATGA 


TTTTAGGATT 


2700 




AG CATT AATT 


CCATCAAAGT 


CATTTCAAGA 


AGCGGTGGAT 


TCTTACCGTA 


AAAGATATGA 


2760 




TAAACAGTAT 


TCACGAATTA 


AACCACATGT 


GACAATTAAA 


GCGCCATTTG 


AAATTAAAGA 


2820 


25 


tggtgattta 


GATTCTGTCA 


TTGAACAGGT 


TAGAGCTCGT 


ATTAATGGTA 


TACCAGCAGT 


2880 




agaagttcat 


GCTACAAAAG 


CTTCTAGCTT 


CAAACCAACG 


AACAATGTGA 


TTTACTTTAA 


2940 




agttgcgaag 


ACGGACGACT 


TAGAAGAATT 


GTTTAATCGC 


TTTAATGGAG 


AAGATTTCTA 


3000 


30 


tggagaagct 


GAACATGTTT 


TTGTGCCACA 


CTTTACAATA 


GCACAAGGAC 


TATCTAGCCA 


3060 




agaattcgaa 


GATATTTTTG 


GTCaAGTAGC 


ATT AG CTGGG 


GTAGACCaTA 


AAGAAATTAT 


3120 




cgatgaatta 


ACTTTGTTAC 


GTTTTGACGA 


TGACGAAGAT 


AAATGGAAAG 


TTATTGAAAC 


3180 


35 


gtttaaatta 


GCTTAAGTAA 


CATAATAGTA 


TTGTTAATCG 


TAGTATGTTT 


GAATTAATAA 


3240 




gaaaatggtc 


ATTTTTATTG 


AATGTAATAA 


AAATGACCAT 


TTTCTTTATT 


TTAAAATACG 


3300 


40 


TTTTAACCTT 


ACTTAGCTTT 


TTCTCTATTT 


ACTATAAAGT 


rGCTTCCATA 


AAATACAGCT 


3360 


aagactaaaa 


AGATTAATGC 


CGAGAAATAA 


AATGTATTGT 


TTAAATTGTT 


GGTAAATTGT 


3420 




gtaattaatc 


CGCCAAATAA 


TGGCCCTATC 


ATTGAGCCGA 


ATCCTTGGAT 


ACTATTAAAA 


3480 


45 


acaccccaag 


TTTCTTCTTG 


TTCATCTGAT 


TTGATAAATC 


GTGCCATAAA 


GGTATTCCAT 


3540 




gctggtaata 


AGATGCCATA 


CATT AG AC CG 


ATAGCTAAAG 


CGATAATCCA 


CAAGATGTGA 


3600 




atattaacaa 


TCATAGATAG 


AGTAAAAATT 


AATATCATGT 


ATAAAATAAA 


TCCGCTTAGA 


3660 


50 


ATAACACCAT 


ACATAAAGTT 


TCTGCTGCGG 


TTATCTATTA 


GTTTCGATAA 


AAATAGCATC 


3720 




gaaactgcac 


AGCCGATACC 


ACCAATAATG 


ATTGCAACAG 


TATATTCAAT 


TGTGCTTACG 


3780 
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15 



GGTACAGGTA ATATTGTCGG TGTATCTACT GCAATATTTA TAGGAGGACC TGGTGCAGTA 30 0 

TTTTGGATGT GGATTACTGC GTTTTTAGGT GCAAGTAGTG CTTTTATTGA ATCTACACTT 36 0 

GGTCAAATAT TCAAGAGAGT TGAAAATAAT GAATACCGTG GTGGACCAGC GTATTATATT 42 0 

GAATATGGTA TTGGTGGTAA ATTTGGTAAA ATTTACGGAA TTATCTTTGC TATTGTTACG 48 0 

ATTATCTCAG TAGGTCTATT GCTTCCTGGT GTGCAATCTA ACGCTATAGC AAGTTCTATG 54 0 

CATAATGCGA TTCATGTTCC ACAATGGTTA ATGGGTGGTA TTGTTGTAGT TATTTTGGGA 600 

TTAATTATTT TTGGTGGTGT ACGTATTATT GCCAATGTTG CAACAGCCGT TGTACCATTT 66 0 

ATGGCAATTA TTTACATACT GATGGCTGTC ATTATCATTT GTATCAATAT ACAAGAAGTG 720 

CCAGCGTTAT TTGCATTAAT TTTCAAATCA GCATTTGGAT TACAATCTGC TTTTGGTGGT 78 0 

ATCGTTGGCG CAATGATAGA GATTGGTGTT AAACGTGGAT TATATTCAAA TGAGGCTGGT 84 0 

20 CAAGGTACAG GTCCACACGC AGCAGCGGCa gcAGaAGTAT CACATCCAAG TAAACAAGGT 900 

CTAGTACAAG CATTTTCAGT TTATATTGAT ACATTATTTG TATGTACTGC AACTGCTCTG 96 0 

ATTATACTTA TTTCTGGTAC ATATAATGTG ACTGATGGTA CGGTTAATGC GAATGGCACA 102 0 

25 CCGCATTTAA TTAAAGATGG CGGTATTTAT GTTgAAAATG CAACAGGTAA AG ATT ATT CA 1080 

GGTACTGCGA TGTATGCACA AGCCGGCATt GATAAAGCGT TCCATGGCAG TGGTTATCAA 114 0 

TTTGATCCTA CTTTCTCTGG CGTAGgTTCG TACTTTATTG cATTTGCTTT ATTCTTCTTT 1200 

GCATTTACTA CAATTTTGTC GTACTACTAC ATTACAGAAA CAAATGTTGC TTATTTAACG 1260 

CGTAATCAAA ATAATCAAGT TTCATCGATA TTTATTAATA TTGCTCGTGT GATTATTTTG 1320 

TTCGCTACAT TTTACGGTGC AGTTAAAACA GCTGATGTAG CATGGGCATT CGGTGATTTA 13 80 

GGTGTAGGTC TAATGGCTTG GTTAAATATC ATTGCGATTT GGATTTTACA TAAGCCTGCC 144 0 

GTAAATGCTT TAAAAGATTA TGAAATTCAA AAGAAACGTT TAGGCAACGG TTATAATGCA 1500 

GTTTATCAAC CTGATCCGAA TAAATTACCT AATGCTGTCT TTTGGTTGAA GACGTATCCA 156 0 

GAACGTTTAA AACAAGCACG TGCCAAAAAG TAATCTACTT TTGTTTATAG TATATGTAGT 162 0 

GATCATTTGA TAAAAAAGAA AAGTATTGAG AATTTTAGGt GCTCAGAAAT TTGAATTTTA 1680 

AAAATATAGT GTCTCTTGGT ACAATAACAA TACAACTACT AGGGGCACTT TTTTATGTCA 174 0 

GAATTTAAAA CTGGTAAGAT TAATAAACAT GTTTTATATA GTAATATTTT AAATAGAGAT 180 0 

GTCACGTTAA GTATTTATTT ACCAGAATCT TATAATCAAC TTGTTAAATA TAATGTCATT 186 0 

50 CTTTGCTTTG ACGGATTAGA TTTTTTACGT TTCGGGAGAA TACAACGTAC ATATGAATCG 192 0 

TTAATCAAAG AAGCGCGTAT TGATGATGCG ATCATTGTTG GATTCCATTA TGAAGACGTT 198 0 
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CTCATGTTAA 


AGCGCCACAA 


AATTGAAGCA 


TTATTTTTTG 


CATTAACAAT 


GGCATTATCT 


720 


GGAATTTTGA 


ATCCAGCATT 


AAAAAATATA 


TTCGATAGAG 


AAAGACCTAC 


ATTGCTGCGT 


780 


TTAATTGATA 


TAACAGGATT 


TAGTTTTCCT 


AGCGGTCATG 


CTATGGGATC 


AACTGCATAT 


840 


TTTGGAAGXG 


GTATCTATCT 


ATTAAAT CG A 


TTAAATCAAG 


GTAATTCAAA 


AGGTATTCTT 


900 


ATAGGGTTAT 


GTGCAGCTAT 


GATTTTATTG 


ATTTCCATAT 


CACGTGTATA 


TCTAGGTGTA 


960 


CATTATCCAA 


CAGATATTAT 


TGCCGGCATT 


ATTGGTGGAT 


TATTTTGcAT 


TATTTTATCA 


1020 


ACGTTATTAC 


TTAGAAATAA 


ATTAATAAAT 


TAAATAGTAA 


AAAAACAAAA 


GCAGTAAACC 


1080 


TAAAGTGTCG 


TAAGGGTTTA 


CTGCTTTTAT 


AAAACGTTGT 


TATAACGTAT 


ATTGTCTTTT 


1140 


ACGGGCATAT 


AAnAGGGGAA 


TATTTGAnAA 


TGACCAATCC 


AACAAGAACG 


AAACGTTGTG 


1200 


GGGGGGATGT 


TCTATGTGGT 


ATTGATAATC 


ATTTTCAACT 


ACTATTATAC 


ATTAGTGAGA 


1260 


ATCATTGTCA 


ATTAGAAACT 


AAAACTTTTT 


TTGAATATTT 


TTTAAGAATA 


GTAAATAAAA 


1320 


CGCATGATTA 


CGCTATTTTA 


GAAAATAAAA 


AAATTTGTAT 


TTCTCATTAG 


AATTAGAATA 


1380 


TTTAAAAGTG 


ATGAGGTTTA 


AACATTATAT 


TGTTTACATA 


CTCCTTTTGA 


ATTCATACAT 


1440 


TATGAAATGT 


tACTTCCAAG 


TTCAAAAT CG 


CACATTGAAA 


TGATGTGTGA 


AATGTTTAAA 


1500 


CTACGGTCAT 


tTTGTGmAAA 


TAAAGrTAAT 


AACTATTCAT 


TTTACAATAG 


TGAAAAGTCA 


1560 


GTATATGACA 


ACAATTAATA 


TTGCGGTAAG 


GCCTTGTGTT 


ACAGTATTCT 


ATATTTAAGT 


1620 


ACTG CAATCA 


GAATTAACAG 


AATGCCATTA 


ACTGATTATT 


AAATATTTGA 


GTTAATAAAT 


1680 


AATTAATGAT 


TGTAGCTTGA 


AAAATTTAAA 


ACATGGTTAT 


TGATTTGTGA 


TAAAATTTAA 


1740 


ACGTAAACAA 


ACTAATTTAA 


AAAGCAACTA 


TTGTATAGAA 


AAATACAAAA 


TTTAAAATAT 


1800 


ATTACCTTAT 


TAGAAAAA 










1818 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 12658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 127: 
TGTTTAAACA ATAGGGGGAA TCTTATGATT GAAAAATTAG TAACCTTTTT AAATGAGGTT 6 0 

50 GTTTGGAGTA AGCCATTAGT TTATGGTTTG CTAATTACTG GTGTGCTATT TACATTGCGT 12 0 

ATqCGATTTT TTCAAGTTAG A C A TTTT AAA GAAATGATTC GATTAATGTT TCAAGGAGAG 180 
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AAATACCGAG TGTTAATTTA ATACCTTATG CTCGTAAGCT AGAAGTATTG TATGAAGATG 1920 

CTTTTATCAT CAT AG TT ACT AAACCAAACA ATCAAAATTG TACGCCTTCG AGAGAACATC 1980 

5 CTCATGAAAG TTTAATCGAA CAAGTACTAT ATCATTGTCA GGAACATGGT GAAAATATTA 2 04 0 

ACCCACATAT TGTTACGCGT CTAGATCGTA ATACAACTGG TATTGTGATA TTCGCTAAAT 2100 

ATGGACATAT CCATCATTTA TTTTCTAAAG TAAACTTGAA AAAAATATAT ACTTGCCTTG 2160 

10 

TATATGGTAA AACCCATACA TCTGGTATTA TTGAAGCTAA TATTAGACGG TCAAAGGATA 2220 

GGATTATAAC TAGAGAAGTT GCCTCGGATG GTAAATACGC TAAAACATCT TATGAAGTAA 2280 

TAAATCAGAA TGATAAATAC AGTTTATGCA AAGTTCATTT GCATACGGGA CGTACACATC 2340 

15 

AAATTCGTGT ACATTTTCAA CATATTGGGC ATCCAATTGT GGGAGATTCT TTGTATGATG 2400 

GTTTTCATGA CAAAATTCAT GGTCAAGTAC TGCAATGTAC GCAAATATAT TTTGTTCATC 24 60 

CAATCAATAA GAACAATATT TATATTACAA TTGATTATAA GCAATTACTT AAATTATnCA 2520 

20 

ATCAACTCTA ATnCACACAG GGGGTGTAAG TATGTCAATG AnCACAGATG AAAAAGAGCG 2580 

TGT 2583 
25 (2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

35 

ATCAAGTGAT ACATTTAACT GGTAAAGGAT TAAnAGATGC TCAAGTTAAA AAATCnGGAT 60 

ATATACAATA TGAATTTGTT AAAGAGGATT TnACAGATTT ATTnGCAATT ACGGATACAG 120 

TAATAAGTAG AGCTGGATCA AATGCGATTT ATGAGTTCTT AACATTACGT ATACCAATGT 180 

40 

TATTAGTACC ATTAGGTTTA GATCAATCCC GAGGCGACCA AATTGACAAT GCAAATCATT 24 0 

TTGCTGATAA AGGATATGCT AAAGCGATTG ATGAAGAACA ATTAACAGCA CAAATTTTAT 300 

TACAAGAACT AAATGAAATG GAACAGGAAA GAACTCGAAT TATCAATAAT ATGAAATCGT 360 

45 

ATGAACAAAG TTATACGAAA GAAGCTTTAT TTGATAAGAT GATTAAAGAC GCATTGAATT -4 2 0 

AATGGGGGGT AATGCTTTAT GAGTCAATGG AAACGTATCT CTTTGCTCAT CGTTTTTACA 4 80 

50 TTGGTTTTTG GAATTATCGC GTTTTTCCAC GAATCAAGAC TTGGGAAATG GATTGATAAT 54 0 

GAAGTTTATG AGTTTGTATA TTCATCAGAG AGCTTTATTA CGACATCTAT CATGCTTGGG 600 

55 



10 
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20 
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TAAAAAAATT ATTATCAATG ATGAACTAGA ATTGACTGAA TTCCACCAAG AACTTACTTA 120 

TATTTTAGAC AACATAnAAG GGAATAATAA TTATGGTAAG GAATTTGTTG CAACCGTTGA 180 

AGAAACATTC GACATTGAAT AaAGCGGGGT GgaAGCACTA TGAATCAATG GGATCAGTTC 24 0 

TTAACACCTT ATAAG CAAG C GGTTGATGAG TTGAAAGkGA AcTTaAAGGC ATGCGCAAAC 3 00 

AATATGAAGT TGGTGAACAA GCGTCGCCAA TAGAATTTGT TACTGGTCGT GTTAAACCAA 36 0 

TCGCTAGTAT TATAGATAAG GCAAACAAAC GACAAATACC ATTTGATAGG TTAAGAGAAG 42 0 

AAATGTACGA TATCGCTGGT TTAAGAATGA TGTGCCAATT TGTTGAAGAT ATTGATGTTG 4 80 

TCGTCAATAT TTTAAGACAA AGAmAAGATT TTAAAGTAAT TGAAGAACGA GATTATATTC 54 0 

GTAACACTAA AGAAAGTGGT TACCGCTCGT ATCATGTCAT TATTGAATAT CCAATTGAAA 600 

CATTACAAGG CCAAAAATTT ATATTGGCTG AGATTCAGAT TCGTACATTA GCAATGAATT 66 0 

TCTGGGCAAC GATTGAACAT ACTTTACGAT ATAAATATGA TGGTGCTTAT CCGGATGAAA 72 0 

TTCAACATCG TTTGGAAAGA GCGGCAGAAG CAGCGTATTT ACTTGATGAA GAGATGTCTG 78 0 

AAATTAAAGA TGAAATTCAG GAAGCTCAAA AATATTACAC GCAAAAACGT TCTAAAAAAC 84 0 

25 ATGAAAATGA TTAACGAGGT GTTATAAATC ATGCGTTATA CAATTTTAAC TAAAGGTGAC 90 0 

TCCAAGTCTA ATGCCTTAAA GCATAAAATG ATGAACTATA TGAAAGrTTT TcGCATGaTT 96 0 

GaGGATrGTG AAAaTCCTGA AATTGTTATT yCAGTTGGTG GTGACGGTAC ATTACTACAA 102 0 

^0 GCATTCCATC AGTATAGCCA CATGTTATCA AAAGTGGCAT TTGTTGGAGT TCATACAGGT 108 0 

CATTTAGGAT TTTATGCGGA TTGGTTACCT CATGAAGTTG AAAAATTAAT CATCGAAATT 114 0 

AATAATTCAG AGTTTCAGGT CATTGAATAT CCATTGCTTG AAATTATTAT GAGATACAAC 12 0 0 

GACAACGGCT ATG AAA CAAG GTATTTAGCA TTAAATGAAG CAACGATGAA AACTGAAAAT 126 0 

GGCTCAACAC TTGTTGTGGA TGTTAACTTA AGAGGGAAAC ACTTTGAGCG ATTTAGAGGC 132 0 

GATGGATTAT GTGTATCAAC ACCTTCGGGT TCAACGGCTT ATAACAAAGC GCTAGGTGGC 1380 

GCACTGATAC ATCCTTCACT TGAAGCAATG CAAATTACAG AAATTGCCTC GATAAATAAT 14 4 0 

CGTGTGTTTA GAACGGTAGG ATCACCACTT GTATTACCAA AGCATCATAC ATGTTTAATA 15C0 

TCACCAGTTA ATCATGATAC CATTAGAATG ACGATAGATC ATGTTAGTAT CAAACATAAA 156 0 

45 

AATGTTAATT CAATACAATA CCGTGTAGCA AATGAAAAAG TGAGGTTTGC ACGTTTTAGA 16 20 

CCATTCCCAT TCTGGAAACG TGTGCACGAT TCTTTCATAT CAAGTGATGA AGAACGATGA 16 80 

50 AATTTAAGTA TCATATATCA CAACAAGAAA CTGTTAAAAC TTTTTTAGCA CGACATGATT 174 0 

TTTCTAAGAA GACAGTGAGC GCCATTAAAA ATAATGGCGC TTTAATTGTT AATGATGAAC 180 0 
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ACATCTTCAA CTTCTGGTAA AACTGCCACT GTAGCTGTTG AAGTATGAAT ACGTCCACCT 64 8 0 

GATTCTGTTT CAGGCACACG TTGAACGCGG TGCGCACCAT TTTCAAATTT CAATTTACTA 654 0 

5 TACGCGCCAT TACCAGAAAC TGAGAAACTA ATTTCTTTGT AACCACCATG GTCACTTTCA 6 6 00 

GACGCTTCTA CTATTTCAGT TTTGAATCCT TGTGATTCAG CATACTTTGA ATACATACGC 6 6 60 

ATTAAATCAC CAGCAAAAAT CGCAGCCTCA TCACCACCTG CTGCTGCTCT TATTTCTACA 6 72 0 

w 

ATAACGTCTT TGTCATCATT AGGATCTTTA GGAATCAATA ATATTTTAAG CTCTTCTTCA 6 7 BO 

AGATTTGGAA GTTCAGCTTT AATACCATTA CTCTCCTCTT TTAACATTTC TACTTCTTCT 684 0 

TTATCATCAG TCTCACTTAA CATTTCTTCA ATATCAGCTA ATTCTTCTTT TTTAGCTTTA 6 900 

15 

TAGTTACGAT AAACATCTAC AGTTTTTTGT AAATCAGCTT GCTCTTTAGA ATATTTACGT 6 960 

AATTTATCTG AATCATTTAC AACATCTGGG TCACTTAACA GTTCATTTAA CTGTTCGTAT 7020 

2Q CTTTCTTCTA CAATATCTAA TTGATCAAAC ACTTATAATT CCTCCTTATT ATTATCACTA 7030 

GGTGCTACGA TATGGTGCGC GCGACAACGT GGCTCATAAC TTTCATTGGC ACCTACTAAG 714 0 

ATAATCGGAT CATCGATTTT AGCTGGTTTA C CATTTATT A ATCGTTGCGT TCTACTAGAT 7 200 

25 GAAGAACCAC AAACAGCACA AACTGCTTGA AGTTTCGTTA CTTGTTCACT GACAGC CATC 72 60 

AATTTAGGCA TTGGTTCGAA CGGTTCGCCC CTAAAATCCA TATCTAATCC AGCAACAATA 7320 

ACACGGTGTC CATCTGCTGA TAGTTTTTCT ACTATACTTA CAATTTCATC GTCAAAAAAT 73 8 0 

30 TGCAcTTCGT CTATTC CTAT AACATCAACA TTAGTTAAGT CGTGCGTCAT AATTTCACTT 744 0 

GCTTTAGAAA TATTAATCGC TTCAATGGCA TTACCATTAT GAGAGACCAC TTTTTCTTTA 7500 

TGATATCGAT CATCAATCGC CGGTTTAAAT ACAACGACTT TTTGTTTAGC GTATATACCC 7560 

35 

CTTCTTAGAC GTCTTATTAG TTCTTCGGAT TTACCGCTAA A CAT ACT AC C TGTAATACAT 762 0 

TCTATCCAAC CGGAATGGTA AGTTTCATAC ATTGAGAGTn CCAC C TTTTT CAAAACATAA 76 80 

TCGCTTTATT ATATCATATT TCAAATATTC ATAAATGTCT TTnTCATAAT TATATCGATA 774 0 

40 

TTGTACATGA ACAATTATTT TA 77 62 
(2) INFORMATION FOR SEQ ID NO: 125: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25B3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 5: 
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GCACTTGGAG CAGCTAGAGG TTCATTTATG ATTTGTAATA ATTGTCTACC TACAGAATGG 46 80 

CTTGGCATTC TAACAGCAAC TGATGATAAA CCTCCAGAAA CTTTTCGACA TAGATAGCCT 4 74 0 

AGCTTTAACG GCAATATAAA CGAAATAGGG CCCGGCCAGA ATGCCTGCAT TAACTTTTCT 4 300 

ACGCGTGGAT CCAAAGTATA TGTAAAATCT TTTAATTGAC CTTTACTGTG TATATGAACA 4 86 0 

ATAAGCGGAT TGTCAGATGG ACGGCCTTTA GCTTCATATA TTTTAGCTAC AGCTTCTTCA 4 920 

TCTGTCGCAT TTGCTGCAAG TCCATAAACT GTTTCAGTTG GTAAACCTAT TAAACCACCG 4 98 0 

TTTAAAACAA TGTCTTTTAT TTCATTAATT TTAGGATATT GCTGTAAATC TTCATTATAT 504 0 

TCTCTAACAT CCCAAATTTT AGTATCCAAC TTAATCACGC CTTTCTTATT TATCATAATA 510 0 

TAAAGCAAAA AGCTATGCAC TTAACTAATC ATAGCAAAGG CATAACTTCT AATTACCATT 516 0 

TAAATGAGAC GATTCGATCG TGGCCATTTA TATCTTTAAT AATGTCGATT TTTTTGTCAG 522 0 

GAAATTTATT TAAAATTATT GATTTAAGTG CCTCACCTTG ATTGTAACCA ATTTCAAAAA 52 80 

CAACTGGGCT GCCTTTTTCC ATAACGTGAG GTAAATCTTC AATGATTGAT T CAT AAA TAG 534 0 

CATATCCATG GTTATCTGCA AACAATGCCT GATGTGGTTC GAATCTCGTA ACCGTTGGAG 54 0 0 

ACATCGTAAC CATATCTTTT TCATCTATAT ATGGTGGATT AGATATCAAG CCGTTCAACT 54 60 

TGATACCTTC ATTAATTAAG GGCTTTAATG CATCCCCTGT TAAAAATTGT ATTTGTGATT 552 0 

GATGCTTCTC AG CATTATT A CGAGCCATAT TCATTGCTTC AAGTGAAATA TCAGTAGCAA 558 0 

30 TAACATTTAA ATCCGGCTTT TCACATTTCA AAGTAATTGC AAGTACACCA CTACCCGTTC 564 0 

CGATATCTAC GATTGTTGCA TCATCTTCTA ACTGTTGTAA GAAATGCAAC ATTACTTCTT 57 0 0 

CAGTTTCAGG TCTTGGTATC AAACAATTTG AGTTTACATC AAACGTTCTA CCATAAAATG 5760 

AGGCAAAGCC AACTATATAC TGTATAGGCT CTCCTAATAA CATACGTTGT AATGCTAAGT 5820 

CGAACTTCAT AATCATCGCT TTCGGCATAT CATCATGCAT GTGGACTACA AAGTCCGTAC 58 80 

GCGTCCATTG AAATACATCT AACATTAACC ATTCAGCTCG TGTTTGTTCA AACCCTTTTT 594 0 

GTTGTGTTAA ATGAATTGCT TCATCTAACT TTTCTTTATA ATTCACCATT ATTAAGTTCT 6 00 0 

TTCAATTTAT CTGTCTGCTC TGATAAAGTC AG TG CAT CT A TAATTTCTTC TAAATGGCCT 6 06 0 

TCCATAATTT GCCCTAATTT TTGAAGCGTT AG AC CT AT AC GATGGTCTGT TACACGGCTT 612 0 

TGTGGATAAT TATAAGTTCG AATACGTTCT GAACGATCAC CAGTACCGAC TGCTGATTTA 618 0 

CGTTGTGACG CATACTTTTG TTGTTCTTCT TGAACTTTCA TATCGTATAA ACGTGCTTTT 624 0 

AACACTTTCA TTGCTTTTTC ACGGTTTTGA ATTTGAGACT TCTcAGAAGA TGTTGCAATG 6 3 00 

ACACCAGTTG GTAAATGGGT AATACGTACT GCAGAGTCAG TTGTGTTTAC GTGCTGACCA 6 3 60 
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TTGTGCTTCC ATAACCGCTT CCGATACAAA ATTTTCCGAT GCGATTAACT CTATGTTGCT 2880 

ATTTTGTCTC TGAAATTCTC TCTCGATTGC TTCTGCGATA ACTTTATCTT GCTTGGTGAT 2940 

ATAAGACATA AAATCTCCCC TTCTTTCAAA AAAACTTATT GGTATTTAGC ACGTTCGCCA 3 000 

CCAATCTTTT TCGGCCTAGA TGTGGCAATA GTTACAATTG CCTGTCCTAC TTG CTTTACT 3 060 

GAGGTCCTTA CAGGTACACA TACATGTTTA ATATGCATGC CTATTAACGT TTGACCAATA 3120 

TCAATTCCAC AAGGAACAGT AATATGTTCG ACCACGATCG GATCCTTCAT ATGCTGAAAA 3180 

GCGTATGTTG CCAAACTCCC TCCAGCATGT ACATCTGGAA CGACGGAAAC TTCTTCCATT 3240 

GTTAATGGAT TATACTGAGA TTTTTCTATT GTTATCGCTC TGTTGATATG TTCACATCCT 3300 

TGAAAAGCAA AAGTAACGCC TGTCTCTTTA CTCACAACAT CTAATGCATT AAAAATAGTT 3360 

TCTGCAACTT CCaTCGAACC GACAGTCCCT ATTTTTTCGC CAATGACTTC CGATGTTGAA 3420 

CATCCAATTA AACATATATC TCCTTTATTA AAAAAGGACA TATCTTTTAA TTCGTCTAAT 34 80 

AACATTGTCA AATCTTTCAT AAAAGCCCAC CCTTCCTAAA AATAAAAAAG GAATATAGCA 354 0 

AAGTGCTACA CTCCTCTATT ATAACTTATT TAACTGTTAA CATATACTAA TTATACAGAA 3600 

TTCCTACTAG CAAATAATAT CTTTTAATTT TAAAATTAAA CTTACAAGTT CTTCATAGGT 3660 

ATGTACATAC ATTTCTTTTG TTCCACCGTA TGGATCTATA ACTTCTCCTG CTTCTTTtAC 3720 

ATATTCATGC AATGTGAAAA CATGATTTTG CAAACCAAAG TGTGCCTCTA TTAATTCTTT 3780 

30 GTGCGAATAC GACATCGTCA AAATAATATC TGCTTTCAAA TCTGCTTCAG TAAATTGTTG 384 0 

CGATAAGGTC GTTTCAGCTA AATGATGTTC TTCAACTAAG TCTTCAACAT AATTCGAAAC 3900 

ACCTTGATTG TTCACAGCGA ATATACCTCT TGATTCAAAT TGATGATTTG GCATAACCTC 396 0 

TTTTGCAATA CTTTCCGCTA ATGGGCTACG ACATGTGTTA CCTGTACAAA CGAATAAAAT 4 020 

CTTCATAGTT CACATCCTTT AATAATGTGA TTACCTGCAG CTTTTAACAT GCGATTCATA 4 080 

ATTGCTTCTG T ATT AT CATT CAGCTCAAAG CCGTATATAT ACGCCGCTGA AATATTTTCA 414 0 

TTTTCATCAA GTGAATGTAA CACATCATAA AGATTATGAC TTGCTTGTTT AACATCATTG 4200 

TCATCCTGAC ATAATTGAAT GAATTGCGCT TCACTTGGTA TAAACGCCAC CTTATTACTC 4260 

GG CACAATAA AAGCTATAGA AGACCAATCT TTACCGTCAT TTCCAATTTT GCTCTCAATA 4320 

TCTGTAATAA TTGTAAGTGG TGTATTGGGT GAGTAATGCT TATACTTCAT ACCTGGTGCA 43 8 0 

ATTGGCTGTT CAGTATCATT ATAATCAGCA TGGGCGATAC TATTCGGAAG TATTTCTGTA 444 0 

50 ATCATTGCTG CTGTTATAGA ACCAGGT CTT GCAATTTTAT AAGGAAAAGA TGTGCAATCT 4 500 

AAAACCGTAC TTTCTAATCC TTCTTCACTT TGTTCAGCTT GAACAATACC ATCGATACGG 4 560 
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ATCTTTTTCA TCAAACTACT TATCTCCGAT 
CAGCGTCGCC TAACCCTGGT GTGATATATG 
5 AATATAAATA TCTACATCTG GATGTGCTTC 

AATTAAACAC ATGAAGCGAA TATTTTTAGC 
AATTGCTGAT GCGCCTGTTG CTAACATAGG 

w 

ATCTTGAGGT AACTTAGCAA AATACTCTAC 
ACCGATATGT CCAACTCTGG CTGCAGGTAC 
ACCAGCTCTT AAAATTGGAA CGATAGCTAA 

15 

TTTAGTTACA GGCGTTTCAA TATCAACATC 
TGCCATCAAC ATACCAACTT CGTCTACAAG 

2Q ATCTCTAATA TAGCTTAGTT TGTGTTGAAT 

CATAAAAATT ACTCCTATCT TTGTGTATGT 
TTTCGCAACG CGTTCTTTAG CTTGTTGTAA 

25 TAAACTGATG ATTTTTGCAA CTTCCTCAAA 

AGCAGGTGTA CCTAAACGTA TACCACTCGT 
GGTATTTTTG TTACATGTGA TACCAACTGA 

30 AAGTCCTATA GACCCTTTTA CATCAACAGC 

AATTCTAAAT CCTTCATTAA TTAATGCTTC 
TTGTTGATAC GTTTTGAAAT TATTTTCTAA 

35 

AATAACATGC TCAAGAGGTC CACCTTGAAT 
TTTATATTCT TCCTTACATA AAATCATACC 
TGTAGTTGTT ACAAAATCAG CATATTCTAC 

40 

TCCTGCAATA TGTGCCATGT CTACCATTAA 
AAACTTTTTG AAGTCAATTG TTCTTGAATA 
CTTATGCTCT AACGCTAATT TACGAACTTC 

45 

TACTCCATAT TCAACGAAAT TGTAGAATTT 
CAAGTGACCA CCATGACTCA AATTCATACC 
SO TAAGTAAACA GCCATGTTCG CTTGTGAACC 

TCCAAACAAT GCTTTAGCAC G ATCAATTG C 
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TCTTCTATTT AGTACCAAAC AATCTATCTC 10 8 0 

CTTTGTCATT aGCTTTTCAT CAAGTGCAGC 114 0 

ATGCATCTTT TCTACGCCTT CTGGTGCTGC 12 0 0 

GCCACGTTTC TTCAATGAAG TAATAGCTTC 12 6 0 

ATCAACAACA ATGATTTGTC TTTCAGTAAT 13 20 

AGCCTTTAAT GTTTCGGGAT CTCGATATAA 13 80 

TAAACTTAAA ATACCATCAG TCATACCTAA 144 0 

TTTTTTACCA GCTAATCGTT TAGCCGTCAT 15 00 

CTGAAGCTCT AAGTCTCTAG TTACTTCATA 15 60 

TTCTCTAAAT TCTTTAGTAC CTGTATTTAC 162 0 

TAATGGATGA TCGAAAACGT GTACTTTACT 16 8 0 

TTATTGATAT AGAGGATATT CAGCTGTTAA 174 0 

TTTTTCTTCA TCTTTACTAT TTTTCAATGC 18 00 

AGCTTTTTCA TCAAATCCAC GCGTTGTTGC 18 60 

TACAAAAGGT TTTTCTTGAT CGAACGGAAT 1920 

ATCTAAAGTC TCTTCAGCTT CTTTACCAGT 198 0 

TACTAAGTGA TTATCTGTAC CGCCAGAAAC 2 04 0 

TGCAAGAACT TTTGCGTTTT TAACCACTTG 2100 

CGCTTCTCCA AAAGCAACTG CTTTtGCTgC 216 0 

ACCAGGGAAA ATTGTTTTAT CTATGTCTTT 2220 

ACCACGtGGT CCGcGTAATG TTTTGTGTGT 2280 

TGGATTTGGA TGTAAACCTG CCGCTACTAA 234 0 

CTTAGCGTTT ACTTCATCTG CGATTTCTTT 24 0 0 

TGCTGATGCT CCTGCCACAA TAAGCTTAGG 24 6 0 

ATCATAATTG ATTCGTTCTG TGTCTTTATC 2 52 0 

ACCACTAAAA TTAACAGGCG CTCCATGTGT 2 5 80 

TAAAACTGTG TCGCCCATTT CTAATGCAAC 264 0 

TGAATGTGGT TGAACATTGA CATGTTCAGC 27 00 

GATGCTTTCA GTAACATCTA CAAACTCACA 27 6 0 
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15 



20 



GTATTTCAAA TATTAAACTA ACCCCTTCTA TCTAAAATTT AAGGTTAGTT TAATATTGTT 24 0 

ACATTCAAAA TTTCAAGATG ACGGAAATGT CATTTCTTAT GATGTCCTCT TCGTATTTTT 3 00 

TCAAATTCTG CAAGGATTTC AGAAGATAAC GGAATTCGAG TTCTTGGCTT GTTTTCACTT 360 

ATATCATCTA ATGATTTACT CACATCAATT TCATTTTCTT TTAAATCTCT CCACATTTCG 4 20 

CGAGATGATA TTCTATATGC ACCTGATCCA AAGATAGCAT GTTGcTCACT CaTATCACTT 4 80 

GTTACAACTG TAATATGcTT AGtATGCTTG tCaTAAAGtT CaTAAACCAT AACGGTTCTA 540 

ATGGAAACCA ATCAGCTG 558 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: I 
25 GCTTCAGACA TnTGATGATA TAATCTCTCA 

ACATnTTGCT TATTTGTTCC AATGACTTTT 
TCTGTAACAC TTCGCCAAAA CTAAAACTGG 
30 TCCACCTGAA TCTGTCAAAA TAAAATAAGA 

TCCAAAGGTT CAATCAATTC AATTCTGTCA 
CGAACTTTCG GGTTTTTATG CATTGGATAT 

35 

AAGCGTCTAA CCGCTTTAAA TATATTTTCC 
GCTGTCATrA GAATGAATTT kTtGTCATGG 
TCAT CAACTG TATATTTCAT AGCATCAATC 

40 

GAATATTTCC CTTCACTTAA CAAATGCGAT 
TCAGCTAATA CACCAACTAA TTGTCTATTC 

45 TAACTTCTAA GCCCTGCTTC AACGTGTCCA 

CCACCTGCAA ATGTCGTCAT CGTATCACCA 
TGAATCACTT GTTCTAATTG AGTGATTGAT 

50 GATTTCATAA TATTCAAATC GTATTTTGGT 

AGCATTTCTC TATGCTGTGC TGTAACAACA 

55 



!EQ ID NO: 124 : 

TCATCGATTA ATTCTTTTGC AGCTTGATAC 60 

AATGTGCCAG CTTCAACACC TTCAGGACGT 120 

CTTATTAAAT GATGGCGCTT CTTCCTGAAT 180 

TTTTnTAGCA AAATTATGGA AATCTATACG 240 

TGACTACCTA AAATCTTTTG AGCCACCTCT 3 00 

ACCAGTGCTA AATCAGTATA CTCATCTATT 360 

aTGGGTTTCC CGATATTTTC TCGTCGGTGT 4 20 

TATTTATCCA TGATGTTAGA TTTATAATTG 4 80 

GCAGTATTAC CAGTGACAAC AACACTTTCT 54 0 

GCAGCATTTT TAGTAGGTGC AAAATGTAAG 600 

ACCTCTTCTG GAAAAGGTGA ATATTTATCA 660 

ATCGGCACTT GGTTATAAAA TGCCGCTAAA 720 

TGTACAAGTA CCATGTCTGG TTTTTCTAAT 780 

TTAGAAGTTA TCTCAGAAAG TGTCTGTCCT 840 

TTGATTTCAA AGGTACTTAA TACTGAATCA 900 

ACAATTGGCT CGAGCATTTT TTCTTGTTCC 960 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

ATGAATATAT TTnnAAATAA ATTATTATGG ATTGCACCAA TnGCCACTAT GATTATCTTG 6 0 

5 

GTAATCTTTT CTTTAGCTTT TTATCCTGCA TATAATCCTA AACCAAAAGA TTTACCAATT 12 0 

GG TAT ATT AA ACGAGGATAA AGGTACAACG ATTCAAGATA AAAATGTTAA CATTGGTAAA 180 

AAATTAGAGG ATAAATTATT AGATAGTGAT TCTAATAAAA TTAAATGGGT TAAGGTTGAT 24 0 

w 

AGTGAAAAAG ACCTTGAAAA AGATTTGAAA GATCAAAAAA TCTTTGGAGT AG CT ATT ATT 3 00 

GATAAAGACT TTTCAAAAGA TGCTATGAGT AAAACACAAA AAGTAGTTAT GGATAGTAAA 360 

15 AAAGAAGAAA TGCAACAAAA AGTTGCTTCA GGTGAAATTC CGCCACAAGT GGTTCAACAA 420 

ATGAAACAAA AAATGGGGAA TCAACAAGTA GAGGTTAAGC AGGCTAAATT TAAAACGATT 4 80 

GTAAGTGAAG GATCAAGCTT ACAAGGTTCA CAAATTGCAT CAGCTGTGTT AACTGGTATG 54 0 

20 GGTGATAATA TTAATGCTCA AATTACGAAG CAAAGTTTGG AAACATTAAC GAGTCAAAAT 6 00 

GTTAAAGTCA ATGCCGCGGA CATCAATGGT TTGACGAATC CAGTAAAAGT GGATAATGAA 660 

AAACTTAATA AAGTTAAAGA TCACCAAGCA GGTGGTAATG CACCATTCCT AATGTTTATG 72 0 

25 CCAATTTGGA TAGGTTCAAT CGTAACGTCT ATCTTATTGT TCTTTGCATT TAGAACTAGT 78 0 

AACAATATCG TCGTGCAACA TCGTATCaTT GCtTCAATTG GACAGATGAT ATTTGCAGTT 84 0 

GTTGCAGCAT TTGCAGGTAG CTTTGTTTAT ATTTATTTCA TGCAAGG CGT TCAAAGATTT 90 0 

30 

GATTTTGACC ATCCAAATCG TATCGCAATT TTTGTAGCAT TTGCGATTCT TGGTTT CGTG 96 0 

GGCCTTATTT TAGGTGTTAT GGTATGGCTA GGTATGAAGT CAGTTCCAAT TTTCTT CATT 102 0 

TTAATGTTCT TTAGTATGCA ACTTGTAACG TTACCTAAAC AAATGTTGCC TGAAAGTTAT 108 0 

35 

CAAAAATATG TATATGATTG GAATCCATTC ACACACTATG CAACAAGTGT AAGAGAcTAT 114 0 

TATACTTGAA TCATCATATT GAATTAAATA GTACAATGTG GATGTTTATA GGGT 1194 
40 (2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 558 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

50 

GACCGACCTA TACATCCGTA TAAGTATTTC TTGATATAAG TCTTCTAAAT CATAATGATT 6 0 
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40 



GTATTTTAAT 


TGCTGGAAGT 


TCGTTGGCTT 


TAGCAGGCTT 


GATAATGCAA 


CAAATGATGC 


6240 


AAAATAAGTT 


TGTTAGTCCG 


ACTACAGCTG 


GAACGATGGA 


ATGGGCTAAA 


CTAGGTATTT 


6300 


TAATTGCTTT 


ATTGTTCTTT 


CCAACCGGTC 


ATATTTTATT 


AAAACTAGTA 


TTTGCTGTTA 


6360 


TTTGCAGTAT 


TTGCGGTACG 


TTTTTATTTG 


TTAAAATCAT 


TGATTTTATA 


AAAGTGAAAG 


6420 


ATGTCATTTT 


TGTACCGCTT 


TTAGGAATTA 


TGATGGGTGG 


GATTGTTGCA AGTTcACAAC 


6480 


CTTCATCTCA 


TTGCGCACGA 


ATGCTGTTCA 


AAGCATTGGT 


AACTGGCTTA 


ACGGGAACTT 


6540 


TGCCATTATC 


ACAAGTGGAC 


GCTATGAAAT 


TTTATATTTA AGTATTCCTC 


TTTTAGCATT 


6600 


GACATATCTT 


TTTGCTAATC 


ATTTCACGAT 


TGTAGGAATG 


GGTAAAGACT 


TTACTAATAA 


6660 


TTTAGGTTTG 


AGTTACGAAA 


AATTAATTAA 


CATCGCATTG 


TTTATTACTG 


CAACTATTAC 


6720 


AGCATTGGTA 


GTGGTGACTG 


TTGGAACATT 


ACCGTTCTTA 


GGACTAGTAA 


TACCAAATAT 


6780 


TATTTCAATT 


TATCGAGGTG 


ATCATTTGAA 


AAATGCTATC 


CCTCATACGA 


TGATGTTAGG 


6840 


TGCCATCTTT 


GTATTATTTT 


CTGATATAGT 


TGGCAGAATT 


GTTGTTTATC 


CATATGAAAT 


6900 


AAATATTGGT 


TTAACAATAG 


GTGTATTTGG 


AACAATCATT 


TTCCTTATCT 


TGCTTATGAA 


6960 


AGGTAGGAAA 


AATTATGCGC 


aACAATAATA 


AAAAAATAAT 


GCTTTTAATT 


GCAGTAACGT 


7020 


TATTAATTAG 


TATGCTGTAC 


TTATTTGTAG 


GTATTGATTT 


TGAAATATTT 


GAATATCAAT 


7080 


TTTCAAGTCG 


TTTAAGAAAG 


TT CAT ATT AA 


TTATTTTAGT 


AGGTGCTGCC 


ATTGCAACTT 


7140 


CAGTGGTGAT 


TTTTCAAGCG 


ATTACAAATA 


ACCGTCTATT 


GACACCATCA 


ATAATGGGGT 


7200 


TAGATGCAGT 


TTATTTATTT 


AT CAAAGTAT 


TGCCAGTCTT 


TTTATTTGGA 


ATTCAATCGG 


7260 


TATGGGTTAC 


TAATGTATAT 


TTGAACTTTA 


TATTAACACT 


TATAACGATG 


GTGTTATTCG 


7320 


CACTAATCCT 


ATTCCAAGGT 


ATCTTTAAAA 


TCGGACATTT 


TTCAATTTAT 


TTTATCTTAC 


7380 


TTATTGGTGT CCTTTTAGGA 


ACATTTTTTA 


GAAGCATAAC 


AGGTTTTATT 


CAACTGATTA 


7440 


TGGATCCTGA 


GTCATTTTTA 


GCAATACAAA 


GTAGTATGTT 


TGCTAATTTT 


AATGCTTCTA 


7500 


ATTCGAATTT 


AGTTACTTTC 


TCAGCAGTGC 


TATTAGTAAT 


CTTATTAGTC 


ATTACAATTT 


7560 


TACTATTGCC 


TTATTTAGAT 


GTATTGCTTT 


TAGGTCGTGC 


TGAAGCAATT 


AATCTTGGGA 


7620 


TATCGTATGA AAAATTAACG 


CGAATT 








7646 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATGTTTTGGA 


GACAAAATAT 


ATCTCAAATG 


TGCGTTGAAA 


CAGAATTTAA 


AGTATCAAAA 


4440 


GACATTGCAA 


GTTGGAAGAC 


TTTATCTGAA 


GCTGAACAAG 


ACACATTTAA 


AAAAGCATTA 


4500 


GCTGGTTTAA 


CAGGCTTAGA 


TACACATCAA 


GCAGATGATG 


GCATGCCTTT 


AGTTATGCTA 


4560 


CATACGACTG 


ACTTAAGGAA 


AAAAGCAGTT 


TATTCATTTA 


TGGCGATGAT 


GGAGCAAATA 


4620 


CACGCGAAAA 


GCTATTCACA 


TATTTTCACA 


ACACTATTAC 


CATCTAGTGA 


AaCAAACTAC 


4680 


CTATTAGATG 


AATGGGTTTT 


AGAGGAACCC 


CATTTAAAAT 


ATAAATCTGA 


TAAAATTGTT 


4740 


GCTAATTATC 


ACAAACTTTG 


GGGTAAAGAA 


GCTTCGATAT 


ACGACCAATA 


TATGGCCAGA 


4800 


GTTACGAGTG 


TATTTTTAGA 


AACATTCTTA 


TTCTTCTCAG 


GTTTCTATTA 


TCCACTATAT 


4860 


CTTGCTGGTC 


AAGGGAAAAT 


GACGACATCA 


GGTGAAATCA 


TTCGTAAAAT 


TCTTTTAGAT 


4920 


GAATCTATTC 


ATGGTGTATT 


TACCGGTTTA 


GATGCACAGC 


ATTTACGAAA 


TGAACTATCT 


4980 


GAAAGTGAGA 


AACAAAAAGC 


AGATCAAGAA 


ATGTATAAAT 


TGCTAAATGA 


CTTGTATTTA 


5040 


AATGAAGAGT 


CATACACAAA 


AATGTTATAC 


GATGATCTTG 


GAATCACTGA 


AGATGTGCTA 


5100 


AACTATGTTA 


AATATAATGG 


AAACAAAGCA 


CTTTCAAACT 


TAGGCTTTGa 


ACCTTATTTT 


5160 


GAGGAACGTG 


AATTTAACCC 


AATCATTGAG 


AATGCCTTAG 


ATACAACAAC 


TAAAAACCAT 


5220 


GACTTCTTCT 


CAGTAAAAGG 


TGATGGTTAT 


GTATTAGCAT 


TAAACGTAGA 


AGCATTACAA 


5280 


GATGATGACT 


TTGTATTTGA 


CAACAAATAA 


CAATTAAATT 


AAAAGACCTT 


CACATGTAAA 


5340 


GGGAAATAGC 


GATTCGTTTC 


GTCTTGTCTC 


CTACATGTTG 


AAGGTCTTTT 


TTTATGTGTA 


5400 


TCTAACTCAT 


TATGAGTCTG 


AGTAAGAAAT 


CAATGCTCTA 


AGATGTACAA 


TGCTATTTAT 


5460 


ATTGGCAGTA 


GTTGGCGGGG 


CCCCAACACA 


GAAGCAGGCG 


GAAAGTCAGC 


TAACAATATT 


5520 


GTGCAAGTTG 


GCGGGGCCCC 


AACATAGAAG 


CAGGCGGAAA 


GTCAGCTAAC 


AATAATGTGC 


5580 


AAGTTGGCGG 


GGCCCCAACA 


TAAAAGCAGG 


CGGAAAGTCA 


GCTAACAATA 


TTGTGCAAGT 


5640 


TCGGgCGGGG 


CCCCAACATA 


AAGAAAAACT 


TTTTCCTTTA 


GAAATTATCA 


CTTCCaCaTG 


5700 


AGTTTTACTC 


ATGTATTCCT 


ATTTTTAAGT 


ACACATTAGC 


TGAGGCTAAT 


GTTAAGAACC 


5760 


ACTACTTAAT 


CAATCATTAG 


TAGTTTTTAT 


CATTTCCACT 


ATTCCCaGAC 


ATCaAAATCT 


5820 


TAAGTGTTCT 


ATTTTACTTT 


AAGTAAACAA 


AATACACATT 


CCGAAAAATT 


AAATTTCAGT 


5880 


TTAATTGCAA 


ATATCAATAA 


AATTGACACT 


AAATTATTTG 


AAAGGCTATT 


GAAATTATGG 


5940 


TCAAAAAACG 


CTACTATTAA 


TGAGAAATAT 


TATCAATGAT 


AATGATTATC 


ATTAATTTAA 


60CO 


AGGGAGAAAA 


ATTTGTAATG 


AAGTATTTAT 


TAAAGGGAAA 


TATTTTGCTT 


CTATTACTAA 


6060 
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ACATTTTTAA ACGCAGGCCG TGCGCGTCGT 
GTGGATGACA GCTTAAATTC AATTAACTTT 

5 

ATTGGGGGCG GCGTTGCAAT TAACTTATCT 
GGAATTAAAG GCGTAgCGAA AGGCGTTTTA 
AG CT ATGCAG ATCAACTTGG TCAACGCCCT 

10 

CATTATGATG TAGAAGAATT TTTAGATACT 
TTATCTACAA TATCAACTGG TTTAATTGTT 

15 GGTAAGGACT TTTATATGTT TGCACCTCAT 

GACGATATCG ATTTAGAAAA ATATTATGAT 
AAGAAAAAGA ATGCGCGTGA AATGTTGAAT 

20 TATCCATATT TAATGTTTAA AGATAATGCT 

CAAATTAAAA TGAGTAACTT ATGTACGGAA 
AATGACTATG GTATTGAAGA CGAAATTAAA 

25 AATATTGTTA ATGTAATGGA AAGCGGAAAA 

GCATTAACTG TTGTGAGTGA TGTAGCAAAT 
AACAGTGAAT TACATTCAGT TGGTCTTGGT 

30 

AATAAAATTG GTTATGAGTC AGAAGAAGCA 
ATGAATTTCT ACTCAATCGA ACGTTCAATG 
CAAGACTTTG AAAAGTCTGA TTATGCTAAT 

35 

GAATTTGAAC CTCAATTCGA AAAAGTACGT 
TCTGAGGATT GGAAGAAACT ACAACAAGAT 

40 AGATTAGCAA TTGCTCCAAC ACAAAGTATT 

ATGCCAATCG TTGACCAAAT TGAACGTCGT 
ATGCCATTCT TATCACCACA AACAATGTGG 

45 ATGAAATTAA TCGATTTAAT TGCGACAATT 

ATCCTTTATG TTAATTCTGA AATTTCTACA 
CACTATAAAG GATTAAAATC ACTTTACTAT 

50 

TGTACAAGTT GTTCTATCTA ACAATTAAAT 
TCTGAATTAG CAGATGATTA GACTGCTATG 



GGTGAGCTAG TGTCATGTTT CTTATTAGAA 2 640 

ATTGATTCAA CTGCAAAACA ATTAAGTAAA 2700 

AAATTGCGTG CACGTGGTGA AGCAATTAAA 2760 

CCTATTGCTA AGTCACTTGA AGGTGGCTTT 2 820 

GGTGCTGGTG CTGTGTACTT AAATATCTTC 2 8 80 

AAAAAAGTAA ATGCGGATGA AGATTTACGT 2 94 0 

CCATCTAAAT TCTTCGATTT AGCTAAAGAA 3000 

ACAGTTAAAG AAGAATATGG TGTGACATTA 3 060 

GACATGGTTG CAAACCCAAA TGTTGAGAAA 3120 

TTAATTGCGC AAACACAATT ACAATCAGGT 3180 

AACAGAGTGC ATCCGAATTC AAACATTGGA 3240 

ATTTTCCAAC TACAAGAAAC TTCAATTATT 3 3 00 

CGTGATATTT CTTGTAACTT GGGCTCATTA 3 36 0 

TTCAGAGATT CAGTTCACTC TGGTATGGAC 3420 

ATTCAAAATG CACCAGGAGT TAGAAAAGCT 34 80 

GTGATGAATT TACACGGTTA CCTAGCAAAA 3 54 0 

AAAGATTTTG CAAATATCTT CTTTATGATG 3 600 

GAAATCGCTA AAGAGCGTGG TATCAAATAT 3 660 

GGCAAATATT TCGAGTTCTA TACAACTCAA 3 72 0 

GAATTATTCG ATGGTATGGC TATTCCTACT 3 780 

GTTGAACAAT ATGGTTTATA TCATGCATAT 3 84 0 

TCTTATGTTC AAAATGCAAC AAGTTCTGTA 3 900 

ACTTATGGTA ATGCGGAAAC ATTTTACCCT 3 960 

TACTACAAAT CAGCATTCAA TACTGATCAG 4 020 

CAAACGCATA TTGACCAAGG TATCTCAACG 4 080 

CGTGAGTTAG CAAGATTATA TGTATATGCG 414 0 

ACTAGAAATA AATTATTAAG TGTAGAAGAA 4200 

GTTGAAAATG ACAAACAGCT AATCATCTGG 4260 

TCTGTATTTG TCAATTATTG AGTAACATTA 4320 



55 
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TGGCTGATGG TATTTATATA TAAAAAATAA TGGAATAAAC ATTGCTACTA AGTTTCGTGC 84 0 

TAATGATTTT TGAAAAACAG GAAGGTCACC TGCAAGTCTG AAAAACACTG ACATAAAACT 90 0 

5 

GAAACCAATA GCCGAAATTA AAATGGCAAT GATACCTTTT ACTTTAGGAT TCAATTTTAT 96 0 

CGCCTCTTTT ATATAAAATT AACGTATTTA TATTAGCATA AAACAACATG TTGTGCATAA 102 0 

ATAGTTGAAA TTTACTATAA AAAGACTATA ATAGACTGTA GCGAACAAAC GTTCTGTGTT 10 80 

TATTTGTCGG AATAATAGGG CATTACACTT TTATGAATGT TTGTGTTATT ACATAAAACA 114 0 

AATATCAATT CAGTATCAAG CTAATAAGCT TTTTCTTGAT TTCTGTTGAT ACAATTGAGA 12 00 

75 TTGACACAGA TTTAAAAAAA TCAAGTGATA TCTACTAAAA AATTTTTTTA AATTTGTTCA 12 60 

AGTTTTTCTA ATTTAGTATT GGTGCCTAGT TGGAACGTTT TACGAACATT CGATTAGAAA 1320 

ATGGCACTTT AAATCATAGT GTGTCTTATG TATAATGAAA CACATAATAT AGTGTTGGTG 13 8 0 

20 AAACGAAAAA gACACAATAT CTTGTGTTTT GTATGCAAAT GCTTTATTTA TGAAGAAATT 144 0 

ACATTTAAAA GTAATTTAAC ACAGAAATTT AATAGTTATT ATCAATTAAT AGTCATATTT 1500 

TTAGAAAATG TACTGAGCAA ATGGAAGATA TCCAATGATG TAAACACTAC ATATAGTGAT 156 0 

25 

TTTTATACAT TCAACCCATA TAAGCTACTA TTTTCTCAAA TATAAATCTA TGCAATTGGT 162 0 

TTACATTTGA GAAAATAAGT AGCTTCATTA TAGTTAATAC AATGCTGAGA TAACCATAGT 16 8 0 

AACCATGTTG TTAAAGCATT TTTTAATTGG AATGACTACT TTATTTAAAA GGGTTGAAGA 174 0 

30 

AAGAAGGTGA TCCAATGAAA ATAATATATT TTTCATTTAC TGGAAATGTC CGTCGTTTTA 18 00 

TTAAGAGAAC AGAAC7TGAA AATACGCTTG AGATTACAGC AGAAAATTGT ATGGAACCAG 18 60 

35 TTCATGAACC GTTTATTATC GTTACTGGCA CTATTGGATT TGGAGAAGTA CCAGAACCCG 192 0 

TTCAATCTTT TTTAGAAGTT AATCATCAAT ACATCAGAGG TGTGGCAGCT AGCGGTAATC 198 0 

GAAATTGGGG ACTAAATTTC GCAAAAGCGG GTCGCACGAT ATCAGAAGAG TATAATGTCC 2 04 0 

40 CTTTATTAAT GAAGTTTGAG TTACATGGAA AAAACAAAGA CGTTATTGAA TTTAAGAACA 2100 

AGGTGGGTAA TTTTAATGAA AACCATGGAA GAGAAAAAGT ACAATCATAT TGAATTAAAT 216 0 

AATGAGGTCA CTAAACGAaG AGAAGATGGA TTCTTTAGTT TAGAAAAAGA CCAAGAAGCT 22 2 0 

4i> TTAGTAGCTT ATTTAGAAGA AGTAAAAGAC AAAACAATCT TCTTCGACAC TGAAATCGAG 22 8 0 

CGTTTACGTT ATTTAGTAGA CAACGATTTT TATTTCAATG TGTTTGATAT TTATAGTGAA 234 0 

GCGGATCTAA TTGAAATCAC TGATTATGCA AAATCAATCC CGTTTAATTT TGCAAGTTAT 24 0 0 

50 

ATGTCAGCTA GTAAATTTTT CAAAGATTAC GCTTTGAAAA CAAATGATAA AAGTCAATAC 24 6 0 
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TCAACGTGGT AATCCAATGG ACTATGAAGG CTGGGCAGAA CCAGAAGGTA TGGAAACTTG 12 960 

GGATTTTGCG CACTGTTTAC CGTATTTTAA AAAATTAGAA AAAACATACG GTGCAGCGCC 13 020 

TTATGATAAA TTTAGAGGCC ATGATGGACC AATTAAGTTA AAACGAGGGC CAGCAACGAA 13 080 

TCCTTTATTC CAGTCATTCT TTGATGCAGG TGTTGAAGCA GGCTATCATA AAACACCTGA 1314 0 

TGTGAATGGA TTTAGACAAG AAGGTTTTGG ACCGTTCGAT AGTCAAGTAC ATCGTGGTCG 13200 

CCGAATGTCA GCTTCAAGAG CATATTTACA TCCAGCGATG AAGCGTAAAA ACTTAACCGT 13260 

TGAAACACGT GCCTTTGTAA CTGAAATTCA TTATGAAGGT AGAAGAGCAA CTGGTGTTAC 13 320 

GTATAAGAAA AATGGCAAAC T A CAT AC CAT CGATGCTAAT GAAGTCATTT TGTCTGGTGG 13380 

GGCATTCAAT ACGCCACAAT TACTACAATT ATCTGGTATC GGTGATTCAG AGTTC CTAAA 1344 0 

ATCAAAAGGC ATTGAGCCAC GTGTTCATTT ACCTGGTGTG GGTGAAAACT TTGAAGATCA 13500 

20 CTTAGAGG 13508 

(2) INFORMATION FOR SEQ ID NO: 121: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi} SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GTAAGTATTG TCTTGATTTC CTAATAAAGT TATATCTTGT AATTCATCTT GTTGACGGCC 6 0 

ATGTGCCATA TAAAGCGCTC CTTTAAATTT ATTTTTTTAT TATTTTGGCG TCTCGGCGTG 120 

CTTTTTCAAA CATGTAATAA CTTGCACCGA TAATAACGAC GTAAC CTAAT GTTGCATAGA 180 

AATC^GGAGA TTCTCCGAAT AGAATAAATC CAAGTATTGC TGTGAAAATT ATAGATGCAT 24 0 

ACGTAAAAAT AGAAATATCT TTTGCTGCTG CAAAACTATA TGCTAAAGTA ACACCAATTT 3 00 

GACCCACAGC GGCAgCTAAG CCAGCCCCTA ATAGATAAAG TATTTGCATC TGACTCATTG 3 60 

GTTCATAAGT ATATGCAGTG AAAGGTATTA AAACGATGAC AGAAAATAAG GAGAAGTAAA 420 

ATACTATAGT ATATGGTGCT TyTCTTGTAC TAAGTGCTCG AACACATGTA TATGCTGATG 4 80 

CTGCAAAAAT AC CTGAG AAT AAGCCAGCTA ATGATGGAAT CATAGATGAT GAAAATTCAG 54 0 

GTTTCACTAT TAAnAGCAaC CTAAAATAGC AATTATCATT GCTGTAATTT GaTACTTCCT 60 0 

TACCTTTTCA TGtAAGAaaA CAATGCTTaA TAAAATCGTC CAGAAAGGAT TGAGTTTCAT 66 0 

TAATGAATCG GCATCACTAA GTACCATATG ATCAATGGCA TAAATATTTA ACAATACACC 720 
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AACATCgCGA AgCgTTAGCA CG ATT AG AAA 


CATTAGATAC 


TGGAAAAACG 


TTAGAAGAAT 


11160 


CATATGCAGA 


TATGGATGAT 


ATTCATAATG 


TGTTTATGTA 


TTTTGCTGGA 


TTAGCAGATA 


11220 


AAGACGGTGG 


CGAAATGATT 


GATTCACCAA 


TTCCAGATAC 


AGAAAGCAAA 


ATTGTTAAAG 


11280 


AACCAGTAGG 


TGTAGTTACA 


CAAATTACAC 


CTTGGAATTA 


TCCGTTATTA 


CAAGCATCAT 


11340 


GGAAAATTGC 


GCCAGCGCTT 


GCTACGGGTT 


GTTCACTAGT 


TATGAAACCA 


AGTGAAATTA 


11400 


CACCATTAAC 


AACAATACGT 


GTTTTTGAAT 


TAATGGAAGA 


AGTTGGTTTC 


CCTAAAGGAA 


11460 


CAATTAATCT 


TATTCTAGGT 


GCAGGTTCTG 


AAGTTGGTGA 


CGTAATGTCA 


GGTCATAAAG 


11520 


AGGTTGACCT 


TGTATCATTT 


ACAGGTGGCA 


TTGAGACTGG 


TAAGCATATT 


ATGAAAAATG 


11580 


CTGCTAATAA 


TGTTACGAAT 


ATTGCCTTGG 


AACTTGGCGG 


TAAAAATCCA 


AACATTATCT 


11640 


TTGATGATGC 


TGATTTTGAA 


TTGGCAGTAG 


ACCAAGCGTT 


AAATGGTGGA 


TATTTCCATG 


11700 


CAGGTCAAGT 


TTGTTCAGCA 


GGATCAAGAA 


TATTAGTACA 


AAACAGTATT 


AAAGACAAAT 


11760 


TTGAGCAAGC 


ACTTATTGAT 


CGCGTGAAAA 


AAATCAAATT 


AGGTAATGGT 


TTTGATGCTG 


11820 


ATACTGAAAT 


GGGACCAGTG 


ATTTCAACAG 


AACATCGTAA 


TAAGATCGAA 


TCTTATATGG 


11380 


ATGTAGcTAA 


AGCAGAAGGC 


GCAACAATTG 


CTGTTGGTGG 


TAAACGTCCA 


GATAGAGATG 


11940 


ATTTAAAAGA 


TGGTCTATTC 


TTCGAGCCAA 


CAGTCATTAC 


AAATTGTGAT 


ACGTCAATGC 


12000 


GTATTGTACA 


AGAAGAGGTT 


TTCGGACCTG 


TCGTTACTGT 


AGAAGGCTTT 


GAAACTGAAC 


12060 


AAGAAGCGAT 


TCAATT AG CG 


AATGATTCTA 


TATATGGTTT 


AGCAGGTGCT 


GTATTTT CTA 


1212C 


AAGATATTGG 


AAAAGCACAA 


CGCGTTGCTA 


ACAAGTTGAA 


ACTTGGAACG 


GTGTGGATTA 


12180 


ATGATTTCCA 


TCCATATTTT 


GCACAAGCGC 


CATGGGGTGG 


ATACAAACAA 


TCAGGTATCG 


12240 


GTAGAGAATT 


AGGCAAAGAA 


GGCTTAGAAG 


AGTACCTTGT 


TTCAAAACAC 


ATTTTAACAA 


12300 


ATACAAATCC 


ACAATTAGTG 


AATTGGTTTA 


GCAAATAAAA 


ATTAGATAAG 


GTGAGTGCCA 


12360 


TTGTAAGAAC 


ACAAGACACT 


CACTTTGTTT 


TGTATAAGTG 


GCGAAATGTT 


GATTGATAAT 


12420 


TTGGACTAAA 


CG CAAAATGA 


ATCATAGATT 


ATTTCATTAC 


TGTTAGTAAC 


AATCGTAAAA 


12480 


GGAAAAGCGA 


GTGTTTTGGT 


TAGCTAAGTT 


TAGCAATTCA 


ACGATAACCA 


ATCAGCCACT 


12540 


AACAAATATT 


TCATGCAATA 


CTCACTTTGA 


AATACAACAA 


A CTTTGG AGG 


TCATAACGAT 


12 6 0 0 


GAGTAACAAA 


AACAAATCAT 


ATGATTATGT 


CATCATTGGA 


GGAGGCAGTG 


CAGGTTCTGT 


12660 


ACTAGGTAAT 


CGTCTGAGTG 


AAGATAAAGA 


TAAAGAAGTC 


TTAGTATTAG 


AAGCGGGTCG 


12720 


CAGTGATTAT 


TTTTGGGATT 


TATTTATCCA 


AATGCCTGCT 


GCGTTAATGT 


TCCCTTCAGG 


12780 
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ACAGAATATT GTTATCTATC ATTCCGGACA CTTAGGTGAC TCCCAACAAG ACATTGCATC 93 6 0 

ATTAGGTGGT GTTTCAAAAG TATTGATGAA TCATGATCAT GAATCTATAG GAGGTTCTAA 94 2 0 

TCAAGTTGAA GCCCCTTACT TTATACATGA AAATGATGTG GCTGCACTGA AACATAAGAT 94 80 

TTCTGTTCAA AAACAATTTA GTAATCGTGT AATGTTGGAT AAGGATTTAG AAGTTATTCC 954 0 

CGCGCCTGGA CATACACCAG GGACGACACT ATTTTTATGG GATGATGGTC ATCACCGTTA 96 00 

CTTATTTACT GGAGATTTTA TATGTTTTGA AGGGAAGAGA TGGCGTACAG TTATATTAGG 9660 

TTCAAGTGAT AGAGAAAAAT CTATTCAAAG TTTAGAGATG GTTAAAGAAT TAGATTTTGA 9720 

TGTACTTGTA CCTTGGGTTA CTATCAAAGA TGAACCGTTA GTTTATTTTG TAGAAAATGA 9780 

ATATGAAAAA CGTGAACAAA TACAAAATAT TATTGATAGA GTACGTGAGG GCGAGAATAG 984 0 

CTAATTGAAA TATATTGGCG AAgCAATGTA ACGAATCTAA GAAAGCCCTA GAAAATACCT 9900 

20 CCATAATTGA TTGTCATATA AAACAAAAAC GGTAATTTCT ATTTATTGAG ATAGAAATTA 9960 

CCGTTTATTT CGTGGACCTA TTGCATTGTT TTTATCATGC ATAATCATCA TTGTCGTTGT 10020 

TTGAGTCAAT TTTAATTTTC AGAATCAGAA GGCTGTTCTG GAATTGGGAA ATATTTGAAA 10080 

ATTTCACCGC TTTCAATCGC TTCGGTTAAC TGTTCTAACC ATT CGTAATA AACATGTGTA 1014 0 

TGATCAAGCT GAGCTTTAAT TTTTTGTGCC TCTTGTGTTT CAGCTTCAGT TAAATCACTG 10200 

CTTTCAAGTA ATGGATTGAT AATAGCTTGA GCATCTTTTA CTGCTTCGAC ATTGATGTCA 10260 

ATTTCACGCT GGAATTTTTT AGTGAAAAAG TTTCGGAAAA AGATGAAAAA GTCTTTCTCG 10320 

GCGATAAAAT GTTGTTTGCG GCTTCCTCTC GTAAATTGTT GTTTAACAAT ATCAAATTCC 103 80 

TGCAATTTCT TAACGCCAGC ACTCATACTT GGTTTGCTCA TTTGCAATTG ATGACGCATT 1044 0 

TCATCAAGCG TCATACTGCC TTCAAACACC ATTGTGCCAT ATAAGTTTCC TACACTTCTA 10500 

TTAGTGCCAT ACAAATCCAT TGTCTGTCCA ATTGAATTAA TTACAATATC TTTTGCTTGT 10560 

TCTAATTGTT GCTGTTTGTT CTGAGAACGA GTCATCATTG CACCTCCGTA CATCATTTTG 1062 0 

GTCACGTTAA AATAAATACT AATACATTAT AAAACCTTTT CTAAAAAAAG ACATTAAAAA 106 8 0 

TATTTAAAGC ATTAAAGTTA AATGTTTCGT TAAATAAAAA TCTAACGAAC TTACAAAACT 10740 

45 TAATTCTTGA GTTGTTTTGT AAATTGACAC ATTTTTCATT TCTATGCTAA CATAAGTnTG 10 800 

TAAAATTcGT TAAATAAAAA TTTAACAAAC TTAACGGrGG TTGTTGAAkG GrACTTTTAA 10 860 

aACATTTATC TCAGCGTCAA TATATTGATG GTGAGTGGGT TGAAAGCGCG AATAAAAATA 10 920 

50 

CAAGAGATAT TATCAATCCT TACAATCAAG AAGTGATATT TACGGTTTCT GAAGGGACAA 10980 

AAGAGGATGC AGAACGTGCA ATCTTAGCTG CAAGACGTGC GTTTGAGTCT GGTGAATGGT 11040 
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TTGGTAAAGC AAGTGATAAA CCAGAATTTA ATACATTTAC ATGGGCGGCA ATGCTGTTTT 7 56 0 

GTGCAGGCAT AGGCTCTGAT ATTTTATACT GGGGCGTTAT TGAATGGGCT TTTTACTATC 7 62 0 

5 

AAGTTCCACC AAATGGCGCG AAAAGTATGA GTGATGAAGC ACTCCAATAT GCGACGCAAT 76 8 0 

ATGGTATGTT CCACTGGGGG CCAATTGCTT GGGCTATTTA TGTTCTACCA GCATTACCAA 774 0 

TTGGTTATTT AGTATTTGTT AAAAAACAAC CGGTGTATAA AATTAGTCAA GCTTGTCGTC 7800 

10 

CGATTTTAAA AGGTCAAACA GATAAATTTG TAGGTAAAGT TGTAGATATC TTATTTATCT 7 86 0 

TTGGATTGCT AGGTGGTGCG GCAACATCAC TAGCGTTAGG TGTGCCATTA ATTTCTGCAG 7 92 0 

;5 GCATAGAAAG ATTAACTGGT TTAGATGGTA AAAATATGAT TTTACGTTCG GCCATTTTAT 7 980 

TAACAATCAC GGTTATATTT GCCATTAGTT CATATACAGG ATTGAAAAAA GGTATTCAAA 804 0 

AGTTAAGTGA TATCAACGTT TGGCTATCCT TTGTACTTTT AGCCTTTATA TTTATTATTG B10 0 

20 GACCGACTGT TTTTATTATG GAAACGACAG TGACAGGGTT CGGAAATATG TTGAGAGATT 8160 

TCTTTCATAT GGCAACATGG TTAGAACCAT TCGGTGGTAT TAAAGGTCGA AAAGAAACGA 8220 

ATTTCCCACA AGACTGGACA ATATTCTACT GGTCATGGTG GTTAGTATAT GCGCCATTTA 82 8 0 

25 

TCGGTTTATT TATCGCTAGA ATTTCAAAAG GTCGACGCCT TAAAGAAGTC GTGCTAGGAA 334 0 

CAATTATTTA TGGAACGCTT GGATGCGTAT TATTCTTTGG TATTTTTGGT AACTATGCTG 84 0 0 

TGTATTTACA AATTTCTGGA CAGTTTAATG TAACACAATA TTTAAATACA CATGGTACAG 84 6 0 

30 

AGGCAACCAT TATTGAAGTG GTG CATC ATT TACCATTCCC ATCATTGATG ATTGTACTAT 852 0 

TCTTAGTATC TGCTTTCTTA TTCTTAGCAA CAACATTTGA TTCGGGTTCA TATATTTTAG 8 5 80 

35 CGGCAGCATC TCAGAAAAAA GTGGTAGGCG AACCATTACG TGCCAATCGT TTATTCTGGG 8 64 0 

CATTTGCATT GTGCTTATTG CCATTTTCAT TGATGCTAGT TGGTGGTGAA CGTGCATTAG 870 0 

AAGTATTGAA AACTGCTTCA ATACTGGCAA GTGTGCCATT AATTGTTATT TTTATTTTCA 876 0 

40 TGATGATATC ATTTTTAATC ATTTT AGGGC GCGATAGAAT TAAACTTGAA ACGCGTGCTG 882 0 

AAAAATTAAA AGAAGTTGAA CGTCGTTCAT TGCGAATCGT TCAAGTATCa GAAGAAGAAC 83 80 

AAGACGATAA 7TTATAATTC AAAGCGGGTC TGGGACGACG AAATGaATTT TGTGAAAATA 8 94 0 

45 TCATTTCTGT TCCaTTCCCC TTTTTTTAGT AGCATTGTAG GATGAACTTT TAGGTTTTCA 90 0 0 

TTAATGTTGT ACTAAAAGAT TTAATTTTTT AGTGCTCCAA GTACTTATTT ATTGTATGAA 9 06 0 

GCATATTCTA AATCGAAGTT TGAAAGACTC TCATTGATTA TTAAATTAAA TAAAGGGTAT 9120 

50 

GCGTATGTAC AATTCAAATT AATCGAAGGA TGAAATAAAA TGACTAATCA ATTTAAAAAT 918 0 
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GGTTCAGTAT TTmCAGTACG CCGAGTGAAT CGCTAcGGAT CGTTTTTGTC GTTTAGACCA 576 0 

AGAGAGATTT GGAGATATTA AAGACATTAC AGATAAAGGA TATTATCAAA ACTCTTTCCA 582 0 

TTATGATGTA CGTAAAGATG TTACACCTTT TGAAAAGTTA GATTTTGAAA AAGATTATCC 5880 

TTATTATGCG AGTGGTGGTT TCATTCACTA TTGTGAGTAT CCGAAATTGC AACACAATTT 594 0 

GAAAGCACTA GAAGCGGTAT GGGACTACTC TTATGACAAA GTTGGTTACT TAGGTACAAA 6000 

TATTCCGATT GATCATTGTT ATGAATGTGA TTACGATGGA GATTTTGAAG CAACTGAAAA 6060 

AGGATTTAAA TGCCCGAACT GTGGCAATGA TAATCCTAAA ACAGTTGATG TCGTTAAACG 6120 

AACATGTGGT TACCTAGGCA ATCCAGTTCA ACGTCCAGTA ATTAAAGGCC GTCATAAAGA 6180 

AATTTGCGCA CGAGTAAAAC ATATGAAAGC GCCTAAAGAA TGATACTTTT AGACATTAAA 624 0 

CAAGGACAAG GTTATATTGC TAAAATAGAA TCAAATAGCT TTGTTGACGG TGAAGGAGTA 63 00 

20 AGATGCAGTG TTTATGTATC AGGATGTCCA TTTAATTGTG TTGGATGTTA TAACAAAGCC 63 60 

TCACAAAAGT TCAGATATGG CGAGAAATAC ACTGATGAAA TATTAGCAGA AATATTAGAT 6420 

GATTGCGATC ATGATTATAT ATCTGGGCTA AGTCTATTAG GTGGCGAACC ATTTTGTAAT 64 80 

25 TTGGATATTA CATTAAATCT TGTCAAAGCA TTTCGAGCAC GTTTTGGAAA TACAAAGACA 654 0 

ATTTGGGTAT GGACTGGATT TTTATATGAA TATTTAGCAA ATGATTGTAC AGAACGTCGA 6600 

GAGTTATTAT CATACATTGA CGTTTTAGTA GATGGTCTAT TTATACAACA CTTATTCAAA 6 660 

CCTGATTTAC CATATAAAGG TTCTTTAAAT CAACGCATTA TAGATGTACA ACAATCACTC 672 0 

TCGCATGCGC GTATGATTGA ATATATAGTT AGTTGAATAT GTATTAGAAG TCAAGGTAAC 67 80 

ATTCGTTGCC TTGGCTTCTT TTTAGGTTAG GTACATAATT GAAAGTTAAT AAAAGCAATT 6840 

CTTTATAAAA ATATATTGAT AGAATATGAC CTAACAATCA TTTTGATACC AATACTAAAA 6 9 00 

GTTGCATATC CGTTTTTTAA AAAAGTTGAA AGAGAAAAGT GGTATTTTAG TGGGAAGGAA 6960 

GTCTAACTTT TTGGTAGCGT TTTACAATAA ATAAATATTC GTTAATAACG TATAAATATT 702 0 

CTTAAATGCC ATTCTAGTAA AATTTGTTAA ATTCGTTAAA TCGTAACTTA ACACTGTTAT 7 0 BO 

TTTAGCGCTA TTAAGGTTTT GTTTATTACG GGAAAAATTA TATAAATATT CAATAATTGC 714 0 

45 CAAGTTTCAA ATTGTATGAA ATTTGCATTA TTATTAAATG TTAGTTATTG TCAATTTTGT 72 00 

GAATCAATAT AATTATTACA TTTTGAGATA AATCGAAACA GGATTCATAA AATTAATAAT 72 60 

TAGGGGGAGC ACAATTGAAA AAAGAGAAAG TTATGGACTG GACGACCTTT ATAGGGACAG 7320 

50 

TAGCTGTACT TCTTTTTGCA GTTATACCTA TGATGGCTTT TCCAAAAGCA AGTGAAGATA 73 80 

TCATCACTGG TATTAATAGT GCCATTTCTG ATTCAATTGG TTCGATATAT TTATTTATGG 7440 
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ACAGCAGGGC 


AATTTGGTGT 


ACCGTCTGTA 


TCAACAGCTT 


ATTCAATGGT 


CATAGGGAAT 


3960 


ATTATAGGTA 


CATTTGTCAG 


CCCATTTTCA 


CCAGCCTTAT 


GGTTGGCAAT 


TGGTTTAGCA 


4020 


GAGGCAAACA TGGGCACGTA 


TATTAAGTAT 


GCATTCTTTT 


GGATTTGGGG 


ATTCGCTATC 


4080 


GTTATGTTAG 


TAATTGCAAT 


GTTGATGGGC 


ATTGTGACGA 


TTTAAGTATG 


AAAAAATAGA 


4140 


AACTATGGTC 


ACGTTGCAAA 


ATGAAATAAT 


AGTTGCATAA 


ACATGTCGAA 


ATGACGGACG 


4200 


AATCTTTAAA 


CAATTTTAAA 


AATTAATGAA 


ATAATTGTGT 


AGAAATATGA 


ATTTCACTAA 


4260 


ATGTTAATAA 


CTTTGTGACG 


TTTTAGTTAA 


CAGACTAATA 


AAAATTTGAA 


AATACTATAT 


4320 


ATAGTGGTAT 


AACGTAATGA 


GTAGACACAA 


TATATAGGAA 


GAAGGGGTAA 


AATGAATCAA 


4380 


ATCGAAGAAG 


CATTAACGGG 


TTTGATTTCT 


AAAGATCCTG 


CTATTGTTAA 


CGAAAATGCT 


4440 


AACAAAGATA 


GTGATACATT 


TTCAACAATG 


AGAGATTTAA 


CAGCAGGTAT 


CGTTTCTAAA 


4500 


TCTTACGCAT 


TAAATCATTT 


ATTACCAAAG 


CACGTTGCAG 


ATGCACATCA 


AAGAGGGGAC 


4560 


ATACATTTT C 


ACGACTTAfSA 


TTATCATCCA 


TTCCAACCGT 


TAACTAACTG 


TTGTTTAATA 


4620 


GATGCTAAAA 


•«» x ivjl i. J*\k__rl 


TAATGGATTT 


GAAATAGGCA 


ACGCGAATGT 


AACTTCACCA 


4680 


AAATCAATAC 


AAACTGCATC* 


AGCGCAGCTT 


GTACAAATTA 


TAGCCAATGT 


TTCTAGCAGT 


4740 


CAATATGGTG 


GCTGTArGGT 


TGACCgCGTT GACGAATTAC tTAGTACATA TGCACGACcA 


4800 


TAATGAAGAA 


CAACATAGGA 


ATATsCGCAA 


AGCAATTTGT 


CAAAGAATCT 


GAAATTGATC 


4860 


GTTATGTTGA 


TCAACAAGTC 


ACTAAAGACA 


TCAATGATGC 


GATTGAAAGT 


TTAGAATATG 


4920 


AAATTAATAC 


CTTATATACA 


TCTAATGGAC 


AGACACCTTT 


TGTAACATTA 


GGATTCGGCT 


4980 


TAGGTACAGA 


TCATTTAAGT 


CGCAAAATTC 


AACAAGCTAT 


CTTAAATACT 


CGTATCAAAG 


5040 


GCTTAGGAAA 


AGACCGCACG 


ACAGCGATTT 


TCCCGAAACT 


TGTATTTTCA 


ATTAAAAAAG 


5100 


GAACQAACTT 


TAGTCCGCAA 


GATCCGAACT 


ATGACATTAA 


ACAACTAGCA 


TTAAAGTGTT 


5160 


CAACGAAACG 


TATGTATCCA 


GATATTTTAA 


ATTATGACAA 


ACTCGTAGAA 


ATATTAGGTG 


5220 


ATTTCAAAGC 


GCCAATGGGT 


TGTCGTTCAT 


TTTTACCAAG 


TTGGAAAGAT 


GCGGAAGGTC 


5260 


ATTTTGAAAA 


TAATGGTCGT 


TGTAATCTTG 


GTGTTGTTAC 


ACTTAATTTA 


CCTAGAATGG 


5340 


CATTAGAATC 


TGCCGGTAAT 


ATGACGAAAT 


TCTGGGAAAT 


CITITATGAA 


CGTATCGATG 


5400 


TGTTACATGA 


TGCATTACTT 


TATCGTATAA 


ATCGTTTGAA 


AGATGCTGTA 


CCGAATAACG 


5460 


CACCGATTTT 


ATATAAAAGT 


GGCGCATTTA 


ACTATAAATT 


AAAAGAAACA 


GATGATGTTG 


5520 


CTGAGTTATT 


TAAAAATAAA 


CGTGCAACGA 


TTTCAATGGG 


CTATATAGGG 


TTGTATGAAA 


5530 
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TACAGTTCGT ATATCGACTT TTTATATACT TGCCGACAGA AAATAAAAGT ACTTGATATA 2160 

ACATATAACG AAAAGCAACA GTTACTGTCA CAAATTGTGT CACAAGAATA TTTAAATCAT 22 20 

GACAAACAAG CTCAATTTTT AGCGTGGTTG GATGTAAGAT AATAATAGCG GACCGTCTAA 22 8 0 

CCGTCTAAGG TAAGTCTTCT TATTTTAACT TTAACGCTTA ATCATTGAAA TTAAGACATG 2340 

GGCGGCTTTG TGAATAGTCT AATAATGAAG GATTTAAGCG ATAATGATAT GCGTTTTAAA 24 00 

TATGAATATT ACAATAGAGA AAAAGATACG TAGAACAAAC TTAATAAAAT AGGTGGATAA 24 60 

ATTGAAATCT GGTTGAAGTC GTTACTATCA TAGCGACCTT TAGCCAGATT TTTTGTGCAA 252 0 

TAGAAAGCAA TAATAAAAAT GATAGATCAA AATGAAATAC AGGACAGGAT ATACAAGGAT 2580 

TAGTCATGCC ATGTTATCAA GTAGGAAAAT CAAACTTCAC TATTGATAGT TACGCAAAAA 264 0 

AGATTTTTTT GATAAAATGA GATAACTTAA ATATAAAAAA TTATATTAAT TATAATATTT 2700 

20 AAGTTAAAGA GGGGGATTAT GTAAATTGTA TTAAAAGTGG AGGGAGAAAA TAATATGAAT 27 60 

AGTGATAATA TGTGGTTAAC AGTAATGGGG CTCATTATTA TTATTTCAAT TGTAGGTTTA 2820 

CTCATTGCCA AAAAGATAAA TCCAGTTGTA GGTATGACAA TCATACCTTG CTTAGGGGCA 288 0 

ATGATTTTAG GATATAGTGT GACAGATTTG GTTGGATTTT TTGCTAAAGG GTTAGATCAA 2940 

GTCATCAACG TTGTTATTAT GTTTATCTTT GCCATTATTT TCTTTGGCAT CATGAACGAT 3000 

AGTGGTTTAT TCAAGCCGCT TGTCAAACGC TTAATATTAA TGACACGAGG CAATGTCGTC 3 06 0 

ATTGTCTGTG CAATGACAGC TTTAATTGGC ACAATAGCCC AATTAGATGG GGCCGGTGCG 3120 

GTAACATTTT TGCTTTCTAT TCCTGCATTA TTACCTTTAT ATAAAGCGTT AAATATGAAT 318 0 

AAATATTTAT TGATTTTACT ATTAGCATTA AGCGCGGCGA TTATGAACAT GGTACCTTGG 3 24 0 

GGAGGTCCAA TGGCTCGTGT AGCTGCAGTG TTAAAAGCCA AAAGTGTCAA TGAATTATGG 3300 

TATGGATTAA TACCTATTCA AATAATAGGT TTCATTCTTG TTATGTTGTT TGCGGTATAT 3360 

CTTGGATTTA AAGAACAGAA ACGTATCAAA AAAGCAATAG AGAGAAATGA ATTACCGCAA 342 0 

ACACAAGATA TAGATGTACA TAAATTAGTT GAAGTATATG AACGAGATCA AGATGTAAGG 348 0 

TTTCCTGTAA AAGGACGTGC AAGAACAAAA TCATGGATAA AATGGGTGAA TACAGCTTTA 354 0 

45 ACTTTAGCTG TTATTCTATC GATGTTAATA AATATTGCGC CACCTGAATT TGCATTCATG 3600 

ATAGGTGTTy CGTTGGCACT TGTTATTAAT TTTAAATCAG TGGATGAACA AATGGAACGA 3660 

TTAAGAGCgC ATGCGCCGAA TGCATTAATG ATGGCTGCAG TGATTATTGC AGCAGGTATG 372 0 

50 

TTTTTAGGTG TACTAAATGA AACCGGTATG CTTAAAGCGA TTGCGACCAA TTTAATCAAA 3780 

GTGATTCCTG CAGAAGTAGG ACCATACTTG CATATTATTG TAGGTTTACT TGGCGTACCA 384 0 

55 



30 



35 



40 



EP0 786 519 A2 



5 



10 



15 



30 



35 



40 



50 



TACTAGAAAA 


TCTAGGCGCT 


GAGCGTATAT 


GTAAGCGTGT 


AGATTGTGAT 


ATTGATTATG 


360 


AAGAAGACGC 


AGAAAAGTGG 


ATGGCAGACA 


TCATTAATAT 


TATTGATACC 


ACATCAGAAG 


420 


GTATTCAAAG 


TGAATCGGTG 


ATAAGTGAAT 


CAATTAAGTC 


TGCCAAAGAA 


AAGAAATATT 


480 


CTAAATCAAA 


TCCATACCAA 


GCAGAAGTAT 


TAGCGAATAT 


CAATTTAAAT 


GGTACCGATT 


540 


CAAATAAAGA 


AACACGACAT 


ATAGAATTTT 


TACTTGATGA 


TTTTAGTGAA 


TCATATGAAC 


600 


CAGGAGATTG 


TATAGTAGCA 


TTACCGCAAA 


ACGACCCTGA 


ATTGGTTGAA 


AAACTAATAT 


660 


CCATGTTAGG 


TTGGGATCCG 


CAATCTCCGG 


TGCCAATTAA 


TGATCATGGT 


GATACAGTTC 


720 


CTATTGTTGA 


AGCACTAACA 


TCACATTTTG 


AATTTACTAA 


ATTAACATTG 


CCATTATTGA 


780 


AAAATGCAGA 


TATCTATTTT 


GACAATGAAG 


AATTATCTGA 


ACGTATTCAA 


GATGAGTCAT 


840 


GGGCGCGTGA ATATGTTATA 


AATCGGGACT 


TTATAGATTT 


AATAACAGAT 


TTTCCAACTA 


900 


TAPAATTAPA 


a pptp AfauT 


ATGTATCAAA 


TCCTTAGAAA 


ATTACCACCA 


AGAGAGTATT 


960 


CP* ATTTPT Afl 


lnul 1 1 IMlu 


GCAACGCcAG 


ATGAAGTGCA 


TATTACCGTT 


GGTACGGTTC 


1020 


GTTATPAAP,P 


APATflTJAPPT 


GAGAGAAAAG 


GTGTATGCTC 


GGTTCATTTT 


GCTGAGCGAA 


1080 


TTAAACCAGG 


PHATATAOTA 


CCAATTTATT 


TGAAGAAAAA 


TCCGAACTTC 


AAATTTCCGA 


1140 


TGAAGCAAGA 


TATAPPPtPTT 


ATTATGATTG 


GACCAGGTAC 


TGrAATTGCT 


CCTTTTAGAG 


1200 


CATATTTACA 


AGAACGTGAA 


GAACTTGGTA 


TGACTGGAAA 


AACATGGTTG 


TTCTTTGGTG 


1260 


ATCAACACCG 


TAGTTCTGAC 


TTTTTATATG 


AAGAAGAAAT 


AGAAGAATGG 


CTTGAAAATG 


1120 


GAAACTTAAC 


ACGCGTAGAT 


TTAGCATTTT 


CAAGAGACCA 


AGAACACAAA 


GAATATGTAC 


1380 


AGCATCGTAT 


AATGGAAGAA 


AGTAAACGTT 


TCAATGAATG 


GATTGAGCAA 


GGCGCACAAT 


1440 


CTATATTTGT 


GGCGATGAAA 


AATGTATGGC 


GAAAGATGTC 


CATCAAGCCA 


TTAAAGATGT 


1500 


attgStaaaa 


GAACGTCATA 


TTTCTCAAGA 


AGAAGCAGAG 


TTATTATTGC 


GACAAATGAA 


1560 


ACAACAACAA 


CGCTATCAAC 


GTGATGTTTA 


TTAGCGATTG 


GTGTTAAATA 


TTTTAAGGTG 


1620 


TAATGATGTA 


AAAAGATATA 


AAGGATGTTG 


CTCAACATGA 


ATATGCCATT 


AATGATAGAT 


1680 


TTAACAAATA 


AAAATGTCGT 


CATAGTTGGT 


GGAGGCGTCG 


TTGCAAGTCG 


TCGGGCACAA 


1740 


ACATTAAATC 


AATACGTTGA 


ACATATGACG 


GTCATCAGTC 


CGACAATCAC 


TGAAAAACTT 


1800 


CAAAATATGG 


TAGATAACGG 


TGTCGTCATA 


TGGAAAGAAA 


AAGAATTTGA 


ACCAAGCGAT 


1860 


ATTGTAGACG 


CGTATCTAGT 


TATTGCAGCA 


ACCAATGAGC 


CACGTGTCAA 


TGAAGCGGTA 


1920 


AAAAAAGCCT 


TACCTGAGCA 


TGCCCTTTTT 


AATAATGTTG 


GAGATGCATC 


AAATGGCAAT 


1980 
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CCAAGAATAC CTCTAGGCAT TGTCTTTTGA GGATCAAGTG CTTCTGCTGA GTTTGCTGCG 23 4 0 

ATAGAATCGA AACCGATATA CGCTAAGAAA ATCATTGAAA CACCAGCATA TATGCCTTGC 24 00 

5 

CATCCACCAA AGTCACCTGT AGCAGTTACT TTGTGTTCTG GAATAAATGG CACATAGTTA 24 6 0 

CTAACATTTA TTGCTGTTAA ACCTACGATG ACAAATAAAA TAATAGCTAA TACTTTTAAA 2 52 0 

ATAACTAAAA TATTTTCCAT ACGAGCTGCT TCCGACATAC CACGTGATAG TAATAATGCA 2 58 0 

10 

GTTAATAAAA TAACGATAGC AGCAATAATA TCGATAAAAC CGCCATTTGT ACCAAATGGA 264 0 

TTTGATAATG CTGCAGGTAA TTCGATGCCA ATTGGTTTCA CAAGTCCGCG TAAATTCGCT 2 70 0 

15 GAGAATCCTG ATGCAACAAA GGCTACGGCG ATAAAATATT CAGCTAATAG AGCCCAACCG 276 0 

GCAACCCATC CAAAAAATTC ACCAAATAAT ACATTGACCC AAGAATAGGC TGAACCTGCA 282 0 

AATGGCATAG CGGCAGCCAT TTCTGCATAA GTAAATGCAA CTAAACCAGC AACAATAGCA 2 880 

20 GCGAGTAAGA ATGATAACGC AACGGCCGGT CCTGCATGTT CTGCAGCAAC AATGCCAGGT 2 94 0 

AGCGTAAAGA TAGATGTCGA TACAATTGTT CCTACACCTA AAGCTAAGAA ATCACGCACC 3 000 

CGAAGTGTAC GCTTTAAATG ACCATCTTTA TTTTGATAGA TAGCCGGATC CTCTTTTCGT 3060 

25 GCTATTTTAT TGAAAAAACT TCCCATAAAC TTTCCTCCCA AACATTCATA AACAATTCTA 3120 

TACGGTGTTT TTTAATATGT TATATCATAG CACAAATAAT CAATATTTTG TCTAAAAATT 3180 

CTGAAAAATC ACAACTTTAT GTTACGTATT AATGACTTGT CTTGATAACA TCCATAGATT 324 0 

30 

TTTTAAATGA TAAAACTGAT TATAACAGAT ATTAAATGAA TAAGTACTAT TTTTTGCnAA 33 00 

TTTTCTAACA ATTTTGCACA TTATATGTTT AAAATCAATT TCATGTTTAT GGTCTGATTG 3 36 0 

GCTAGTGTGT ATGAAATGTA AnTCTTTGAC TnnGA 33 95 

35 

(2) INFORMATION FOR SEQ ID NO: 120: 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13508 base pairs 
4 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ATCAGGTAAT GCCATGCGTT TAGCTGAAAA TTTTTTCAGA ACGTTTAAGT GATATCGGAC 60 

ATCAAGTTGT TTTGATGTCA ATGGATGAAT ATGATACGAC AAACATCGCG CAGTTAGAAG 12 0 

ATTTATTTAT TAXTACGTCT ACTCATGGTG AAGGAGAACC GCCTGATAAT GCATGGGATT 18 0 

TCTTTGAATT TTTAGAAGAC GATAACGCAC CTAATTTAAA TCATGTGAGA TATTCAGTAC 24 0 
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GTACGTTTAG CTAAGAAGCT TTGTGAGATT GCACCTGGAG ATTTTGAAAA AAGAGTGACC 540 

TTCGGATTAA CCGGATCAGA CGCAAATGAT GGCATCATTA AATTTGCCAG AGCATATACA 60 0 

5 

GGGCGTCCTT ATATCATTAG TTTCACTAAT GCATATCATG GTTCAACTTT TGGCTCATTG 6 60 

TCTATGTCAG CTATTAGTTT AAATATG CGC AAACATTATG GTCCGTTATT GAATGGTTTT 720 

TATCATATTC CGTTTCCAGA TAAATATCGT GGTATGTACG AGCAGCCACA AGCTAATTCA 780 

GTAGAAGAAT ATTTAGCACC CTTAAAAGAA ATGTTTGCGA AGTATGTACC TGCTGACGAA 84 0 

GTAGCATGTA TTGTTATTGA AACGATACAA GGCGATGGTG GACTTTTAGA ACCAGTTCCA 900 

is GGGTATTTTG AAGCGTTAGA AAAGATTTGT CGTGAACATG GTATTTTAAT CGCTGTCGAT 96 0 

GATATTCAAC AAGGTTTTGG GAGAACAGGT ACATGGAGTT CAGTCTCGCA TTTTAATTTT 102 0 

ACGCCTGATT TAATCACTTT CGGAAAATCC TTAGCAGGTG GTATGCCTAT GTCAGCAATT 108 0 

20 GTTGGACGCA AAGAGATTAT GAATTGTTTA GAAGCACCAG CACATTTATT TACAACAfiGT 114 0 

GCTAATCCAG TTAGTTGTGA AGCTGCATTA GCCACAATTC AAATGATTGA AGATCAGTCG 120 0 

CTTCTTCAGG CTAGTGCGGA AAAAGGGGAA TATGTTAGGA AACGAATGGA TCAATGGGTA 126 0 

25 

TCTAAATACA ATAGTGTAGG CGATGTTAGA GGTAAAGGTC TGAGCATTGG TATTGATATT 132 0 

GTTTCCGACA AAAAACTCAA AACACGTGAT GCCAGTGCGG CACTTAAAAT TTGTAATTAC 13 8 0 

TGCTTTGAGC ATGGCGTAGT TATTATAGCT GTAGCAGGAA ATGTGTTGCG ATTCCAACCG 144 0 

30 

CCATTGGTAA TAACATATGA GCAATTAGAC ACGG CGTTAA ACACTATAGA AGATGCACTG 1500 

ACTGCTTTGG AAGCAGGTAA CTTAGATCAA TATGACATAT CTGGACAAGG TTGGTAATAG 1560 

35 CGATTATCTT AATATAAAAT AAAAAATCAT TTCCACATCT GGATGTTAAT CAGATGGGAA 162 0 

ATGATTTTTT TTATTTTTTA TTTTGGTGGG TGGTATTCAG CTACGTCATT TTTCTTAGAA 16 80 

TGTCTAAGTC CATAACTTAA ATATAGGATG ATACCAACAA TAAACCAAAT TAAAGTGTAT 174 0 

40 AATTTCGCTT CGAATCCTAA TCCCCAGAAT ACTAGCAATA CTAAAACAAA TGTAATTGCT 180 0 

GGTAACACAG GATATAAAGG TAATTTAAAT GCAGGAATTG GTAGATCTTT ACCTTcACGC 186 0 

TTTCTCAAAC GATACATTGC TAATGAAACG AACATAAATG CAACAAGTGT ACCTGCTGAA 192 0 

ATTAATTCTG CTAAAAATGC GAATGGGAAC ATAGAACCAA TTAAAACACC AATAATAGTA 198 0 

AGTATAACTA GTGCGCGATT AGGTAAATGT TTGTCGTTTA AGTGGCTTAA CCATGAAGGT 2 04 0 

AATAAGCCGT CACGTCCAAA TGAATAAAGT AAACGTGAGC CTGCTAACAT CATACCAATT 2100 

50 

AATGCTGTAA ACATACCGAT AACAGAGATA GCTTGAACAA TAGCTGCTAC AACACCATGA 216 0 
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5 



15 



20 



25 



AACATAATAA 


GGTTTACTGG 


TTTAATGAGG 


AATATTATGG 


ATTTGGAGCA 


GGTGCAAGTG 


3360 


GTTATGTAGA 


TGGTGTGCGT 


TATACGAATA 


TCAATCCAGT 


GAATCATTAT 


ATCAAAGCTA 


3420 


TAAATAAAGA 


AAGTAAAGCA 


ATTTTAGTAT 


CAAATAAACC 


TTCTTTGACT 


GAGAGAATGG 


3480 


AAGAAGAAAT 


GTTTCTTGGG 


TTGCGTTTAA 


ATGAAGGTGT 


GAGTAGTAGT 


AGGTTCAAAA 


3540 


AGAAGTTTGA 


CCAATCTATT 


GAAAGTGTCT 


TTGGTCAAAC 


AATAAATAAT 


TTAAAAGAGA 


3600 


AGGAATTAAT 


TGTAGAAAAG 


AACGATGTGA 


TTGCACTTAC 


AAATAGAGGG 


AAAGTCATAG 


3660 


GTAATGAGGT 


TTTTGAAGCT 


TTCCTAATAA 


ATGATTAAAA 


AAAATTGAAA 


TTTCGAGTCT 


3720 


TTAACATTGA 


CTTACTTTGA 


CCAATTTGAT 


AAATTATAAT 


TAGCACTTGA 


GATAAGTGAG 


3780 


TGCTAATGAG 


GTGAAAACAT 


GATTACAGAT 


AGGCAATTGA 


GTATATTAAA 


CGCAATTGTT 


3840 


GAGGATTATG 


TTGATTTTGG 


ACAACCCGTT 


GGTTCTAAAA 


CACTAATTGA 


GCGACATAAC 


3900 


TTGAATGTTA 


GTCCTGCTAC 


AATTAGAAAT 


GAGATGAAAC 


AGCTTGAAGA 


TTTAAACTAT 


3960 


ATCGAGAAGA 


CACATAGTTC 


TTCAGGGCGT 


TCGCCATCAC 


AATTAGGTTT 


TAGGTATTAT 


4020 


Vj X <■ rVrV x vj x x 


TACTTr^AACA 

x x x vj^&^v\_»n 


AACATCTCAT 


CAAAAAACAA 


ATAAATTAAG 


ACGATTAAAT 


4080 


CAATTGTTAG 


TTGAGAATCA 


ATATGATGTA 


TCATCAGCAT 


TGACATATTT 


TGCAGATGAA 


4140 


TTATCAAATA 


TATCTCAATA 


TACAACTTTA 


GTTGTTCATC 


CTAATCATAA 


ACAAGATATT 


4200 


AT CAATAATG 


TACACTTGAT 


TCGTG CTAAT 


CCTAATTTAG 


TTATAATGGT 


TAT 


4253 



(2) INFORMATION FOR SEQ ID NO : 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 95 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 "(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

TCCCTAATCG AACAAAATTA TGCGCATAAA CAAAGTAGAT TGATATAAAA TTCTTAATTA 6 0 

TCAGAATATA TTTACAAATC TGAATTTTAT TAGTATATTG GrTAGTrTTC ATAGAGGCAT 120 

45 GACGGTaTTT GAGCAGGATT TTAAATCGGg ATTTTATAAT CGATTTAAGA GAGGCCACtT 180 

TGCTTGcACA TTAATACTGT CAATGGGAGG GGAATGTATA TGAGTrAAGC ACATCAATTA 240 

ATTCAAGAGG ATGAACATTA TTTTGCGAAA TCAGGACGTA TTAAATATTA TCCGTTAGTG 300 

50 

ATTGATCATG GATATGGAGC AACATTGGTT GATATTGAGG GGAAGACATA TATCGATTTG 360 

TTATCGAGTG CGAGTTCTCA AAACGTAGGT CATGCACCTA GAGAAGTAAC AGAAGCGATA 420 
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TATAAATATG GACTATTTAG ATGATATTCG TGTAAATATT GTTTATGAAT TACCTTTAGC 156 0 

TGAAGTTGTA TTTGATTTCT T CG AT CAACT TAAATCTAAT ACTAAAGGAT ATGCATCATT 162 0 

5 

TGATTATGAA TTCATCGAAA ATAAAGAAAG TAATTTAGTC AAGATGGATA TTTTATTAAA 168 0 

TGGTGATAAA GTGGATGCGC TAAGCTTCAT AGTTCATAGA GATTTTGCAT ATGAACGTGG 174 0 

TAAAGCATTA GTTGAAAAAC TTAAAACGTT AATTCCAAGA CAGCAATTTG AAGTACCTGT 18 0 0 

10 

ACAGGCTGCA ATAGGACAAA AAATTGTAGC GCGTACAAAT ATTAAATCAA TGGGTAAAAA 186 0 

CGTTTTAGCT AAATGTTATG GCGGTGACAT AAGCCGTAAA CGTAAATTAC TTGAAAAACA 192 0 

15 AAAAGCAGGT AAAGCTAAGA TGAAAGCAGT TGGTAATGTT GAAATTCCAC AAGATGCTTT 1980 

CTTGGCTGTA TTGAAAATGG ATGATGAATA ATTTTAAAAA ATCAATTAAC AATTTACAAT 2 04 0 

GAATAAAGTT TAATAACTAA AAAGAGGGAG CCTAGGATAA ATTAACGTCC TGGGCTTTAC 2100 

20 AATGTTATAT TGGCAGCCAT CGACAGAGTT AAAATG AG CT TATAArAATG GttCCCCAAC 2160 

ACAGAAGCTG ACGAAAAGTC AGCTTACTAT AATGTGCAAG TTGGGGTGGG GCCCCAACAT 2 22 0 

AGAGAATTTC GAAAAGAAAT TCTACAGGCA ATGCAAGTTG GGGTGGGACG ACGAAATAAA 228 0 

25 

TTTTGCGAAA ATATCATTTC TGTCCCACTC CCTTATGCAT GAGTTTTACT CATGTAATTT 234 0 

TATTTTTAAG GACATATTAC ATCTGGCTAA TGTGTAAGAG CCACTACATA ATAAATCATT 24 0 0 

AGTGGTTCTT TATTATTTCT ATCTCACTCC CTCTAAACAA GAATAAATAT TAAAATGAAT 24 6 0 

30 

CGATATATTA GACAATCATT GATTAAACGT TAAAGTTAAA AGTAAGAATA ATTGCAGATA 252 0 

GTCCAACAGG ATATAGCCGA TTGGATAAAA AGTCTGAGAA GCGGGGCATT AAAATGACGG 2 58 0 

35 TACAAAGTGC ATATATACAT ATTCCATTTT GTGTAAGAAT ATGTACATAT TGTGATTTCA 264 0 

ATAAATATTT TATACAGAAT CAACCTGTAG ATGAGTACTT AGATGCACTA ATCACAGAAA 270 0 

TGTCTACAGC AAAATATAGG ATCTTAAAGA CCATGTATGT AGGTGGCGGC ACACCAACGG 276 0 

40 CCCTTTCTAT TAATCaGTTG GAAAGATTAC TTAAAGCAAT ACGTGATACG TTTACAATCA 2 82 0 

CAGGCGAGTA TACATTTGAA GCAAATCCTG ATGAGTTAAC TAAAGAGAAA GTCCAACTAT 288 0 

TAGAGAAATA TGGAGTAAAA AGGATTTCAA TGGGCGTTCA AACATTCAAG CCGGAGTTAT 2 94C 

45 

TGTC7GTTTT AGGTAGAACG CACAATACTG AAGATATTTA CACTTCGGTG TTAAATGCTA 3CCG 

AAAACGCAGG TATTAAATCA ATCAGTTTAG ATTTAATGTA TCATTTACCG AAACAGACGA 3 06 0 

TTGAAGATTT TGAACAAAGT TTAGATCTAG CTTTAGATAT GG AT ATT CAA CATATTTCGA 312 0 

50 

GTTACGGCTT AATACTTGAA CCTAAAACCC AATTTTATAA TATGTATAGA AAAGGCTTGC 318 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 



25 



35 



(XI) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


118 : 






AGTACGTTTT 


ATAATTATAA 


GTACGTAATT 


AACATATTAA 


CATATCGCAA 


GTATGTATTT 


60 


AAATAAgATT GTTATAATTT 


CAAAGTTCAT 


CCAAGaTTAT 


GGCGTTTGCA 


TTTACCTATT 


120 


AAAAACGTTA 


TTATATCAAA 


GATGCGAAAG 


ATAATACGGG 


TTTATTTTAT 


GAAAGTGAGA 


180 


AGGATAAAAT 


GGATAATGAG 


CAACGCTTAA 


AAAGAAGAGA 


GAATATAAGG 


AATTTCTCGA 


240 


TTATAGCACA 


TATTGACCAC 


GGAAAATCTA 


CATTGGCTGA 


TAGAATTTTA 


GAAAATACCA 


300 


AATCAGTTGA 


AACAAGAGAT 


ATGCAAGATC 


AGTTACTAGA 


TTCAATGGAT 


TTAGAAAGAG 


360 


AACGTGGTAT 


TACAATCAAA 


TTAAACGCgT ACGTTTAAAG TACGAAGCTA 


AAGATGGAAA 


420 


TACTTATACA 


TTCCATTTAA 


TCGATACGCC 


TGGACACGTC 


GATTTTACAT 


ATGAAGTGTC 


480 


ACGTTcTTTG 


GCAGCTTGTG 


AGGGCGCGAT 


TTTAGTAGTA 


GATGCGGCTC 


AAGGTATCGA 


540 


AGCACAAACA 


TTAGCAAATG 


TTTATTTAGC 


ATTAGATAAT 


GAGTTAGAGT 


TATTGCCTGT 


600 


TATTAACAAA 


ATTGATTTAC 


CTGCTGCAGA 


ACCTGAACGC 


GTGAAACAAG 


AAATTGAAGA 


660 


TATGATAGGT 


TTAGACCAAG 


ACGATGTTGT 


TTTAGCAAGT 


GCTAAATCTA 


ACATTGGAAT 


720 


TGAAGAGATA 


CTAGAGAAAA 


TAGTTGAAGT 


TGTGCCAGCT 


CCAGATGGTG 


ACCCAGAAGC 


780 


ACCACTAAAA 


GCGTTAATAT 


TTGATTCTGA 


GTATGATCCA 


TATAGAGGGG 


TAATTTCATC 


840 


GATAAGAATT 


GTGGACGGTG 


TTGTTAAAGC 


CGGAGATAAA 


ATTCGAATGA 


TGGCCACTGG 


900 


TAAAGAGTTC 


GAAGTAACAG 


AAGTTGGAAT 


TAATACACCT 


AAGCAGCTTC 


CAGTTGATGA 


960 


ATTAACAGTT 


GGTGATGTTG 


GTTATATTAT 


TGCAAGTATT 


AAAAATGTTG 


ATGATTCTAG 


1020 


GGTTGGTGAC 


ACCATCACAT 


TAGCTAGTAG 


ACCTGCATCA 


GAACCATTGC 


AAGGTTATAA 


1080 


GAAAATGAAT 


CCAATGGTAT 


ATTGCGGACT 


GTTCCCAATA 


GATAACAAAA 


ATTATAATGA 


1140 


TTTAAGAGAA 


GCATTAGAAA 


AATTACAATT 


GAATGATGCA 


TCATTAGAAT 


TTGAGCCTGA 


1200 


ATCGTCACAA 


GCATTAGGTT 


TTGGTTATAG 


AACTGGTTTC 


TTAGGTATGT 


TACACATGGA 


1260 


AATAATT CAA 


GAAAGAATTG 


AAAGAGAATT 


TGGTATTGAA 


TTAATTGCAA 


CTGCACCATC 


1320 


TGTAATTTAT 


CAATGTGTTT 


TAAGGGACGG 


TTCAGAAGTG 


ACGGTTGATA 


ACCCAGCACA 


1380 


AATGCCAGAT 


CGTGATAAAA 


TTGATAAAAT 


ATTTGAGCCA 


TATGTTCGTG 


CAaCTATGAT 


1440 
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TATCAGACGA AACACAATTC ATTGTTATTA CACACCGTAA AGGAACAATG GAATTTGCAG 186 0 

ATAGGTTATA CGGTGTAACA ATGCAAGAAT CAGGTGTTAC TAAACTTGTG AGTGTGAATT 192 0 

5 

TAAATACAAT AGATGATGTG TTGAAGGAGG AGCAATAATG AGCTTTTTTA AACGCTTAAA 198 0 

AGATAAGTTT GCAACAAATA AAGAAAATGA AGAAGTTAAA TCCTTAACAG AAGAACAAGG 2 04 0 

TCAAGACAAA TTAGAAGATA CACATTCTGA AGGTTCAACG CAGGACGCAA ATGATTTAGC 210 0 

10 

AGAAAATGCT GAAGTGAAAA AGAAGCCACG CAAGTTGAGT GAAGCGG ATT TTGATGACGA 2160 

TGGCTTAATA TCAATTGAAG ATTTTGAAGA AATTGAAGCT CAAAAAATGG GTGCTAAATT 2220 

15 TAAAGCAGGA CTCGAAAAAT CTCGTCAAAA TTTCCAAGAA CAATTAAATA ATTTGATAGC 228 0 

GAGATATCGT AAAGTAGATG AAGACTTTTT TGAAGCTTTA GAAGAAATGT TAATCACTGC 234 0 

AGACGTCGGT TTTAATACAG TGATGACGTT AACTGAAGAA TTACGTATGG AAGCACAACG 24 0 0 

20 ACGTAATATT CAAGATACTG AAGATTTGCG TGAAGTCATT GTTGAAAAGA TCGTAGAGAT 24 6 0 

TTACCATCAA GAAGATkATA ATTCAGAAGC TATGAACTTA GAAGATGGTC GTTTAAATGT 252 0 

CATTTTAATG GTTGGTGTGA ATGGTGTTGG TAAAACAACA ACAATTGGAA AATTAGCTTA 2580 

25 CCGATATAAA ATGGAAGGTA AAAAAGTAAT GTTAGCTGCG GGCGATACTT TTAGAGCGGG 2 64 0 

TGCTATTGAT CAATTGAAAG TTTGGGGCGA ACGTGTTGGT GTAGACGTAA TTAGCCAAAG 27 0 0 

TGAAGGTTCT GATCCAGCTG CTGTTATGTA TGATGCgATT AATGCCGCTA AAAACAAAGG 276 0 

30 

TGTTGATATT TTAATCTGTG ATACCGCTGG ACGTTTACAA AATAAmACAA ATCTAATGCm 2R20 

AGAATTAGAA AAAGTTAAGC GTGTAATTAA TCGAGCAGTG CCAGATGCGC CTCATGAAGC 2 8 80 

35 ATTACTATGT TTAGATGCTA CAACTGGTCA GAATGCGTTG TCACAAGCTA GAAACTTTAA 2 94 0 

AGAAGTAACA AATGTTACAG GTATTGTATT AACGAAATTA GATGGTACAG CCAAAGGTGG 3 000 

TATCCTATTA GCCATTCGTA ATGAATTGCA CATCCCAGTT AAATATGTAG GTTTAGGTGA 3 060 

JO GCAATTAGAT GACTTACAAC CATTTAACCC TGAAAGTTAT GTCTACGGCT TATTCGCTGA 312 0 

TATGATTGAA CAAAATGAAG AAATAACAAC AGTTGAAAAT GATCAAATTG TAACAGAAGA 3180 

AAAGGACGAT AATCATGGGT CAAAATGATT TAGTtAAAAC GTTACGAATG AATTATTTGT 3 24 0 

4t) TTGATTTTaT CAATCCTTAT TGACGAATAA ACAACGTaAT TATTTGGAAT TATTTTATCT 3 3 00 

TGAAGATTAT TCTTTAAGTG AAATCGCAGa TACTTTTAAT GTGAGTAGaC AAGCAGTTTA 3 36 0 

TGATAATATA AGAAGAACTG GCGATTTAGT TGAAGATTAT GAAAAGAAAT TGGAATTATA 34 2 0 

50 

CCAGAAATTT GAGCAACGCC GAGAAATATA TGATGAAATG AAACCACATT TAAGTAATCC 34 8 0 
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10 



15 



25 



AACTCAGGCA ATTGAAACAG CATTAGGTGC TTCATTACAA CATGTCATTG TAG ATT CAG A 6 0 

AAAAGATGGA CGCCAGGCTA TTCAATTTTT AAAAGAACGT AATTTAGGTC GTGCGACGTT 12 0 

TTTACCATTA AATGTTATAC AGAGTAGAGT GGTAGCGACT GATATTAAAT CTATTGCTAA 180 

AGAGGCAAAC GGATTTATTA GTATCGCTTC GGAAGCAGTT AAAGTAGCAC CAGAATATCA 24 0 

AAATATTATC GGGAATTTAT TAGGTAATAC GATTATCGTT GATCATTTAA AGCATGCAAA 3 00 

TGAATTGGCA CGTGCGATTA AATATCGAAC TCGTATTGTT ACTTTGGAAG GTGATATTGT 360 

AAATCCTGGT GGtTCTATGA CTGGTGGTGG CGCTCGTAAG TCAAAAAGTA TTCTGTCTCA 420 

AAAAGACGAG TTGACAACAA TGAGACACCA ATTAGAAGAT TACTTGCGTC AAACAGAATC 480 

ATTTGAACAA CAATTTAAAG AGTTGAAGAT AAAAAGTGAT CAATTAAGTG AACTGTATTT 54 0 

TGAAAAAAGT CAAAAGCATA ATACACTTAA AGAGCAAGTG CATCATTTTG AAATGGAGCT 600 

20 CGATAGATTA ACTACACAAG AAACACAAAT AAAAAATGAT CATGaAGAAT TCGAATTTGA 660 

AAAAAATGAT GGTTATACGA GTGACAAAAG TCGACAAACT TTGAGTGAAA AAGAAACTTA 720 

TCTAGAAAGT ATTAAAGCAT CTTTAAAACG ACTAGAAGAT GAAATTGAAC GCTACACAAA 78 0 

ACTTT CT AAA GAAGGTAAGG AAAGCGTTAC TAAAACACAA CAAACCTTAC ATCAGAAACA 84 0 

ATCTGATCTT GCTGTGGTTA AAGAGCGTAT TAAAACACAA CAACAGACAA TAGATCGATT 900 

AAATAATCAA AATCAACAAA CTAAACATCA ATTAAAAGAT GTTAAAGAAA AAATTGCATT 96 0 

CTTTAATTCG GATGAAGTGA TGGGCGAACA AGCTTTTCAA AATATTAAAG ATCAAATTAA 1020 

TGGTCAACAA GAAACGAGAA CACGCTTATC AGATGAATTA GATAAATTGA AACAACAACG 108 0 

TATTGAGTTG AATGAACAAA TCGATGCGCA AGAAGCTAAA CTACAAGTTT GTCACCAAGA 114 0 

TATTTTAGCT ATCGAAAATC ACTACCAAGA TATTAAAGCT GAACAATCAA AGCTAGATGT 120 0 

ATTAATTCAT CATGCGATAG ATCATTaAAT GATGrATATC AATTGACTGT TGAACGTGCG 1260 

40 ArATCTGAAT ATACGaGTGA TGrATCGATg ACGCATTACG TAAAAAAGTT AAGTTAATGr 132 0 

AGaTGyCGAT TGATGrACTA GGTCCTGTAA ACTTAAATGC AATTGAACAA TTTGAAGAGT 1380 

TAAATGAACG TTATACATTT TTAAGTGAAC AACGTACAGA TCTTCGTAAA GCTAAAGAAA 144 0 

CATTAGAGCA AATTATAAGT GAAATGGATC AAGAGGTTAC TGAAAGATTT AAAGAAACTT 1500 

TCCATGCTAT TCAAGGACAT TTTACAGCTG TGTTCAAACA ATTGTTTGGT GGAGGCGATG 156 0 

CAGAATTGCA ATTAACTGAA GCCGATTATT TAACAGCTGG TATTGATATT GTGGtACAAC 162 0 

CACCGGGTAA AAAGTTG CAA CATTTATCGT TACTGAGTGG TGGTGAGCGT GCATTAACTG 16 80 

CTATTGCTTT ACTATTTGCA ATTTTAAAAG TAAGATCTGC ACCTTTTGTT ATATTAGrTG 174 0 
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TATCAAGTTT TAGGTGCTTT GCCATGATTT AAGAGTCACC CCCATACTTT GGGCATTTTA 8 82 0 

ACGCCAGAAT AAATCCCCCG CCACTATGTG AAGTGTGGGG GATTATTTAT ATTTTATTAG 8 88 0 

5 

AATATTCAGA TTTTTGAGTG TGTCAACTTA GCTTAGTCAA TGTATATTTA ACGTCACTTA 8 94 0 

CTCTTTTTCT TTCATAATTA ACACATTCAA ATAAACTTTG ATCAAAAAAC ACAAAGTTAA 90 0 0 

AAGTACCATC TTGTAATATG CTCTCATACA TTATCCCGTC ATATTTAAGG CTTCGAATAT 906 0 

w 

AATCAGCTAA ATATTGAAAT GGCAAATAAT CTATTCCTTG TTCATCGCTT GGATTTGTTA 912 0 

TTCCTTTATG AATCTTTTTT AATGTTTGGT AATTTACAAA ATACTTTCTA AATCCATCAT 918 0 

15 CGCCAGCTTT GATTGCATTA CTAGTTAAAT TAGTTAAATT CGCAATTTTC AATTTCTCTT 924 0 

TTGTCACGTT TTTTTGTAAC TTAACCTTAC CTATATAAAT AATGTCATTA TGCTTAGGTT 930 0 

TAACTTCTTC TATACTGACC TGTTCTTTTG TACTAAGGTA TAATACGCTT ATCCATTTAG 93 6 0 

20 AATTCAATCT TCCTGCCGTT GCAAATCCCT TTOGTGGTGA CATTAGTTCA CTTTTCTCTG 942 0 

TAATGAACTT AACTATTCTA GAT CT AT AT A ATGGTTCAAA TCTTTCTCTA AATTCCTCAA 94 8 0 

TACTATAGTA ATTAGTAGTG ATATCGAGAA AGAACGCTAA ATTCTCTAAA TTGATCATAT 954 0 

25 

TTTTATGAAA TCTATTTTTA TACTTCAAGC TCTCACAAAA TCCATCCCAG TCATTATTTG 9600 

CTACAATTAG ATTTTTATTT GTATATTTTT TATCGTTTAT GATTTTAGCG CCTACTAAAT 9660 

CTTCCAACAC TCGTCTATCT AAATTTTCAT CATCTTTAAA AAGTTCATTT AAAATACAAC 972 0 

30 

TTATTTGAGC TTCCTCAACA TTAAATATAC TCCAGTCGTC TTTTAATGCT ATTTCAATCT 973 0 

TTTTACCTTC TTTTGGGCTA AAAGTATCTG GTAAATTTAT ACTAATATCA TATAATTCTA 984 0 

35 ATGCTGGTCT TAAATAATCT CTAATAAGTT CTAATTTATC TATGTCCTTA GTCGTATCAA 990 0 

ATATTTTAAC ACCAAGATGA TTGTTATCAA TATCACAATT GTCAAATTTG CTATTTATCA 996 0 

TTTGCAATGA TTTCTACGAT TTCAGTATTA TTAAAACATT TTTCACATAT TTTCATTTTG 10020 

40 AGACtCCAAG TATCTATTCA TAATTTCTAG GTGATGCATG ATAGATAACC TTTTAATTAA 10080 

ACCTAATCCT GGATaCTTAT TATTTTCATT TAATTCTTCA AATTGTCCCA AGCGCATAAG 1014 0 

ATCTATTTTT AATATCTAAG TTTTTTGACC ATGTTACTAA TT 10162 

45 

(2) INFORMATION FOR SEQ ID NO: 117; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3491 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: double 

(3) TOPOLOGY: linear 
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CTTGTAAAAT AATCTTGAGT AGATTACTAT 
ATTTGTGaAT AGGGAGGCAC AACATCATGT 
5 TACAATTCAA TTATGATGAA ACTACAGTTC 

GAAAAAAACA TATCCTAGGT ATTGTTGGTG 
AATCTATTTT AGGGCTACTA CCAGATTATC 

10 

TTAATGGGCA ATCGTTAAAT AATTTATCAA 
ATATTTCAAT GATTTTTCAA GATCCACTCT 
AACAAATTAC AGAAGTAATA TTTCAACATA 

15 

TGACAATAGA CATTTTAGAA AAAGTAGGTA 
ATCCACATGA ACTTTCTGGT GGTATGCGTC 

20 TAAAGCCACA AATTTTAATC GCAGATGAaC 

ATCAATTACT GCAGTTAATG AAGTCCCTTT 
TCACTCACGA TTTAGGCGCT GTGTATCAAT 

25 GAAGTGTCGT TGAAAGTGGC ACGGTTGAAA 

CAAAACGCTT AATAGATGCG ATTCCTGATA 
ACAATGATAT TTTATTAAAA TTCGATCGCG 

30 

CCTATACCGA GCAGTTAATG ATATTAACTT 
TGTCGGTGAA TCAGGGTCAG GGAAATCGAC 
AGTGTCAGAA GGCTTTATTT GGTATAACGA 

35 

ATTGAAATCT TTACGACAAG AGATACAAAT 
TCCAAGATTT AAAGTCATTG ATGTGATTAA 

40 AGATAATGAT GACATTATTA AAACTGTCGT 

AACTTTCTTA TATCGCTATC CACACGAATT 
CGCGAGAGCA CTTGCTGTTG AACCTAAAGT 

45 AGACGTTTCA ATTCAAAAAG ATATCATCGA 

CATCACTTAT TTATTCATCA CACATGACAT 
TGCAGTTATG AAAAATGGCG AAATCGTTGA 

50 

TCCGCAGTCA GACTATGCAA AGCAACTTAT 
GTCATGCGTT GTGCAACTTT ATCACTGTAT 



GATATACAAA AGTATAGAAT AAATTTACAC 702 0 

CAAATTTATT AGAAGTCAAC AGTCTGAATG 7080 

AAGCGGTAAA AAACGTCTCT TTCGAATTAC 714 0 

AATCAGGATC AGGAAAAAGT ATTACCGCTA 72 00 

CAGATCACAC ATTAACAGGA GAAATTATTT 72 60 

CTTCAGCGTT ACAACAAATT CGAGGTAAGG 73 20 

CTTCGTTGAA TCCAAGATTA ACGATTGGCA 73 80 

AACGTGTATC TAAATCTGAA GCAAAGTCGA 744 0 

TAAAACATGC AACTCGACAA TTTGATGCTT 7500 

AACGTGTCAT GATAGCAATG GCATTGATTT 7560 

CAACAACGGC ATTAGATGCC AGTACACAAA 7 62 0 

ATGAGTACAC AGAAACATCT ATTATTTTTA 7680 

TTTGCGACGA TGTGATTGTA ATGAAAGATG 7740 

GTATTTTTAA ATCGCCACAA CATACCTATA 7800 

TTCATCAAAC GCGTCCGCCA AGACCGTTAA 7860 

TGAGyGgGAT TACACATCAC CGAGTGGCAG 7 92 0 

GGCTATTAGA AAAGGCGAAA CATTAGGCAT 79 80 

ATTAGCTAAG ACGGTCGTCG GTCTAAAGGA 8 04 0 

ATTACCATTA AGTTTATTTA AAGATGATGA 8100 

GATTTTTCAA GATCCATTCG CATCTATTAA 816 0 

ACGACCACTA ATCATTCATG GGAAAGTCAA 822 0 

ATCGTTGTTA GAAAAGGTTG GCCTAGATCA 82 8 0 

ATCTGGTGGG CAACGTCAGC GTGTAAGTAT 834 0 

GATTGTTTGC GACGAGGCAG TGTCCGCTTT 84 00 

GTTATTAAAA CAATTACAGT TAGACTTCGG 84 60 

GGGTGTTATC AATGAAATAT GTGATCGCGT 852 0 

ACTGAATAAC ACAGAAGATA TTATCAAACA 85 80 

TTCAGAAGTA GCAGTTATTG CTAAATAAAA 864 0 

GGTCTGAAAT AAATTGCGCG ACTTCTGATG 8700 
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TATAAGCTTT AATCAACTTA TCATAGATTG ATTTATCGTC CTTGTCTTTC TCTTTACGCA 342 0 

ACTGATCGAT GTCCTCATCT TTTAATATCT TGATGTCATT TATATGTTTG TGCATATTGT 34 8 0 

AAGTATTATT GTTAGGCACA GACTTTTTAT CACGTGCTCT ATCTAAAGAA AACTTAACAT 3 54 0 

CTTCAGCCGA TACACGCTCT CCAGTATTAC GTGCTTGTCC ATTGACCACT TTCGCAAAAT 3600 

AAT CAT CATC TCTTAACAAG AAATAAAATG CTTTATTGTC CTTATTCACA GCATAATCAT 3 66 0 

GACTTAACGA ACCTTTCGTT GTTAAATGAT CATTTTCATC TAATAATAAT AACCTTGTGT 3720 

ACATATTCAT ATTAATTGAA TATACTGACG GCGCAATTGA ACGTATTGGA TCCAATGTAG 3 780 

GAATTTCACC ATCTTGTTGT GTCATCACAA GTGGCCGCGT ATCTCGTTCT CTACTATTGT 3840 

TGTAATCAAA TTGTTGCCAT ATTAATGCAC GTGAATTTGG CAATCCAACA CTATTTTTAT 3 900 

CTAACACTTT ATTGTCATAT ACTAAATTCT TTTTTGATCC ATATAAAGGC GCCATATACC 3 960 

CTTTATCAAA TACAACTTCA TCTTCAATTT GCTTATATGT TTGTTTAACA TCTGCTTCAT 4 02 0 

TTTGAGTAGA AGCTTTATTT AACAACTGGT CTACATGTTT ATCTTTCAAT AAACTATTTG 4 080 

ATCCTGTAGA ACTAAATAAT GCCGTCATAG CATAGTTCGG GTCACCAAAC ACTGTCATCC 4140 

25 AGTCATCAAT TTGGATATCA TAATTGCCGG CTTGACGTTG TGTACGATAG CTAC CAT AAT 4200 

CTGGTTGGAT ATTCATCTTC ACGTTAAATC CTGCATTTTC CAATTGATCT TTAACGATAT 4 2 60 

TCATATCATT TTCATAACTT GCTTGTCCTA GGAAATGTAT TGTTGGTCGC TCGCCTTTCA 43 20 

CTTCAACTTT CGATGACTTT TGAGCCACTT CTGATTTCGT AGGGACACCA CAACCACTTA 4 3 80 

ATACCAACGC TAAAACTATA ATTGCGATAC TAATGATTTT CTTCACATCT ATCCCTACCT 444 0 

TTTTAATGAA TTCTTGGATC TAGTGCATCA CGCACTGCAT CACCTATAAA ATTAAATGCT 4 5 00 

AAAACGACGA ACATAATACA AACACCAGGT ACAATAGCTA AATTACTGTG CGTTTCCAAG 4 560 

TAGTTACTAC CGGTACGTAA AATGTTGCCC CATTCAGCTA CATCAGGTGC AACACCAAGT 4 620 

CCTAGGAAAC TTAAACTACT TGTTGTTAAT ACAACCACAC CTATATTTAA TGAAAAACGT 4 6 80 

ACAATCATAG GCGCAATCGC ATT CGGTAAA ATATAACGCC ATATGATATT CCAAGTGTTT 474 0 

TCACCAGTGA TACGTGCTGC ATCTACATAT TCCATGCGTT TAATTTCTAA AACACTGGCA 4 8 00 

45 CGCATTGTCC GTGCAAATGA TGGTATATTA CCGATACTTA AAGCAATAAT TAAATTTGGA 4 860 

ATACTTGCTC CAAATGATGC AATAATTGCC ACCGCTAACA ATAATGATGG AATTGCAAAC 4 920 

ACTACATCTA AAATTCGCAT TATTAAATTA TCAATATGAT TAAAATAACC TGCGATAGTG 4 930 

CCTAGTAACA CACCAAAAAT AACTGCAATA ACTACTGAAA TAATTGAAAT TGAAAATGTC 5040 

AGCTTCGTTC CTACAACTAC GCGTGTAAAT AAGTCTCTAC CGAAATCATC AGTACCAAAC 510 0 
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ATCAGACACA ACACCATGCT CTATATCAAT ATTTGCTTTA TTGCTATCAA TGAGCGTACT 162 0 

GCGTGCTTTT AAAT AAT CAT CATCAATTAA TGACTGTACA GGCAC CTCAT GAAAATTATC 16 80 

ATCCGCCAAG TATTGCGCAC GATCACTATA TGCTAAATGC ATCGCTTGTA TCAAATGATG 174 0 

CAAGTAATCA ACAGATCTTG GACCCATAGA TGGTAAATCG ACATGTTCTA ATAACTTCAA 1800 

TATTTGAATT ACCGTGATAC CGCCAGAACT AGATGGTCCC ATTGaATAAA TGTCATAGTC 186 0 

TTTAAATGTT GCACTGATTG GCGCTTTAAT CTGAATGTCA TATTTGGCTA GATCCTCTAA 192 0 

AGTGATTGTC CCACCACATG CTTTGACAAC ATTGACTAAT TGTTTCGCAA TGTCACCTTT 1980 

ATAAAATGCA TTAAACCCTT GTTCTCTTAA TATTTGAAAT GTCTTACCTA ATTCGGGTTG 2 04 0 

TACAATCCAA TCACCTTCAC GCCAATATTG ATTTTCATGC GTAAATACTT GTGCCGTTTC 2100 

ATGATACTTT GTCAATCGTG CGTGTTGCTG GCGCGAATAT TTTTCAGTAG CCCAATTGGC 216 0 

20 TGCATGACCT TCAATGGCTA GTTCAATTGC AGGATTAATT AAATCTTCCA ATGACAATTT 2 22 0 

AGCATAACGC TTGTGAATAT AATCAAACAG CTTTGGAATT GCTGG CACAG CGACAGTTTT 2280 

ACCATGTGTA GTCATATCAA AAAATGATTT ATATTCGCCT GAATCATCTA GATAAAATTG 234 0 

TTTGTCTACA TGTTCAGGTG CTGTCTCACG TGCATCAAAC GCAGTTATAC TGCCAGTACT 24 00 

TTGCTCATAA TATAGCAAAT ACCCGCCACC ACCAATACCT GATGCAAATG GTTCTACCAC 24 6 0 

ATTCAATGCC AGTTGAATTG CAATCACTGC ATCCATGGCG TTGCCACCTT GATCTAATAC 2 52 0 

ATCCTTACCA ATTTTAGCCG CAAGAGGATG TQATACGGAA ATTAACCCTT CTTTAGATGT 2 580 

TTTTGTCTGT TTGTCATTTA AGTTAATGAC CATACTATAT CCTCCTACTT TCTGTTAAAT 264 0 

ATTTAAAACA TTATTGATTA ATGGCTTTTT CTACTTTTTC TAAATCTTGA CGTTGCTCGT 270 0 

TACCAGTATC GACAAGTGGT GTAATCGGTG ATGCAATTTT AAATTTATCG CCACGATAAA 276 0 

ACTTAATAAA TTGATCCTGA TCTATCGCAT TAACTACTGC TTGTCTCAAG TTTGGATGCG 2 82 0 

40 TCTTAAATAT ACCTTTTTTA ATATTTAGCA TTAAAAAGAC TGACTTGCGT CCATTTTTGC 2 88 0 

GAATAATGCT TAAATTTTTA TCCGACTTAA TTAAATCAAA ATGTTTTTGA TTCACATCTG 2 94 0 

CCAACATATC AATTGAATGA TTTCTAAGTT CTGACAATGC ATTATTCGGG TCACCATTAA 3 00C 

45 

ACTTCAATGT AATATTTTTA ATTTTAGCTG GTCCATAACT ACCTTTTTCT GTTTCGTTGA 3 06C 

ATCCTGGATT ACGTTGAAAC GTTGCTTGAT ATGCATTTTT CTGTGTCATA ATGTATGCGC 312 0 

CACTTGCATA CAGCGCATTT TTCCCATCTG AATTTGCAGG AATTGTACTG CTATCCCCAT 3 180 

ATCCTTTTGG ATATTCTTGA TTTACTTGAT TAACAAATTT TTTAGATAAA ATGCCTGCCG 3 24 0 
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(A) LENGTH: 10182 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLOGY : linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

w TTTTTGATTC AAAGTGGTGA TTTAACAAGC ATTTTAAATA GCAATGATTT GAAAGTCACA 6 0 

CATGATCCTA CCACTGATTA TTATAATTTA TCTGGTAAGT TGTCGAACGA TAATCCAAAC 12 0 

GTTAAACAAT TAAAACGTAG ATATAATATT CCTAAAAACG CATCAACAAA GGTGGAATTA 180 

15 AAGGGAATGA GTGATTTAAA AGGCAATAAT CATCAAGATC AGAAACTTTA TTTTTATTTT 24 0 

TCAAGTCCTG GAAAAGACCA AATCATTTAT AAAGAAAGCC TTACTTATAA TAAAATAAGT 3 00 

GAACATTAAT ACTTATGCTG TAATTATAGA AACATCCAAA TCATCTATTA nAATCCTATA 360 

20 

TTATAAAAnC ACCTCACATA ACTCGTTCAA CTGTACCAAA CCACATTACA TTAGATTTTA 420 

GGCTAACTAT TGTGATGTAC ATCAAAAACG AATTTGTGAG GCGTTGTATA TTTTACAAAG 480 

GTGACTAGCG TTTCGTATAG CATTTCCAAC ATTACTACAC TCAAGCGTCA CGCTAAAGTT 54 0 

25 

CGAAATCGAA TCCTTTCATT CAACAAAAGC TCATATCCAC TACAAACTTC ATATCAAGCG 600 

TATAAACTAT CTTGTGATAC TATCTCGATC ATATCTATAG TATGCATTTG TGTTCCGTTT 660 

30 CACTGAAGTA TATGTATCAT CAGTTAAGTA TAAACCGTCA TCCTTCAATG TTACTTGATA 720 

AGCATATTTC CGTGCTAACC AGG CAATATC TATATAATTT TCTCCTGCGT TTTCATAACT 780 

TCTTAAATCT TCAATATGTG CACTAACTTC AGGGaAAATG ATTCTAACAA CACTTTCATC 84 0 

35 AACCCAATAT TTGTCATGCA TCCATCGCAC TTGATCTGCC AATAAAGGTA ACTGCACATC 900 

ATTGAAATAT AGACGAAAGC CGTCACTATC ATACATTTGC CGATATGGTA ATGGCTGTTT 960 

TCTAATCACT AACACCTCGC CACCCATTAC GGTGCCTTCT CTAGTATCAT CACTTCCACC 1020 

40 

CGAAGCTTCA TACGTTGTTG GGTCAACCTG TAGTCCATGT ACATCTCCAA TATAAGCATC 10 80 

TGGTTTATGT TCCATTGCAT GTCCATGTGC AATCAATGCT AATATTGTAG ATTGTGAAAA 114 0 

TTGAGGCTCC CATTCAATGC GATTAGGATG GCTACTATAA ATTCTAGGTT CATCTATAGC 12 00 

45 

CTGCTGAATA TCCATGCCAA ACACTAATAC ATTGATTAAT GTTTGCGCAA CACTAGCAAT 12 60 

GATACTTATG GCACCAGGTG CACCTACTGT TAATATTGGC TTCCCGTGAT ACATCACAAT 1320 

50 CGTTGGAGCC ATGTTACTTA GTGGTCGTTT ATATGGTGCA ATTTCGTTAA TACCACCATC 13 8 0 

TACTACATCA AAGCCATCCA TTGTCGTATT CAATAACACA CCGTAGCCTG GAATCGTGAT 144 0 

ACCTGAACCA TAAATCATAC CAATTGATGT CGTAAATGAA GCAATATTAC CTTCCTTATC 1500 
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ATAGTTACTA ATGAATTGAA TAAGTTCAAA GGCTTTGAAA CATCATATAT AATAAACGAA 9180 

AATCAAGTTT CCTATTATGA AATTATAACA CTACTTAATA AACGTCCCCT CgACAAGTCG 924 0 

5 ACTATGGTAA CAAAATTCAA TATCTTAATT TTTATCATAC AGAACT AT CT AACGCATTAT 9 3 00 

TTGCAATTAA ATTTGCCCAT TAACCTATTT TTCATAAAAT GTCATTTAAA CAAGTTATTT 93 60 

ATTAAAATTC ACTTTATTAC ATAAATTATA CAATTArAAA GTTTCTTCAA ATTGTAAAGA 94 2 0 

w 

TGCATTAATC GAGTTATAAT CATAATGATT AAGATGGT 94 58 
(2) INFORMATION FOR SEQ ID NO: 115: 

1S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

AnGCGT AT CA TGTCACGCAT TTTAACTACT TCTTTACCAC AAGATTATAC AGTCACATTA 6 0 

25 

GTTGATCGTA TGCCATTTCA TGGATTGAAA CCAGAATTTT ATGCTTTAGC TGCGGGCACG 12 0 

AAATCAGATA AAGATGTTCG TATGAAATTC CCTAATCATC CACAAGTGAA TACAGTTTAT 180 

GGTGAAATTA ACGACATAGA TTTAGATGCT CAAATTGTCT CAGTCGGTAA TTCTAAAATT 24 0 

30 

GATTATGATG AG CTAATCAT TGGTTTAGGA TGTGAAGATA AATATCATAA CGTTCCAGGA 3 00 

GCCGAAGAAT ATACACATAG TATTCAAACA CTCTCAAAGG CTCGGGATAC TTTC CATAGT 3 60 

35 ATTAGTGAAC TACCAGAAGG TGCTAAAGTC GGTATCGTTG GTGCTGGATT AAGCGGCATA 420 

GAACTTGCCA GCGAATTAAG AGAAAGTAGA TCAGACTTGG AAATATATCT TTATGACCGT 4 80 

GGGCCGCGAA TTTTAAGAAA TTTTCCAGAA AAATTAAGTA AGTATGTTGC GAAATGGTTC 54 0 

40 GCCAAAAATA ATGTTACCGT TGTTCCAAAT TCAAATATTA ATAAAGTTGA ACCTGGTAAA 6 00 

ATATATAACT GTGATGAACC TAAAGATATT GATTTAGTTG TATGGACAGC AGGAATTCAA 66 0 

CCTGTTGAAG TTGTTCGTAA CTTGCCGATT GATATAAATA GTAATGGACG CGTGATAGTT 72 0 

45 

AACCAGTATC ATCAAGTACC AACATATCGT AACGTCTATG TAGTTGGTGA TTGTGCTGAT 7 80 

TTACCACATG CGCCAAGTGC TCAGTTAGCC GAAGTTCAAG GTGATCAAAT TGCCGATGTG 84 0 

CTTAAAAAGC AATGGCTAAA TGAACCATTA CCTGACAAAA TGCCGGAACT AAAGGTACAA 9 00 

50 

GGTATCGTTG 910 
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ATGTGATGTT AGTAGATCAT AGGGTTGATG ATAATATTAA AGCTGAAAAC GTTATATTTA 73 8 0 

TTGGCCTTTT GTGTAAACAT GGACATTGGC ATGCAGTCAT TTATGACATT GCTCAAGACA 74 4 0 

AAACTGCCGA ACTCGAAATT GAAAATATTA TAGATATTTC GTATTCATTC GGTAAGACGA 75 00 

TTCAAACCAG AGACATATCC ATTGATAACT ATCATCAATT TTTAAACCCC ATCGATTCCT 75 6 0 

AAAAAACAGC AGTAAGATGA TTTTCAATTA GAAAATATCT TGCTGCTGTT CTCTATTTAT 762 0 

ACAATACTTC GTATTGAATG GnTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG 76 8 0 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC 774 0 

TTTGAAATAC TTAAAAAAAT AAAACACTTT GCCCAACTTA CACTACCAAT AGAAACTGCT 7800 

GTTAGAATTC CTCAAAATGA TATTTCGCGA TATGTTAATG AAATTGTTAA AAAGATAGCT 7860 

GATAGCGAAT TCGATGAATT CAGACATCAT CGTGGCGCAA CATCCTATCA TCTAAAAATG 7 92 0 

ATGTTAAAAA TCACCTCATA TTCATATACT CAATCTGAAT TTTCTGGCCG TAGAATAGAA 7980 

AAATTACTTC ATAACAGTAT TCGAATGATG TGGTTAGCTC AAGATCAAAC ACCTTCTTAT 804 0 

AAAACTATTA ATCTTTTTAG AGTGAATCCT AATACTGATG CGCTAATTGA ATCTTTATTT 8100 

25 ATTCAGTTTC ATAATAAAAT GCATATCAAA AAAGCTGATT TCTATCAAAT AATTAATAGA 8160 

AATCAGCTTT TTTCaTTGCC TAAAAACTTA ATGTCCCGAC CTCTTTATCT ACGCATAAAT 8220 

ACTTATTACT GATATAACGA AAGAAACAAA ATTATTTGCT ATATGTAATG CAATTGTTGA 8280 

ACCTAGGTTT CTTCCAGATT TTAAATAAGT GAAAACTAAT ATGATGGATA GTATGAGATA 834 0 

TGGACCAAAC TCAAACGGCG ACTTTGCATC AGTCACATGA ATAAATGCAA ATAAGAACAC 84 00 

CGAAACAATA CTCATAGCTA TAAAATTAAA CTTCTTACCT AATTCTCCAA TTAAAATATG 84 60 

TCTAAATACG ATTTCTTCAA CTATTGGACC TACAATCACA ATTAATAAGA ATGCTACAGG 8520 

TAAAAATGCA GG CACTTCAA ACATTTTATT TAGCTCAAGT TCATTGGCTG TTtCACTATA 858 0 

TTGCAAATGT TTAGGTAGAA ACTGTGTCAT ATATTCATAT GTATAAATTA AGATGAGAGC 8 64 0 

AATAATATAC GTTATTGACA ATCTAAGC CA ATATTTTTTG ATATACGCAA AACCAGCTCG 8 700 

AAGCCTTGAT GGCATCACTT TTAAATGAAA TAAATAAAAT GCGCCAATCC CAATCGTATA 876 0 

TGCTAAAGCT TGTGTGATAG TCGCTACAAA TATCAGATTA CTATCGATTT CATAATAACC 882 0 

AAACAAAATT GGTCCTATGT AAGCTGCAAT TGTGAGTGCA TAAAATATAA CACCTATAAT 888 0 

TGGAATTATA AGCAAATCTC TCCATGCTAT ATCTTTAAAC GTGTATTTCT TTTTTTCATT 8 94 0 

50 TTCCaCTGTT ATATCCtTTC CTGTTTAATA ATTGATTTTT GGAGGTACTT CTACATGATA 9000 

AACGAAACTA AGTATATGAG ACAACAAATT ACTAATTTGA TTCAAATCAT TGATACGATT 9060 
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GGTAGGCTCA TTTTACTTTT AGACGAACGT 
ACATACTTTG TCTGTTTTCT CTATTTATTA 
5 TGTAGACGTA TAACTATTTT TTATCATTTT 

TGATAACCAC CATTTGCATT TAAAATTTTA 
ATAATAAAAG GTGCACCCTT TAAATGATCA 

10 

AGATTCAATA GTTCTGCAAA TAAAAACTGT 
TTAATATGTG CCCCAAATTG ACCTTTTGCC 
ACTAAACGAT AACTAAATGA GGCGTCAAAT 

15 

TGTGCATTAA ACGATATAAT AGCGTCTTCC 
ATTCCATTAC AAAAAGCACC TTCTCCTCGT 

20 TCATATACGT ACGATAACAT TGGTTTACCT 

TCTTCTTGCT GTTTTACTAA ATTGGCAGTT 
TTAATTTCAT TCGTAATCAT TTCATTACTT 

25 AAATGTGTTG CTAAAAATTG TTGGAATTGT 

AAATCAAATC GATGACGCTT AGTTTCTGTA 
TTGTCTATTT GTTTCAACCA CGAACATATT 
GTCATTTCGT CCACCACTTC TCATATCATT 
ATCAACAATA CAACTGAAGA CTTCTTCATT 
CTAGCCGACA AATAGAAAGG AAAGTAAGTA 

35 

GAACGATTAA AAGCTATCTG TTATATAGCT 
ACAAGTAATA TCATAGCACA ATCTTTTTTA 
GATTTTTTTA AAAAGATATT GAAAATGTCC 

40 

AATATATCAT CTTTTAGGAG GTGGCTGTCA 
TTACTGCAAT TCAACAAAAT AAAAAAATGA 

45 TATCCAAACG CACAATTT7A AGAGATATTG 

ATGCGCATTA TGGGAAAAAT GGTGGTTACC 
TAAACTTATC TGAAACACAA TTATCAGCCT 

50 ACTCGACATT ACCATATAAA AGCGAAATCA 

rA~AAA~A~^ nTTAAGAAAA T~C;CTTAAAr 



TTCAATCCCA CCACTCCTTT ACTATTCCTT 55 8 0 

TATAGTAAAA TAATTTTTTT ACTATACTTC 5640 

TTATCTCTAG AGAATATCTA TCTGTATTTT 57 0 0 

AGTACCGTTT CATGACATGC TTTATTACTT 5760 

ATTGCCTTAC CATCTAAAGT CGTCATTTTT 5 82 0 

GCAGCAATGT CCCAAGGTTT AGGATTTGTA 588 0 

ACTCGCATAG AATCTAATCC GCAAGCACCA 5 94 0 

AAATCTTGCA CCGTATCTAG ATTCATCACT 6000 

AATTTTAACG ATGGTGGTTC TTCCATCTTA 6 060 

ATTGCTTTAT AAAGCTTTTT ATGCGGATAA 612 0 

TCATAAAAAT ACGCCAATAT AATACAATAA 6180 

CCATCAATGG GATCCATAAT CCATAAATGA 6240 

TTTTCTTCCG CTAATAGTTG GTGTTCCGGA 63 00 

TGTTGAATCT GTTTATCTAC ATTTGTAACT 63 60 

GTCATTTCCA TAATTAATTG CGGAATAACA 6420 

AACTTATCTA TTTGCTGTAA TGTTTT AT CT 6 4 80 

ATCATTTTAT TATTACCCTA TATTAAAAGA 6540 

TTATGCATAA AAAAATCGGC TAGTCACGTG 6 6 00 

ATAAATATTG AAGATGTTGT GATGTAACTT 6 6 60 

CTACCCCTTT GTTTAATCGC TCCCCCTGTT 6720 

AAATGTAAGC GTTTTCCACA AAATTTTTAC 6 7 80 

TCATTGTCAC TCTTATGTTA TACTTTGTGT 6 840 

TGAATAAAGC TGAAAGGCAA AATTTAATAA 6900 

CCGCTTTAGA ATTAGCTAAA TATTGCAACG 6 960 

ATGATTTAGA AAATCAAGGT GTTAAAATTT 7 02 0 

AAATACAACA AGCACAATCT AAAATTGCAT 70 8 0 

TATTTTTAGT GCTTAATGAA AGTCAGTCGT 714 0 

ACGCAATTAT AAAACAATGT TTAAGTCTTC 72 0 0 

GCATGGAOTT TTATATTAAA TTTGAT^ArA ~>?£" 
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TATGCTTTTC TAAAATTCGA TCAAGCGCTT 
CAGTATTTAC AGTCATTGGA CGATCAATCA 
5 CAATCTCAAC AGTACATGTA TCACATGTTT 

TTGAAGGTAC AAAAGTATCT TGTGATTTAA 
CAAGATAATC TTTTCCATCA AGTGTAACCA 

10 

CTATATATAT TTTCCGTAAA TGACTTTTAA 
TGCCCCACAC ATCTTTCAGA TAGAATTAAT 
AAGTAAAATT TTGTATTTTG CCTTTTTACA 

15 

ATTAAATCAT CTTTTTGTTT AATTGAAAAT 
TTTCACGCTT TTTGCCATAT CTTTCACAAC 

20 CACCTAAAAA TCGTTATACT ATTTATAAAT 

TTGATAAATA TCTACTATCA TTTAGAAGGT 
GATTAATTTA TAAAAATCAA ATCAGGCATT 

25 ATCACCTTCT ATTTACGGGC TATTAGTTCT 

CTAATTAATT TGTGTACAAT TTTGATAACT 
TCTTTTAATA ACTTAGTACT TTCAGCTTTT 

30 

CCGTCACTTT GAATGCCGCC TTGACCACTC 
TCTTTATAAT TGCTTCTAAT CGTATTCAAA 
TTTTGAATTT CATTCATTAG ACTATTAAAA 

35 

TTGGCCATCG CTTCAAGCAC AATTTGCTGA 
CCTTCTTCTT TACGACTTCT AATAAACTTC 

40 TGTCCTTTTG TAAAACGAAC ACCATCAACA 

CCGCCGATGC TATCTATCAT ATTATGCAAA 
ATTGGCACAT TCATTAATTT TTCAAGTGAT 

4 5 TAGGCATGTG CAATTTTTTC AGTAGTACCA 

GGTATACTTA CTATTTCAGT TTTCTTCGTT 

tCACTACGCT CTCCGCCACC CTTTTTCTTA 

50 

GCGATTGTGA ATGGATCACC ATCGTTTAAA 

TTGCGATCTA ACGGATTGTG TATCTTATTA 

55 



CTTTTTGAGC ATCTTTCACA TCATTGTTCA 37 80 

CCGTACTACA TGAACGTTCA ATTTTACCGT 3 84 0 

GAATTGGTCC CATCGACTCG TTATAACAAA 3 900 

TAAATTCAAG TAAATTCGTA CCTGGTTCTA 3 96 0 

CCAAATGTTC TTGCATATTA CTCACCCCGT 4 02 0 

TAAATTGCTC ATATCCACCT AAAATAACGA 4080 

TTAATTGTAT TACTTTATGT ACTAGTTGTT 414 0 

ATCATTTTTA TTTGAAATAT TTTGCGCGAA 4200 

AATTATCATT ATTAGTTTTC CAATTATCTG 42 6 0 

CTTATTAATG ACAATATTTA ATAATCACCT 432 0 

ACCCTTTTTC TGAAAATTAA TAACCCAAGT 43 80 

AATATTTATC TTTAAATTAA ATTTGTAATG 444 0 

AAATAAAATA GCCCATAAAT ACAAAGTGTT 450 0 

ATTCGTTATT CTATTTACAG ATCATTCTAT 4 560 

TATTTTCCCT TAGTTTACTA CTCTAGATTA 4620 

GACTGCTCAC TAGGAATGAA GTAGTACAAT 4680 

AATTGATGTT TATTAATCGT GTCATTAGCA 474 0 

TCACCTAATG TTAAATCTGT TTTAACATTA 4 80 0 

TGTGTAATCG ATGATGGGCT TGCAATCTTA 4 86 0 

CGTTGTTGTC GACCAAAGTC ACCACCAGCA 4 920 

AATGCTTGAT CACCATTTAC ATGTGTCTGC 4 980 

GTGAATGTAT CATTACTTAC TACATCAACA 504 0 

CCATCCATAT CGATTGTCGC ATAATGATCA 5100 

TTAACAGCCA TATTTGGTCC ACCATATGCA 5160 

CGGCCAACAA TTTCCGCTCT TGTATCACGC 5220 

TTAGGGTTGA TAGATAAAAT CATAATACTA 5280 

CGATCAGCAT CTGAATCGAC ACCAAATAAA 534 0 

CTCACTTTTT TATCTCTTAA TTCTGAATGA 5400 

CCAGTAATAA AAATTTTAGC AGCTACATAC 5460 
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w 



15 



25 



TTGCTGGCAA TACAACATCT GCGTATGTTG CTGTGAATGT TAAAAATTCA TCTTGGACTA 19 8 0 

CCATGAAATC TAATTTTTCA AACGCAGCTT GTACAAAATT AATATTTGAA TCCACAATAC 204 0 

CCGTATCTTC ACCATATAAG TACAATGAGT GTACTTCTCC GTCATGTATA CCTTCTACCA 210 0 

TTTCATGATT ATCTTTACCA GCTTTTGGAT TCAATTTAAC GCCATATTCT TTTTCAAATT 216 0 

TAGCGCGAAT ATCATCCGCT TCAATACTTT GATAACCAGT AATCTTATCA GG CAT ACTTC 222 0 

CCATATCACT ACATCCTTGA ACATTATTAT GTCCACGTAA TGGATACGCA CCAGTACCAG 22 8 0 

GACGACGATA ATTACCTGTT ACTAATAATA AGTTTGAAAT CGCTGTACTT GAGTCACTAC 234 0 

CAATGTCTTG TTGTGTAATA CCCATTGCCC AACAAATTAC AACAGATTCA GCTTTAGCAC 24 00 

ATTCTTCAGC AAATTTAATC AATTCTGATT CAGGAATACC TGTTGCTTCT TCAGCAAAAG 24 6 0 

CCATTGTAAA TGTTTCTAAT GATTTGTAAT ATTCATCAAA ATCATCTACC CACTCATCAA 2 52 0 

20 TAAATGCTTT ATCGTGTAAA TCATGATCAA TAATATACTT AGTCACTGCA CTTAACCACG 2580 

CTAAATCCGT ACCTGGTTTA GGTTGATAAA AACGATCCGC ACGTTCTGCC ATTTCATGTT 2 64 0 

TTCTAATATC AAATACATGT ATTTTTTGAC CAAATAATTT TTGTGCACGT TTCATGCGTG 2 700 

ATGCGATAAC TGGATGAGCT TCGGCTGTAT TAGTACCTAT CAATACAGAC ATTGCCGCTT 2 760 

TTTCTAAATC TTCAATACTA CCTGAGTCAC CGCCGTGTCC AACCGTTCTA AATAAGCCTT 2 820 

TTGTTGCAGG TGCTTGGCAA TATCTTGAAC AGTTATCAAC GTTATTTGTG CCAATAACTT 2 8 80 

GTCTTGCTAA TTTTTGCATT AAATACGATT CTTCATTCGT CGCTTTAGAA GAAGAAATGA 294 0 

ATGATAGTGC ATCTGGGCCA TGCTTTTCTT TAATAGCTGT AAAATTATCT GCAATGACGT 3 00 0 

TTAAAGCTTC ATCCCATTCT ACTTCATGGA AC7CACCATT TTTCCTTACT AGTGGTTTAG 306 0 

TTAATCGTTG AT CTG AATTA ATATGTCCCC ATGAAAACTT ACCTTTAACA CAAGTCGCAA 312 0 

TTTTATTTGC TGGAGAATCA TGTGATGGTT GTACTTTTAA AATTTCTCTA TCTTTAGTCC 318 0 

AAACTTCAAA TGAACAACCC ACACCACAAT AAGTACACAC TGTTTTAGTT TTCTTAATAC 3 24 0 

GCTCTTTACG CATTTCTGCT TCTGAATCTG AGATTGCAAA TAGTGGACCA TAACCAGGTT 3 3 00 

CTGCTTTTTT AGTTAAATCA ATCATTGCTG CTAATGAACC AGGTTCCGTA TCAGTCATAT 3 36 0 

AACCCGCATT ACCTTCCATA TTCACTTCCA TCATGGCATT ACATGGACAT ACCGTCGCAC 34 2 0 

ATTGACCACA AGATACACAT GAAGACTCAT TAATCGGTAC ATCATTATCC CAAATAACAC 34 8 0 

GTGGATGTTC ACGATCCCAA TCAATTCTAA TAGTTTCATT CACTTCGATA TCTTGACATG 3 54 0 

CTTCTACACA ACGCCCACAT AAGATACATT GATTTGGATC ATAACGATAA AATGGGCCGT 360 0 



30 



35 



40 



50 



EP0 786 519 A2 



CTTAGACAAT AAAAAATATG CCACTACAAT 
AACGATTACT TTCATCGTTG TTCTATCTCT 
5 TGATAATGAA ACAGCAAAAA AAGTAATAGC 

TAAACCTATG ACTAATAATG TATTAGAAAT 
AATATGCACT CACCCTTTTT ATTTAAATAA 

10 

CATAGGCCTG TCGATGATTG ATTCACAATA 
TCATTTATTC CATCAAAAAC ACCCTTTTTA 
ACACTGCTTG CATGTAGAAA CACTTTTTCA 

15 

TTTAGCATAT TAAGCATGTA TGTTAAACTA 
AATAGACTTC TTATAGTTGT CTTAGCATTT 

20 TTATTAACTT TATTTAGGAA GTCGCTTAAA 

ACCATTGAAG CCATATTAGA TAATAGCCCT 
AATTTATTTA TGATGACTTG ACGTCCTCGA 

25 GCATCATCTA ATGTTTTAAT AAGCTTGATT 

ATTGCCTCTG TAACTTCATT TAAACTTTCT 

CGTCTAATTT TAGAAGATAT TCTCTCAGCC 

30 

CATAATCTGA ACGTTCCCAT TTTTTCTGTA 

TATTGACACG GAAATTATTA GGGTTCAACG 

AACGACAGCT AGTACGTTTA TAAGATGGTG 

35 

TTAATAAGTT AATTGCACCT AGATCTCCAT 
AGADTCTTT ACCTTTAACA CGATCTGTCA 
CAGAAATCAG CTTAACTTCT GCACCTTCAT 

40 

AAATTTCAAC AAATGCACGT GGCACTTTGT 
TACCTTCATG GAAGTGCTCT AACAATCGAC 

45 CTTGCTTAAA GTAATTATCA AATGATAATG 

TGAATCCTTC TAAGTATAGA ATAGGCTCAT 
AACTATTGAA TCCTTCTAAA CGATCATAAC 

50 CTACTTCATC CATAATTTCA CTAGGATGCT 

CAATTGCTTG GAAAATTTTC CAGTCAGGTT 

55 



CGCTAATATT ACGATTAAAA AAGAAGCGTT 18 0 

GAACATCATA TTAAAGACAA CTAGACTAAT 24 0 

TAACACTAAT TTCATCATAA ATAGACAGAC 3 00 

TACAGCTGAC GTTTTTAACA TTCTCGaATT 3 60 

CTTACATAAT CATAATAATA CATGATGTTT 420 

GCACGTGATT TTTTTGTTTT TCAATATTAT 4 80 

ATTTTTACAA AAATTAAAAA AAGTGCTCCT 54 0 

TTGTAATGTT ATTCTTCTCG AGACATACCT 600 

CGGTTCATGT CGTCATCTTT CAATACGCCC 660 

GGACTCGCTT GATTGGCAAC GTGTAATCCT 72 0 

TCTGATACAT TGAGTTCACC TAATAAAAAT 780 

GTATAAATAT CTTTATTAAG TTCAACTGCA 84 0 

ATTGCACCAT TTAAAGCATC TAATAGTTTT 90 0 

GCTTTTAATA TACTATCTTT ATTCGCTGCA 960 

AACTTAATTT GTTCTTCTGA TTTTTCTAAG 102 0 

ATTATTTATC CACCTGATTT CCCGGGAAAA 108 0 

CTTGAACACT GTACTGCGGT TGACGTTTTT 114 0 

GTGACTTACC ACGTTTCGTA ATTACCTCCA 12 00 

TATCCGTGTA TTGATCAACA TCACTaTTAG 12 60 

TTTCCATCGC aTCaTTATTT AATGGAATAT 132 0 

CGTGAACTTG TAATAC CGCT TCTCCTGTyT 13 8 0 

GAATGCCTCT ATCTTCAGCA AGCTCTGGAG 144 0 

ATTTAATCAT TGGTGTTTGA TAAGTCATAT 1500 

CATTGTTTAC ATGAATATCA TAAATTTCAT 156 0 

GGAATAATTT TGCTTTACCA TTATCAAAAT 1620 

CAGTACCATC AGGTTGTACT GGCCATTGTA 16 80 

TTACCCCAGC ATATAGAGGT GTTAAGCGTG 174 0 

TGTAATTCCA ATCAAATCCT AAT CTATT AG 1800 

TTkAATCACC AAGAGGTTCT AATGCTTGGT 1860 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

AAAGTTTTAA AAGGGGTGAG ATACTTGGCG AATAATCCAT TCCAGCTTTG CGTTTAAAAG SO 

5 GAATTATACT TGCCATTGTC GGTGCTTGTT TATGGGGATT AGGTGGTACT GTTTCTGATT 120 

TCTTGTTCAA ATATAAGAAT ATTAATGTCG ATTGGTACGT CACTGCTCGA CTTGTAGTCA 130 

GTGGTGTTTT CTTACTTATT ATGTACAAAA TGATGCAACC CAAACGTTCA ATATTTAGCG 24 0 

10 

TATTCCAAGA TCGACGTATG TTAGGCAAAT TACTTATCTT CAGTATACTG GGCATGTTAG 3 00 

TAGTACAATA TGCTTATATG GCATCTATTA ATACAGGTAA TGCTGCGATT GCAACATTAC 3 60 

TACAATACAT TGCGCCAGTT TATATTATTA TTTGGTTTGT CATAAGAGGC GTTGCAAAAC 420 

75 

TAACATTATT TGATGTGCTT GCTATTATCA TGACACTATT AGGAACATTT TTATTATTAA 4 80 

CAAATGGTTC ATTTTCTAAT TTAGTCGTCA ATCCTGCAAG TTTATTCTGG GGTATTTTAG 540 

20 CTGGTGTAGC ACTCGCTTTT TACACAATTT ATCCTTCAGA CCTACTTAAC CGCTTCGGTT 6 00 

CGATTCTAAT TGTCGGGTGG GCAATGCTTA TTTCTGGTGT TGCGATGAAT TTACGCCATC 6 60 

CAATTTGGCA CATTGATATC ACTAAATGGG ACATATCAAT TATATTATTT TTAATCTTTG 72 0 

25 GTATTATCGG TGGTACCGCA CTCGCATTTT ATTTCTTTAT CGACAGTTTA CAATACATAT 780 

CAGCGAAAGA AACAACATTA TTCGGAACTG TTGAACCTGT CGTAGCCGTT ATCGCAAGCA 84 0 

GTCTATGGTT ACATGTGGCA TTCAAACCAT TTCAAATCGT AGGCATCATT CTTATTATGA 900 

TTTTAATTTT ATTACTATCA CTTAAAAGAC AACCTGAAAC ATTAGATGAA TAAGAAAACT 96 0 

CTGATAATCA CTTTAGCAAG TAACTATTAT TTAACAACGT AGTTACCTTA TAGGTGATAT 1020 

CAGAGTTTTT TATTTTAGTT AATAATATTT TTCACTTGGT ATAAAAAaGC GTCGTCGCTC 108 0 

35 

TGGTAATCGG AAATACTGGA ATAAAATATG GAATTGGGTA ATAATCCCAG GTAnTAAAAG 114 0 

TCCATGTTCC GATAnCCTnT CCGCAnCTCC AACCAAATTT GCCGATAAGG TTCCAAAAGG 12 00 

CATCCTGGGG GTAC 1214 

40 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9458 base pairs 
us (B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
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TATGGATTCG ACATGTCCTT GTGCCTGGTT ATTCTGATGA TAAAGACGAT TTAATTAAAC 17220 

TAGGGGAATT TATTAATTCT CTTGATAACG TCGAAAAGTT TGAAATTCTG CCATATCATC 17280 

AGTTAGGTGT TCATAAGTGG AAAACATTGG GCATTGCATA TGAATTAGAA GATGTCGAAG 17 340 

CGCCCGATGA TGAAGCTGTT AAAGCAGCCT ACCGTTATGT TAACTTCAAA GGGAAAATTC 17400 

CCGTTGAATT ATAAATACAA TTCAGACCGA AAAGAAAGCA TATGCAACTT CAAGAGTGAA 17460 

GGGGCATATG CTTCTTTTTC AATTGAGTAT TGAGTATTAG CAAGACGTAG TAAGTATATG 17520 

AGACAACTTC TACAATGGTT GAAGGAAGAC GTTTTTGTAA GTAGCTATGC TGATAAAGAA 17580 

TGTGATGTCT TGTTAAAGGT GGGGTTCCAA TATCATCATT TAGCTGATGT TGAATGGGTT 1764 0 

ATTATTTGCT ACTTGCATAT GAATATGAGT CTTTTCAAAT TTTTATTGAC CCTGAGTAAT 177 00 

GAAAAATATT AAGATGAAAC TTAATATTAA AgCAATGCGG AGCGTGATTA TGAAGAGAAT 17760 

TAGTAAAGAT ATATGGGCAG TATTTAAATT ACTGTATCaA AATAAAGGGC GTTTTAGCAT 17820 

TAATGCCTTA CTATTGCAGT TAATCATGAT TTTTATTAGT AGTACATACT TAATTTTACT 17880 

ATTTAATATG ATGTTAAAAG TAGCTGGcAA AGCCAACTTA CGATTAACAA TTGGACGGAA 17 940 

25 ATCGTTAGTC ATCCCGCCAG TGTGATACTT CTTATTATAT TCATATTAAG TGTTGCCTTT 18000 

CTGATTTATG TAGAGTTTTC ATTGTTAGTT TATATGGTTT ATGCCGGCTT TGATCGACAG 1806 0 

ATTATTACAT TTAAATCCAT TTTTAAAAAT GCCTTTGTAA ATGTGCGTAA ACTCATAGGT 1812 0 

GTACCAGTTA TTTTCTTTGT CATTTATTTA ATGTTAATGA TACCCATTGC CAACCTAGGA 18180 

CTAAGTTCAG TATTAACAAA AAATATTTAC ATACCTAAAT 1TTTAACGGA AGAACTTATG 18240 

AAAACGACGA AAGGTATAAT CATTTACGGT ACCTTTATGA TTGCTGTATT TATATTAAAT 183 00 

TTTAAATTAA TATTTACTCT ACCGTTAACG ATTTTAAACC GCCAGTCGTT ATTTAAAAAT 183 60 

ATGAGACTAA GTTGGCAAAT TACGAAGCGA AAT AAGTTT C GGCTTGTTAT AGAAATAGTT 184 20 

ATATTAGAAC TCATCATTGG TGCGATTTTA ACATTAATTA TTTCAGGAGC AACATATCTT 18480 

GCTATTTGTG TAGATGAAGA AGGAGATAAG TTTTTAGTCT CATCAATTTT ATTTGTTGTA 18540 

TTGAAAAGCG CATTGTTCTT CTATTATkTA TTtACGAAAT TATCATTAAT CAGTGTGTTA 18600 

4$ GTACTGCACT TAA 18613 

(2) INFORMATION FOR SEQ ID NO: 113: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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AGACCCAACT TGGGTAACTG AATCTATCGG TGGTGTAGGT ATTGACGGAC GTCCACTTGT 15420 

TACGAAAAAC TCATTCCGTT TCTTACACTC ATTAGATAAC TTAGGTCCAG CTCCAGAACC 15480 

AAACTTAACA GTATTATGGT CAGTACGTTT ACCTGACAAC TTCAAAACAT ACTGTGCAAA 15540 

AATGAGTATT AAAACAAGTT CTATCCAATA TGAAAATGAT GACATTATGC GTGAAAGCTA 15600 

TGGCGATGAC TATGGTATCG CATGTTGTGT ATCAGCGATG ACAATTGGTA AACAAATGCA 15660 

ATTCTTCGGT GCACGTGCGA ACTTAGCTAA AACATTACTT TACGCTATCA ATGGTGGTAA 15720 

AGATGAAAAA TCTGGTGCAC AAGTTGGTCC AAACTTCGAA GGTATTAACA GCGAAGTATT 15780 

AGAATATGAC GAAgTATTCA AGAAATTTGA TCAAATGATG GATTGGCTAG CAGGTGTTTA 15840 

CATTAACTCA TTAAATGTTA TTCACTACAT GCACGATAAA TACAGCTATG AACGTATTGA 15900 

AATGGCATTA CATGATACAG AAATTGTACG TACAATGGCA ACAGGTATCG CTGGTTTATC 15960 

AGTAGCAGCT GACTCATTAT CTGCAATTAA ATATGCACAA GTTAAACCAA TTCGTAACGA 16020 

AGAAGGTCTT GTAGTAGACT TTGAAATCGA AGGCGACTTC CCTAAATACG GTAACAATGA 16 080 

CGACCGTGTA GATGATATTG CAGTTGATTT AGTAGAACGC TTCATGACTA AATTACGTAG 1614 0 

25 TCATAAAACA TATCGTGATT CAGAACATAC AATGAGTGTA TTAACAATTA CTTCAAACGT 16200 

TGTATACGGT AAGAAAACTG GTAACACACC AGACGGACGT AAAG CTGGCG AACCATTTGC 16260 

TCCAGGTGCA AACCCAATGC ATGGCCGTGA CCAAAAAGGT GCATTATCTT CATTAAGTTC 16320 

TGTAGCTAAG ATCCCTTACG ATTGCTGTAA AGATGGTATT TCAAATACAT TCAGTATCGT 16380 

ACCAAAATCA TTAGGTAAAG AACCAGAAGA TCAAAACCGT AACTTAACTA GTATGTTAGA 16440 

TGGTTACGCA ATGCAATGTG GTCACCACTT AAATATTAAC GTATTTAACC GTGAAACATT 165 0 0 

AATAGATGCA ATGGAACATC CAGAAGAATA TCCACAGTTA ACAATCCGTG TATCTGGTTA 16 560 

CGCTSTTAAC TTCATTAAAT TAACACGTGA ACAACAATTA GATGTAATTT CTCGTACATT 166 20 

CCATGAAAGT ATGTAACAAA ATTTAAGGTG GGAGCACTAT GCTTAAGGGA CACTTACATT 16 68 0 

CTGTCGAAAG TTTAGGTACT GTCGATGGAC CGGGATTAAG ATATATATTA TTTACACAAG 16 740 

GATGCTTACT TAGATGCTTG TATTGCCACA ATCCAGATAC TTGGAAAATT AGTGAGCCAT 16 800 

45 CAAGAGAAGT CACAGTTGAT GAAATGGTGA ATGAAATATT ACCATACAAA CCATACTTTG 16B60 

ATGCATCGGG TGGCGGTGTA ACAGTCAGTG GTGGCGAACC ATTGTTACAA ATGCCATTCT 16 920 

TAGAAAAATT ATTTGCAGAA TTAAAAGAAA ATGGTGTGCA CACTTGCTTA GACACATCGG 16 980 

50 CTGGATGTGC TAATGATACA AAAGCATTTC AAAGGCATTT TGAAGAATTA CAAAAACATA 17 04 0 

CAGACTTGAT ATTATTAOAT A T A AAA OA T A TTGATAAT^fl '-BAA^AT.^"" i\ - — 
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CCACGCTCCG AAAAATCTTC GTTATGCAAG TTTGAAAGCA GTACTTGAGT AGATCCGTGT 


13620 


WWII TV •» MMIMf !/""» T» m 

TTAATTTCAA 


TTTTGACATG 


CTCTTGTTTT 


rp/~» » TA STTPST 

TCAAA1 1(-A1 


TT 71 A 71 71 TTfifl 


nv^vjnn x wv\vj 


13680 


TTTGATTGAT 


t\ c^r* r* tv r* tv tv t> tv 
A LUOACjAA IA 


TV TV ^n^O rnrrt TV TV T 1 

AACTGTTAAT 


ACA1 111 1A1 


r* nr! 7a tt p 7i n 71 


\J 1 VJAvvv-VJ X 


13740 


TTAGCGCATG 


f »«»i/-* -i T' TV TV TV TV TV 

CTGaTAAAAA 


TV TV I*/"* TV O IV TV TV T 

AATGAGAAAT 


TV TV TTV fT"7V TV P 7\ 

AATACjCAAIjA 


T A T 71 71 A TTTT 


TH ATTT f ATf^ 

1 VJrt 111 V^^rt X VJ 


13800 


ATATCCCAT C 


TV TV T""P f^l ' TV TP T 

AATTCTA LXjL 


ATATTTTAAT 


T\ TV A T 7A TV 'TT^T 

ALAAIAA1 1 1 


TAflPAATAAA 


1 v3nv.uv«Al/vt 


13860 


GTAATGTTAA 


TV fTT TV IIHIHII TV /"» TV TV 

ATATTTAGAA 


ATGTTTATAG 


A 1 bAL 1 1 vjr 1 1 


A tv r< tv n^ r PTY3P 
AALjA^vj 1 1V7C 


A A ATHTTPiTPr 


13920 


ATAGCACAAA 


ATTTTTGTTT 


/""T^PTA TV P TV P/* 1 TV 


1 1 I AL-L-UAVivjr 


v, iVj l*vu\r\ 1 \— 


/WV^ lull 1 


13980 


ATTTTATTTG 


m tv /"^ «»TVimfTi TV rp 

TAGCTGTTAT 


TV rp TV TV TV TV TV 

AT AAAAAT CG 


VtCAAUlAIAI 1 


OTA 71 Pn/ITTPTl 


AAA^TflAATT 


14040 


TTTACGTCAA 


rpj tv TV TV TV /"^fi TV fTV|< 

TAAAAGTATT 


TAATCLAtj I \~ 


TPTTP TA 't' 1 TV T*TA 

1*_1 1LA1A1A 


•PTA TA a TA ^ r P7V TV TV 




14100 


TGTTGATTTA 


ACGCTTATCA 


TV TV TV TV ' !'« IWT* 

ALAA1CA1 1 1 


1 1 IA1AAAUA 


h ATATATAr*T 
AA 1 A 1 A 1 1 


rr*TAAATTAA 


14160 


C1T1TAAAGC 


AATGAAAATA 


/—I rfi/— i TV TV /"^ TV ' 1 WT'TV 

GTGAACATTA 


TAACTGTTGT 


Vj 1 AAvJAVxAAl 


ppf. ATTAP.r'A 


14220 


TATTACTGTT 


■« /"i* iv % rmn tv 

ACACAAATTA 


GTACAGTTTC 


TATGTTTTGA 


PTA TV P TA M * 1 * 1 YTI 

LAI AL-Al 1 l\j 


A Til A A A ATTP. 
A 1 orttthn 1 x o 


14280 


TACATAATTT 


ATGTGAAAAA 


W TV f^r^TV ^ TV TA TV 

AATCACAACA 


AACAIXjL 1 Ac 


TA TA TP 7A <T A TV 
AA 1 VjAC 1 A 1 vj 


A A A A rTZTT A A 


14340 


CATAGCATTT 


CAAATTCACA 


ACATTATACA 


GATGGAGGC-G 


1 1 1ACj1A1\jI 


TAPTAATir'AAA 
1AoAAAL«AAA 


14400 


TaAAAATCAT 


GCAACAGCTT 


nop* TV TA rpTT* 

GGCAAGGATT 


T»TV TV TV T\ TV TPLP' TV 

T AAAAAT Ljvj A 


tv p tv TrV 7a 71 H TA 
Avj A 1 UVjAALH 


C AO ArWTACA 


14460 


TGTAAGAGAG 


TTTATCCAAT 


Tl TV Tt TV ^"W T 1 TV /"^TV 

TAAACTACAC 


TCTTTATGAA 


{jLj 1 AA I bA 1 1 


1 111 1 nuL 


14520 


AGGACCAACA 


GAAGCAACTT 


CTAAACTTTG 


GGAACAAGTA 


ATGCAGTTAT 


pna tv rpa AfS A 


14580 


ACGTGAACGT 


GGCGGCATGT 


GGGATATGGA 


CACGAAAGTA 


GCTTCAACAA 


X V^/lv</\ 1 v# X Vu>^k 


14640 


TGATGCTGGT 


TATTTAGACA 


AAGATTTAGA 


AACAATTGTA 


GGTGTACAAA 


V_ X vinnnnw v_ v_ 


14700 


ATTCAAACGT 


TCAATGCAAC 


CATTCGGTGG TATTCGTATG GCGAAAgcAG 


CTTGTGAAGr 

v> 1 1VJ1 


14760 


TTACGGTTAC GAATTAGACG AAGAAACTGA 


AAAAAT CTTT 


ACAGATTATC 


GTAAAACACA 


14820 


TAACCAAGGT 


GTATTCGATG 


CATATTCTAG 


AGAAATGTTG 


AACTGCCGTA 


AAG CAGGTGT 


14880 


AATCACTGGT 


TTACCTGATG 


CATACGGACG 


TGGACGTATT 


ATCGGTGACT 


ATCGTCGTGT 


14940 


AGCTTTATAT 


GGTGTAGATT 


TCTTAATGGA 


AGAAAAAATG 


CACGACTTCA 


ACACGATGTC 


15000 


TACAGAAATG 


TCAGAAGATG 


TAATTCGTTT 


ACGTGaAGAA 


TTATCAGAAC 


AATATr'f^Tnr 

r\r\ 1 a 1 .1 vjt Vw 


15060 


ATTAAAAGAA 


TTAAAAGAAC 


TTGGACAAAA 


ATATGGTTTC 


GATTTAAGCC 


GTCCAGCAGA 


15120 


AAACTTCAAA 


GAAGCAGTTC 


AATGGTTATA 


CTTAGCATAC 


CTTGCTGCAA 


TTAAAGAACA 


15180 


AAACGGTGCA 


GCAATGAGTT 


TAGGTCGTAC 


ATCAACATTC 


TTAGATATCT 


ATGCTGAACG 


15240 


TGACCTTAAA 


GCAGGCGTTA 


TTACTGAAAG 


CGAAGTTCAA 


GAAATTATTG 


ACCACTTCAT 


15300 



55 



EP0 786 519 A2 



5 



10 



15 



20 



25 



30 



35 



40 



50 



GTAACGTAAC 


ATTTG pp. AT A 


ATTGTTGGAP 
nx lux x vjvjxVL 




VJV.X/v\X X X v_Vj 


VjAGATAACGT 


11820 


AATTAAATAT 


TGTATTGTTT 

*• W X *» X X W XXX 


GCATCGTATT 


GAATAGGAAA 


TY1AGGPTGGA 


Ai XVjVjUCj x 1 C 


11880 


TATTTCCTTT 


AACTGAATAT 


CACGPAAGPG 


r\v.vj x 1 cioin 


TGPTPGATAG 
X Vj v_ x v^ \jr\ x AVj 


AA X vjVjA X vJAvj 


11940 


TTGCTCATTT 


GATTCAAATA 


AAT CGT AAAT 
x v— vj i x 


ATAATTATTA 
f*x/wx xnx Xrt 


ATTTPTTrTA 

r\ X X 1L1M 


L? X 1 UAV_ X Vj 1 1 


12000 


Grrrrri'AAA 


GGCGTATATG 


TACPTAGATG 


APGA'rrri w i'G 

rl\vVJ/\ X X X X X VJ 


GPATAGTAAA 


TTTTTTPa AT 
X X X X 1 lUftAl 


1 A C f\ 

1 2. Ub U 


AATCGTTTCG 


ATATdTTTG 


TTTGT PG TTT 

XXX X V— XXX 


AGPCATATTA 


TPTGPGPTIV A 

X v_ X vjv_ Vjk_ X f\J\ 


TG A A & PP AAA 
X UaaaLLaAa 




TATTACTAGT 


AAAACAAGAA 


CT APGG P CAT 

Vw x nuuvjuun X 


AAPAATTAAP 


AAPGTGATRP 


P ATPTTP A AT 
L-A 1L1 1LAA1 




GTTTTCATGT 


A TAT c "rri'AT 

X *V X V_ X X X r\ X 


AAAT AATG AG 


A PG 2i TWxTr* a 
X \j\j x La 


vJv^aXVjtVjX X 1a 


A x 1 1 x Av-AvjA 




TTCATTCATA 


AATCCGAATT 


GTTGTGGTfT 

VJ X X VJ X V_7vJ X V— X 


ATAr*TTTTr*A 

A X nv, X X X X V^rt 


PPTATArtTA IV 


A APPPTPATP 
AAv-Vjvj X L-A X L. 


l^f J U U 


GTTGGCGTAT 


AAAATATTGT 


PATATTGATP 


nlllv.un 1 nnu 1 


PPP AATTPTP 
VjV-VjAM X 1 Vj 1 


vjVjI 1A1LI I 1 


t o "3 c n 


CtTAATTTPA 

V_ L. X r\r\ X X X 


PTTAAAPGTG 


VJVJVJ 1 vj 1 UAWt 




1 laAvjvJAlAl 


ATGTACTATT 


12420 


TTTGAATTTA 


AGCTGATGPG 


TTGAAAATAA 




X X X Vj 111a 


A ATPTTP AT A 
AA1 Vj X 1 LAI A 


i t a □ n 


ATTATTGGTT 

X X J-fc X X VJnJ X X 


ATAAAPTGAT 


X X vjVj x OV^non 


TAATTPATi IV 


lAAAUlbl IVj 


PPP P PTPTTP 
LLtVjLiL. 1 vj 1 iVj 


lzb4U 


GkGTATTAAT 


T*TT A A T A A TT 
x x xnnxnnx x 


P A rn 1 1 " 1 " 1 " 1 'PT 
V— rt.v».Vj X X X X X 


*Vvjv_vjvj l WiCii 


TP A TP A. TP A T 
1 LA 1 Vjii X Lt A 1 


I IvjyXAAAlL 


xzbUU 


GAGCTCTTGA 


AAPGAATTAT 


TATPPTGTGT 
lni vj v_ X Vj X Vj X 


rVrt X/xrVrVi vj x V. 


X VJ>iA 1L1 vjV_ 1 


TTTPAPTATP 
111 UAL? 1 A X Vj 


X^C D o U 


ATGTAAAGAT 


GACTGACTTT 


PATPAAPATp. 


TTG ATG A ATP 

1 X VJrt 1 VjMM 1 V* 


GTAPPATPPT 
vj X X vjv, 1 


P A ATPP A A AT 
LAnl LLAAA1 




ATAGATGGCA 


TAGAAGPTTA 


PTAGTGPAAT 
v_ x r\\j X V. Lnn X 


AATAATCAfT 
/in. X An X Vjrtv.. x 


a a a a & t l\ ptp 
aaAaa 1 aL 1 Vj 


PA A A A AT APT 
LrAAAAAX AVj 1 


1 z / o V 


AGACnCAAAT 


AACGATCGTP. 

\ x v_vj X v_ 


TTAATTGATG 

X X X X \3r\ X w 


TPTATAAGGT 
X V_ X rt innOVJ X 


TTflT A TP PPn 


TPATTP A ATP 
X LA 1 1 vj AA X L 


1 O Q A Pi 


ATCTCCAAAA 


ATTTATGATG 

*V XXX X X VJ 


TGGAATATrf 


GGTAATTTAG 

Vj\j X r\n XXX n\J 


IV TTTPGP.T 2L T 
n X X X V_i^jVj irtl 


TA A AGGT ATP 


1Z3UU 


TTCTTAAGAT 


TTTCGATAGA 


V- X Ort X V-.VJV-. X X 


TGTTrAPTAA 

X vj i X V_rtV«. X J"\J\ 


p R TP PTTTPP 
V-hM. 1LL1 XX LVj 


A ATTPAPTTP 
An X X LtAL XXL* 




GCAT-CGAACT 


CTGCAACTAA 


TPGTt GTTGT 

1 v-ul Lui X vj X 


ATTGAGPGGP 

AL X OAvJVvVJvJV. 


TTT.TTAAATA 
X X Vj X innnlrt 


TTGPA PT A AP 
X XvjLAL 1AAL 


1 1 PI *5 Pi 


TriTTACGCT 


TAGGATGAGG 


VJ X vj X vj v^r\ X x X 


TTA APT A A AG 
X X /vVv_ X AAAvJ 


PAATrPPATP 
V_*/vf\ X X7L.V-M.X V_ 


A A P A TTT A A P 
aaLa! X 1AAL 


1 "3 Pi Q Pi 
1 j U u U 


ATTGTTCCTT 


CAATTGGATA 

\— Ta-T* -4- A n-JXJ4» Jh •«* 


AACGATTGAT 

rtrtwvJrt X X vjrt X 


APAGGATAAP 


PTTTGTTTTT 
VvXXXVJX xxxx 


pp A TP.TGPP.T 
LL/\X vj X VjLVj X 


1 "5 1 A Pj 
1 J 14 U 


GCATCTTGTT 


CGTAGCTTAG 


AfTTGCGTAA 

v^ Vw. 1 o V_ Vj X 


TATTTAPPTT 

Xf\X 1 Irtv-V-X X 


TTGPA APATP 

X X vjv — ttALA X V_ 


TTP A ATP APT 
X X LAA 1 VjAL X 


1 7 0 Pr n 


TTAGACGTCT 


TTGACAGTTG 

x x vj,rv\__nv7 X X vj 


C ATPG r ATGG 

V_«/A X \_>VJ V r\ X \J\J 


TTTTfYl A ATT 

X X X X VJ\J/v\ X X 


GATGP APATP 

\Jr\ X Vj V__rt\_rt. X V- 


APTTAPTPP A 

ML x x M.L 1 LoM 


1 1 9 £ Pi 
1 j z o u 


TGATGCATGP 

X Wj~k A VJ^rt X VJV_. 


T ATAAAT AG P 

X n X t\nf \ X A\vj v_. 


APGGATATGT 

rtv»VJv_^rt X AX vj X 


TGAT1VGPPTG 
X i Mvj^v_ xvj 


T P P TTP TTP T 
ILvjI XvjX XLtX 


A TTTrY 1 A TTT 
A X X X vjvj A 111 


1 ^ J Z vj 


GAGTACGCAA 


TTTTACCTTT 


AAGTATAGGT 


TGTAATAAAT 


CTTGATAACC 


TCGAATCTTA 


13380 


ATATCTCCTT 


GTAAATCTGA 


ATTCACTACT 


ATAACTGTTG 


GCATTAATAG 


AAAACTAGTA 


13440 


A n AT A TTT A"" 
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CGCGTATCTA TTCGGTGACT CAATGGCGAA AGTTGGTTTG GCGGCTATTG CTGATCCAAC 10020 

ACGTAACGGT TTAAACATCT TTGGATATAC ATTAAGTGGA TGGACAGATG TTTTCATCGT 1008 0 

CTTCTATGTT GCATTATTCC TAGGCATGAT TCTATTAGGA ATCGTTGCTT TCTATGAAGA 1014 0 

AAAGAAAATT AGAAGTTTAA AAATTTAATA TAAATCGGAT TAAAAGTATC GCCAATCTAT 102 0 0 

TG CAATAT AG TTGGCAATCC TGCCCCGACG GCATGTGCGT GAAGAGATGA AAGATACTGC 10260 

TTCTACCCTT GCAAATATAT CATCTCTATG TCTCGGGGCA GATCATAATT CCCTGTTATG 103 2 0 

AAGTAT CCTT ATTTGCCCGA CTTAGGGTGA CTCAATGAAT TTACTCCTTA CAATAAAGAC 103 80 

ATATAGCGGT GTCAATATTG TAGGGAGTAT TGTTTTATAT TTAAACTCTC TAAAAAGCGG 1044 0 

ACTGAAAGAA AAGTGAAAAC TTCTCTATCA GTCCGCTTTT TCATAGAACA AAATGGAGGC 10500 

GCCATAATCA TTAGTTATGT GCTAATCTAT TTTGCTTGCT TACAATAATC ACTTGGCGAC 10560 

ATTTGTAAAT ATTTTTTAAA ATGATAGCTA AACATTTTAT ACTCTGAAAA GCCTACTTTG 10 620 

TCTGCAATTT CATAGTGTTT GTAATGTCGA TCTAACAATT GCAGAGATTG TAAAATACGA 10680 

TAG CG ATTT A AATAATCGAC AATTGTAATA CCAACATGAT CTTTAAATGT TCGCATCGCA 10740 

25 TACGATTCAC TAACATCGAT ATGTTGAATT AAATCTGAAA CAGtCACTTT CGTTTGATAA 10800 

GATTGCTTAA TTTGATCCAC AATCTGGTTT ACATAATAAT CAT CG TATTC TACTTTTAAT 10 860 

AG TGGTTGGA AGGCATCATG ACAAGATGCT AAGCTACGGC CGTTCTGTGA TTGTTGCTCT 10 920 

AATAAGGTAC GGACAAGTCT TCCTAAAATA ACTTCTAATT GTGCATGGTC TACTGGTTTT 10 980 

AATAAATAAT CAAGAACATG ATGTTGAATG CCGGCTTTCA TATATTCAAA GTCATCGTAA 11040 

CTCGATAATA TGATGACATT ACAATCTAGA TGCGCAATAT CATTGAGTAA ATCGACGCCA 11100 

TTTTTACGTG GCATACGAAT ATCAGTAATT ACTAATTCTG GCTGATGTTG TTGAATTAGT 11160 

GAT^ATGCTT CAACACCATC TTTAGCAGTG TATATTGTAT TGAAATGATA GTCTCCCCAA 11220 

GGAATGATTT GCTTTAATCC TTCTCGAATA ATTCGTTCAT CATCACAAAT AACTACCTTA 112 80 

AACATCTACA TTCCCCCTTG AAAGTGGTAT TTTATAACAA ATTAACGTAC CTTGATTACG 11340 

CTTTGAAAAA ATATGGAGTC GTGCATGTGA ACCATATTGA ATCATTGCTT TATTGTGTAA 114 00 

45 ATGATTTAAT CCCAAATGCT TAGTATCAAA TACATCATTA TTAAGAGATT GGCGTACATA 114 60 

TTGCAGGCGA GATGACGACA TCCCGATACC ATTGTCGCAA ACTAAAACAT GTAAATTCTG 11520 

ACGTGCCAAT GTCAGGCGTA TAGTAATGTC CAATGACTCA GTATCTCTAC CATGTTTAAT 11580 

AGCATTTTCT ATGAGTGGCT GAAGCATCAT TTTACCAATT GTCTGGTGAC GCGCTTCTTC 11640 

AGAACTTTCA ATATGGAGCT TAATCATGTC ATCAAAACGG aTGTTTTGTA TTGCAACATA 117 0 0 
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TAATACACCG ATTAATTCAG GAATGATGTT TAAGAAGTAA TTTGGGTGTT TTGTAATTTT 822 0 

ATATAATCCA GATTTAATAA TAGGATGGTT AGGTAAAATG AATAATTTTA ATGTCCAAAT 82 8 0 

5 

ACCACCTAAA GTTTTAATAA CCATAAATAA CATGATATAA GCAAAGATTA ATATAACTAA 834 0 

GCCAATACCA TTTGCAAAGC TAAATGTATC TTTATTAATA AATGCCTCTA CACCAGCCAA 84 0 0 

TACATAAATT AAAACGTGTG TTATTGCTAA AAACTTCGAA TTTTTAACGC CATATTCAAC 84 6 0 

10 

TGCACCGTCT GCTTTTAATT GTTTTGAGTG ATTAATAGAT ATCTTTAAGC TGACAAGTCT 852 0 

GATACAGAAA AAGATAAGTA ATATAGATAG AATCATGATG TCCTCCGTCA TTATGTCATA 85 8 0 

15 TGTATAAGCG TTGATTTTGA CAACATAAAG TATTTTATAG ATAAAGCTTG TCAAATACTA 864 0 

TTAACTATTT ATTAATTTTA GTACATAAAT ATGTTTCTAA GTATGTGTTT ATGTTCAGTA 8700 

TTTTGGATAA TTTAATAATT TTAAGGATAT TAAGCGCTTA CACCGACGTG ATATATTTGG 876 0 

20 CTTAACGAAA ATGATTGAGG TGACAGAGAT GAACTTTTTT GATATCCATA AGATTCCGAA B82 0 

CAAAGGCATT CCATTATCGG TACAACGTAA ATTATGGCTT AGAAACTTCA TGCAAGCTTT 888 0 

CTTCGTAGTG TTCTTTGTTT ATATGGCTAT GTATTTAATT CGAAACAACT TTAAGGCGGC 8 94 0 

25 ACAACCGTTT TTAAAAGAGG AAATTGGATT ATCTACATTA GAACTTGGTT ATATCGGATT 90 0 0 

AGCATTTAGT ATCACGTACG GTTTAGGAAA AACATTACTT GGATATTTTG TCGATGGACG 906 0 

TAACACAAAA CGTATTATCT CGTTCTTACT TATCTTATCT GCGATTACAG TTTTAATTAT 912 0 

30 

GGGATTTGTT TTAAGTTACT TTGGTTCTGT AATGGGATTA TTAATTGTAC TTTGGGGACT 9180 

TAACGGGGTG TTCCAATCAG TTGGTGGACC TGCAAGTTAT TCAACGATTT CAAGATGGGC 924 0 

GCCAAGAACG AAACGTGGCC GATACTTAGG ATTCTGGAAT ACATCACATA ATATCGGTGG 93 0 0 

35 

TGCCATAGCA GGTGGTGTTG CACTTTGGGG TGCTAATGTA TTCTTCCATG GAAATGTTAT 93 6 0 

AGGGATGTTC ATTTTCCCAT CGGTGATTGC ATTACTTATT GGTATCGCAA CATTATTTAT 94 2 0 

40 CGGAAAAGAT GATCCGGAAG AATTAGGATG GAATCGTGCT GAAGAAATTT GGGAAGAGCC 94 8 0 

GGTCGATAAA GAAAATATTG ATTCTCAAGG TATGACGAAA TGGGAGATCT TTAAAAAATA 954 0 

TATCCTGGGA AATCCTGTTA TATGGATTCT ATGTGTTTCA AACGTCTTTG TATACATTGT 96 0C 

45 ACGAATCGGT ATTGATAACT GGGCACCGTT ATATGTGTCA GAGCATTTAC ACTTTAGTAA 96 6 0 

AGGCGATGCA GTTAATACGA TATTCTACTT TGAAATTGGT GCATTAGTTG CAAGTTTATT 97 2 0 

ATGGGGCTAC GTATCAGACT TATTAAAAGG TCGTCGTGCA ATTGTAGCTA TTGGCTGTAT 97 8 C 

50 

GTTTATGATT ACATTTGTTG TCTTATTCTA CACAAATGCT ACAAGTGTCA TGATGGTTAA 984 C 
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GCAAGCATGC AGTTTGAATG TTCGTGGTCT GCAAATATCA AAGAAGATAA GGTTCACGTT 642 0 

AGTTTATCAG GAGAAGATGG CGGTATCAAT TTATTTCCAT TTGAAATATA TGAGCCCCGC 64 80 

TTTGGAACTA TTTTTGAAAG CAAAGCTAAT GTTGAGCATA ACGAAGACAT TGCTGGTGAG 654 0 

AGACAGGCGC GTAACTTTGT CAATGCGTGT TTAGGGATAG AAGAGATTGT GGTGAAACCG 66 00 

GAAGAAGCAC GCAATGTAAA TGCCCTTATA GAAGCGATTT ATCGTAGCGA TCTTGATAAC 6660 

AAGAGCATAC AACTTTAATG ATTATCATAT ATGATACAAA ATTCTCAATA TAAAAAGAAG 6720 

GAGTGCTTTT CAATGAAAAT AGGTGTATTT TCAGTATTAT TTTACGATAA AAATTTTGAA 67 80 

GATATGTTAG ATTATGTCTC AGAATCTGGA TTGOATATGA TTGAAGTTGG AACAGGTGGT 684 0 

AACCCAGGAG ATAAATTTTG TAAGTTAGAT GAGTTGTTAG AAAATGAAGA CAAGCGCCAA 6900 

GCATTTATGA AGTCAATCAC AGACAGAGGC TTACAAATAA GTGGTTTCAG TTGTCATAAC 6960 

20 AATCCAATTT CTCCAGATCC GATAGAAGCG AAAGAAGCCG ATGAAACGTT ACGTAAAACA 7020 

ATCCGTTTAG CAAATCTATT AGACGTGCCA GTTGTTAATA CATTTTCTGG CATTGCAGGA 7080 

TCAGATGATA CCGCTAAAAA GCCTAATTGG CCTGTTACAC CTTGGCCAAC AGCCTACTCT 714 0 

25 GAAATTTATG ATTATCAGTG GAATGAAAAG TTGATACCAT ATTGGCAAGA TTTAGCTGAG 7200 

TTTGCAAAAG AGCAAGATGT AAAAATTGCC ATAGAGTTGC ATGCAGGATT TTTAGTGCAT 726 0 

ACACCATATA CAATGTTGAA GTTACGTGAG GCTACAAATG AATATATCGG TGCTAACTTA 73 2 0 

GATCCTAGTC ATCTATGGTG GCAAGGTATT GACCCAATTG CTGCGATTCG CATATTAGGC 73 8 0 

CAAGCAAATG CAATTCATCA CTTCCATGCT AAAGATACGT ATATTAATCA AGAAAATGTA 744 0 

AATATGTATG GTCTAACTGA TATGCAACCA TATGGTAACG TTGCGACAAG AGCATGGACA 7500 

TTCCGTACAG TTGGTTATGG ACATAGTCCA TATGTATGGG CAGATATCAT AAGTCAACTT 7560 

ATTATTAATG GATATGATTA TGTATTAAGT ATTGAACATG AAGATCCTAT TATGTCAGTA 7620 

GAAGAAGGTT TCCAAAAAGC TTGTCAAACT TTGAAATCTG TTAATATTTA CGACAAGCCA 7680 

GCAGACATGT GGTGGGCATA ATACGAACTC GAGGTTAGTC TGAAGTTTGT CTGAAGTAAG 7740 

ACTGGTGGCA GTGTTGAATA AATGCATATG TCGCCAAGCC ATTGCCAAAA ATTTCACACC 7800 

45 TTAAATCAAG TCATTGTTTG TAAAGAAGGT GTACTTTATA TAAGTATATA GCGATGGTCA 7 860 

TACCCATTCA CAGTAACAAT CCTCACCATT GAAAAGAGTA TATAACCTTT TCAATAGTGA 7 920 

GGTATATGAT AATAAAAAAA GCCTGTTGTC ACAATGGTCA TAGACACGAC ATACTTTAAA 79 80 

50 

GGTTTCTGAA TATAATATTT CAGAATGCAC TTTAAAGATG GACGTCGATG TAGACTAAAG 804 0 

TGATGACAGG CTTTCATCTT TTTAAATATT CATTAATTTC TCTTCTTGTT TAATACGTAC 8100 
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GGGTGTGGTG GTATTGCGAA TGGCAAGCAC ATGCCAAGTT TACAAAAAGT TGAAAATGTT 4 62 0 

GAAATGATCG CATTTTGTGA CGTAGACATT TCGAAAGCAG CGAGTGCGGC AGAAGCATAC 4 6 80 

GGAACTGACA ATGCAAAGGT TTATGATGAT TACAAAGCAT TGTTAAAAGA TGACACGATT 4 74 0 

GATGTTATCC ATGTTTGTAC GCCAAATGAC TCGCATTGTG AAATTACTGT AGCAGGGTTG 4 8 00 

CATGCTGGTA AACATGTGAT GTGTGAAAAA CCAATGGCTA AAACGACAGC AGAAGCTCAA 4 86 0 

AAAATGATAG ATACAGCTAA ATCAACAGGT AAAAAATTAA CAATAGGTTA TCAAAATCGT 4 92 0 

TTCCGAGCAG ATAGTCAATT TTTACATCAA GCAGCGCAAC GTGGCGACTT AGGAGACATT 4 9 80 

TACTTCGGAA AGGCACATGC CATTCGTCGT CGAGCAGTAC CAACATGGGG TGTCTTTCTA 504 0 

GACGAAGAAG CTCAAGGTGG AGGACCATTA ATCGATATCG GTACACACGC TTTAGATTTA 5100 

ACGTTATGGA TGATGGATAA TTATGAACCA GAATCAGTGA TGGGTTCAAC ATT C CAT AAA 5160 

20 TTAAATAAAC AGCATCATGC GG CAAACGCT TGGGGTTCAT GGAATCCAGA TGAATTTACA 522 0 

GTTGAAGATT CTGCGTTTGG ATTTATTAAA ATGAAGAATG GAGCGACGAT CATTTTAGAA 52 8 0 

TCCGCTTGGG CGATTAATTC TTTAGAAGTG GATGAGGCAA AATGTTCATT ATCAGGAACT 5340 

AAAGCAGGTG CTGATATGAA AGATGGTCTA CGTATTCATG GTGAAGACAT GGGTACACTT 54 00 

TATACCAAAC ACGTTGAATT GGAAAACAAA GGCGTCGACT TTTATGAAGG TAATGAAGTG 54 6 0 

GATGAAGCTG AAGAAGAAGC AAAAGCTTGG ATTGATGCAG TTGTAAATGA TACTGAACCA 552 0 

GTTGTGAAAC CGGAACAAGC AATGGTAGTT ACAAAAATTC TTGAAGCGAT TTATCAGTCT 5 580 

GCAAAATCAG GCAAAGCAAT TTACTTTGAA TAACATCATA CGGTAAGGAG GCACATCATG 5 64 0 

ACAAAATTAA AAGTTGGTGT GATAGGTGTT GGTGGTATTG CACAAGACCG TCATATTCCA 57 00 

GCATTGCTGA AACTCAAAGA CACAGTCTCA TTAGTTGCAG TACAAGATAT TAATACAGTG 5760 

CAGATGATTG ATGTTGCGAA gCGCTTTAAT ATACCTCATG CAGTTGAGAC ACCTAGCGAG 5 82 0 

CTGTTTAAAC TTGTTGATGC GGTGGTCATT TGTACACCTA ATAAATTCCA TGCTGATCTT 5 8 80 

TCTATAGAAG CATTGAACCA TGGTGTCCAT GTATTGTGTG AAAAGCCAAT GGCGATGACG 5 94 0 

ACGGAAGAGT GTGATCGCAT GATTGAAGCG GCTAATAAAA ATCACAAATT ATTAACTGTC 6 00 0 

45 GCATATCATT ATCGTCACAC AGATGTGGCA ATTACTGCTA AAAAAGCAAT TGAATCAGGT 6 06C 

GTGGTTGGTA AACCTTTAGT AGCACGTGTA CAAGCGATGC GTAGGCGTAA AGTGCCTGGC 6120 

TGGGGTGTTT TTACCAATAA AGCGTTGCAA GGTGGCGGTA GTTTAATCGA TTATGGTTGC 6180 

CACTTGTTAG ACTTATCTTT GTGGCTACTA GGTAAAGATA TGGTGCCGCA TGAAGTGCTA 6 24 0 
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TGTAGTTGTA TTTCCATTAA TAyyTATGTT TGGAGTAGCA TTTACAAATT ACAATTTATA 2 82 0 

CAACGCGCCT CCGAGACACA CATTAGAATG GGTTGGTTTA GATAACTTTA AAACGTTATT 2 880 

CACAATTGGC GTTTGGCGTA AAACATTTTT CAGTGTTATT ACTTGGACAT TAGTATGGAC 294 0 

GCTTGTTGCA ACGACACTTC AAATTGCATT AGGGCTGTTT TTGGCAATTA TTGTAAATCA 300 0 

CCCTGTCGTC AAAGGTAAGA AATTTATCCG TACTGTGTTA ATCCTACCTT GGGCTGTACC 3060 

ATCATTTGTG ACAATTTTAA TATTTGTAGC GTTATTTAAT GATGAATTTG GTGCGATAAA 312 0 

TAATGATATT TTGCAACCTT TATTAGGTGT AGCACCAGCA TGGTTAAGTG ATCCGTTTTG 3180 

GG CAAAAGTG GCATTAATCG GCATTCAAGT ATGGCTTGGA TTCCCATTTG TCTTTGCACT 324 0 

GTTCACTGGA GTACTGCAAA GTATTTCATC AGATTGGTAC GAAGCAGCAG ATATGGATGG 3300 

TGCGTCTAGT TGGCAAAAGT TTAGAAACAT CACATTCCCG CATGTCATTT ACGCCACAGC 3 360 

GCCATTGTTA ATTATGCAAT ATGCAGGTAA TTTCAATAAT TTTAATCTTA TTTATCTATT 3420 

TAATAAAGGC GGTCCACCAG TGTCAGGGCA GAATGCTGGT AGTACAGATA TCTTGATATC 34 8 0 

TTGGGTGTAT AATCTGACAT TTGAGTTTAA CAACTTCAAC ATGGGTGCAG TTGTGTCATT 354 0 

25 AATTATTGGA TTTATTGTTG CTATTGTCGC ATTTATTCAA TTCAGACGTA CAAGTACGTT 3600 

TAAAGATGAG GGAGGTTTAT AAGATGACAA AGAAGAAAAA CATATTAAAA GCAATCGGTA 36 60 

TTTACAGTTT TATAGCGATG ATGTTTGTCA TCATTTTATA TCCACTACTG TGGACATTTG 3 720 

GCATTTCCCT TAATCCAGGT ACGAACTTGT ATGGTGCCAA AATGATACCA GACAATGCAA 378 0 

CATTTAAAAA TTATGCATTC TTACTATTCG ATGACAGTAG TCAATACCTG ACTTGGTATA 384 0 

AAAATACGCT TATCGTAGCA TCTGCAAATG CACTGTTTAG TGTGATATTT GTCACGTTAA 3 900 

CAGCATATGC TTTTTCTAGA TATCGCTTTG TTGGTCGTAA ATACGGGCTG ATTACATTTT 3960 

TGATTTTACA AATGTTCCCT GTATTAATGG CAATGGTCGC AATCTATATT TTGCTAAATA 4 02 0 

CAATTGGATT ATTAGATTCT TTATTTGGAC TAACACTGGT ATATATTGGT GGAT CAATAC 4 080 

CGATGAATGC CTTTTTAGTG AAAGGTTACT TCGATACGAT TCCAAAAGAA CTTGATGAAT 414 0 

CTGCCAAAAT TGATGGTGCA GGGCATATGC GTATTTTCTT ACAAATTATG CTTCCATTAG 4200 

45 CTAAGCCGAT TTTAGCAGTT GTTGCTTTGT TCAATTTTAT GGGGCCATTT ATGGACTTTA 426 0 

TATTACCTAA AATACTATTA AGAAGTCCTG AAAAATTCAC ATTAG CAGTT GGATTGTTCA 4 32 0 

ACTTTATTAA TGATAAGTAT GCAAATAATT TCACAGTGTT TGCAGCAGGG GCAATTATGA 4 38 0 

50 

TTGCAGTACC TATAGCAATC GTATTCTTGT TCTTGCAACG CTATTTAGTA TCAGGTTTAA 44 4 0 

CAACAGGTGC GACAAAAGGT TAGTTTGAAA TTAGGAGTGG GGCAGAATTG ATAAAGAACC 450 0 
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AAATTAGATT CAAGAACTCA AGTGATGGCG AACGACAAGA TTACACTAGC ATTTGATATG 1020 

AATAAGTGTC ACTTTTTTGA TGAAAAAACA GGAAATCGTA TCGTCTAAGG GGG AGTATTC 10 80 

ATGTCTAAAA TTTTAAAATG TATCACGTTA GCCGTGGTAA TGTTATTAAT CGTAACTGCA 114 0 

TGTGGCCCTA ATCGTTCGAA AGAAGATATT GATAAAGCAT TGAATAAAGA TAATTCTAAA 12 0 0 

GACAAGCCTA ACCAACTTAC GATGTGGGTG GATGGCGACA AGCAAATGGC GTTTTATAAA 12 60 

AAAATTACGG ATCAATATAC TAAAAAAACT GGCATCAAAG TAAAGCTTGT AAATATTGGT 13 2 0 

CAAAATGATC AACTAGAAAA TATTTCGCTA GACGCTCCTG CAGGAAAAGG TCCAGATATC 13 8 0 

TTTTTCTTAG CACATGATAA TACTGGAAGT GCCTATCTAC AAGGCTTAGC TGCTGAAATC 144 0 

AAATTATCAA AAGATGAGTT GAAAGGTTTC AATArGCAAG CACTTAAAGC GATGAATTAT 15 0 0 

GACAATAAGC AACTAGCATT GCCAGCTATC GTTGAAACAA CCGCACTTTT TTATAATAAA 156 0 

20 AAATTAGTGA AAAATGCACC GCAAACGTTA GAAGAAGTTG AAGCTAATGC TGCCAAACTA 162 0 

ACTGATAGTA AAAAGAAACA ATACGGTATG TTATTTGATG CTAAAAATTT CTATTTTAAT 168 0 

TATCCGTTTT TATTCGGCAA TGATGATTAT ATTTTCAAGA AAAATGGCAG TGAATATGAT 174 0 

ATTCATCAGC TAGGACTAAA TTCAAAACAT GTCGTCAAGA ATGCTGAACG ATTACAAAAA 1800 

TGGTACGACA AAGGGTATCT TCCTAAGGCA GCAACACATG ATGTCATGAT TGGTCTTTTT 186 0 

AAAGAAGGAA AAGTAGGACA ATTTGTCACT GGACCGTGGA ACATTAATGA ATATCAAGAA 192 0 

ACGTTTGGTA AAGATTTAGG AGTAACAACA TTACCTACAG ATGGTGGCAA ACCTATGAAA 198 0 

CCATTTCTAG GTGTACGTGG TTGGTATTTA TCTGAATATA GTAAACATAA GTATTGGGCT 204 0 

AAAGATTTAA TGCTGTATAT CACTAGTAAA GATACATTAC AAAAATATAC AGATGAAATG 2100 

AGCGAAATTA CTGGACGTGT TGACGTGAAA TCATCTAATC CAAATTTAAA AGTGTTTGAA 216 0 

AAGCAAGCAC GTCATGCTGA ACCGATGCCT AATATTCCTG AAATGCGACA AGTTTGGGAA 222 0 

40 CCGATGGGCA ATGCAAGCAT ATTTATTTCA AATGGTAAGA ATCCTAAACA AGCGTTAGAT 2280 

GAGGCGACGA ATGATATAAC GCAAAATATT AAGATTCTTC ATCCATCACA AAATGATAAG 2 34 0 

AAAGGAGATT AGTTATGACG AAACGTAACC CTAAATTAGC GGCATTATTA TCTGTTATAC 24 0 0 

45 CTGGTTTGGG ACAGTTTTAT AATAAAAGAC CCATTAAAGG GACGATATTT TT7ATCTTTT 24 6 0 

TCATCAGTTT TATTTCTGTT TTTTATAGCT TTTTAAATAT TGGTTTTTGG GGATTGTTCA 252 0 

CATTAGGGAC AGTACCTAAG TTAGACGATT CTCGTGTCTT ACTTGCACAA GGTATTATTT 2 580 

CTATCTTACT CGTTGCTTTC GCAATCATGC TATATATCAT TAATATTTTA GATGCATATC 2 64 0 
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AATTTTATGT T ^ CAAAAGT AAACAATCAA AAGATGTTAG AAGATTGCTT CTATATAAGA 660 

AAGAAAGTGT TTGTAGAAGA ACAAGGCGTC CCTGAGGAAA GTGAAATTGA TGAATATGAA 720 

TCTGAATCTA TTCACCTCAT TGGATATGAT AATGGACAGC CAGTTGCCAC TGCTCGAATA 780 

CGCCCTATTA ATGAAACAAC TGTCAAAATA GAACGAGTAG CTGTGATGAA ATCACATCGT 840 

GGACAAGGAA TGGGTAGAAT GCTTATGCAA GCTGTAGAAT CATTAGCTAA AGATGAAGGT 900 

TTTTACGTAG CTACTATGAA TGCCCAATGT CATGCTATCC CATTTTATGA AAGTTTAAAC 960 

TTTAAAATGA GAGGTAATAT ATTTCTTGAG GAAGGCATCG AG CAT ATTGA AATGACAAAA 1020 

AAGTTAACCT CGCTTAATTA AAAAAAGTTG TATCTATTTT AGAAACA 1067 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



25 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AAGACGtAtG ATAACAACAA TACgTGTAGT GAAAGATTTT AATCTACATA TTACTGACAA 60 

AGAATTCATT GTATTTGTTG GACCATCGGG ATGTGGTAAA TCAACAACAT TACGAATGGT 120 

TGCTGGACTA GAGTCTATCA CATCTGGAGA TTTTTATATT GATGGGGAAC GCATGAACGA 1B0 

TGTTGAACCA AAGAATAGAG ATATTGCGAT GGTATTTCAA AACTATG CAT TATATCCACA 240 

TATGACTGTT TTTGAAAATA TGGCATTTGG GCTAAAGCTA CGTAAAGTAA ATAAAAAAGA 300 

GATTGAACAA AAAGTTAATG AAGCAGCTGA AATATTAGGA TTAACTGAGT ATCTTGGTCG 360 

TAAACCAAAA GCGTTATCTG GCGGACAGCG TCAACGTGTT GCTTTGGGCA GAGCTATTGT 42 0 

TAGGGATGCG AAAGTCTTTT TAATGGATGA ACCATTATCG AATCTTGATG CGAAyTtCGA 480 

GTACAAATGC GCACAGAAAT ATTGAAATTA CATAAGCGAC TTAATACTAC GACAATTTAT 54 0 

GTTACACATG ATCAAACTGA AGCATTGACG ATGGCTAGTC GAATTGTTGT TTTGAAAGAT 6 00 

GGCGACATTA TGCAAGTCGG CACACCTAGA GAAATATATG ATGCCCCTAA TTGCATATTT 660 

GTGGCGCAAT TTATCGGCTC ACCAGCAATG AATATGTTGA ATGCTACAGT TGAAATGGAC 720 

GGATTGAAGG TAGGAACACA CCATTTTAAA TTACATAATA AAAAATTTGA AAAGTTAAAA 7 80 

50 GCTGCTGGCT ACTTAGACAA GGAAATTATT TTAGGTATTC GAGCTGAAGA CATTCATGAA 84 0 

GAACCAATAT TTATTCAAAC TTCTCCAGAG ACACAATTTG AATCTGAAGT AGTTGTATCC 900 
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TAGTGCTTGA TCCTGGTCAT GGAGGTAGTG ACCAGGGTGC TTCAAGCAAT ACTAAATATA 4 8 00 

AAAGTTTAGA AAAAGATTAT ACGTTGAAAA CAGCAAAAGA ATTGCAGCGT ACTTTAGAAA 4 86 0 

AAGAAGGCGC AACTGTTAAG ATGACAAGAA CAGACGATAC ATATGTTTCA CTAGAAAATC 4 92 0 

GTGATATCAA AGGCGATGCC TATTTGAGTA TACATAATGA TGCGTTAGAA TCATCTAATG 4 98 0 

CAAATGGAAT GACaGTTTAT TGGTATCATG ATAATCAAAG AGCTTTAGCA GATACGTTAG 504 0 

ACGCTACGAT TCAGAAGAAA GGTCTACTTT CTAATCGCGG TTCAAGACAA GAAAATT AT C 510 0 

AAGTGTTAAG ACAAACAAAA GTTCCTGCTG TTTTATTAGA ATTAGGTTAT ATTAGTAACC 5160 

CAACTGATGA AACGATGATT AAAGATCAAT TACATAGACA AATTTTAGAA CAAGCAATTG 5220 

TTGATGGCCT TAAAATTTAT TTTTCTGCGT AGGGCTTGCA AAAATATGTG AAAGTAGTTA 52 80 

TCATTGATAT TGAATTTTAT AACTAAAACC GTTAGTATTC TTGAAATGGT AAATGAAATA 5340 

20 GGTAGCAATC TAACTAAGAT TGTGTAGGAA TATAATCCAT AGACTGAAAG ATTATGCTGA 54 00 

GTAGTTTATA TACATTGAAC ACAAGAAGAG GTGCTTTATG AAAAGTAAAG CCGTTAAACG 54 60 

TACGTTaAAC GTTTTGAGTG GGTTTATTAA ATGCACGCTT ATAAAAAGTA ATGATGATTA 55 2 0 

25 CAATTAGGCA TGTTTTTTAA ACCA 5544 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH : 1067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

AAAAGATTGC AAATATAAAT GGCATGTTTA ATATGTTAGA ACAACAAATC ATTCATAGCC 6 0 

AAGATATGGC TCATTTTAGA AGTGAATTTT TTTACGTCAA TCATGaGCAT CGAGAAAACT 120 

ATGAAgCACT CCTAATTTAT TACAAAAATA GTATCGACAA TCCTATTGTA GATGGTGCAT 180 

GTTATATTTT AGCCCTACCT GAAATTTTCA ATAGTGTTGA TGTTTTCGAA TCAGAGTTAC 24 0 

CATTTTCATG GGTATATGAT GAAAATGGCA TTACCGAAAC AATGAAATCA CTTAGCATTC 3 00 

CATTACAATA TTTAGTTGCA GCAGCTTTAG AAGTAACTGA TGTGAATATA TTTAAGCCTT 3 60 

CAGGATTTAC AATGGGAATG AATAATTGGA ATATTGCTCA AATGCGAATC TTTTGGCAAT 4 20 

ATACAGCAAT TATTAGAAAA GAAGCACTAT AACATTAATA ATTAATTAGC TATAAAGATG 4 80 
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TGAGTTGCCA TATGGTGCTG TGCCGATTGA TTTTGCTTAT GCGATTCACA GTGAAGTAGG 30 00 

TAATAAGATG ATTGGTGCCA AGGTGAATGG CAAAATTGTA CCAATTGACT ATATTTTACA 3060 

AACAGGCGAT ATTGTTGAAA TACGTACTAG TAAACATTCA TATGGACCAA GTCGTGATTG 3120 

GTTGAAAATT GTTAAATCGT CTAGTGCCAA AGGTAAAATT AAAAGTTTCT TCAAAAAACA 3180 

AGATCGTTCA TCTAATATTG AAAAAGGCCG AATGATGGTT GAAGCTGAAA TAAAAGAGCA 324 0 

AGGATTTAGA GTCGAAGATA TTTTGACAGA GAAAAATATT CAGGTTGTTA ATGAAAAATA 33 00 

TAACTTTGCA AATGAAGATG ATTTATTCGC AGCTGTAGGA TTTGGCGGCG TGACATCCTT 3360 

ACAGATTGTT AATAAATTAA CTGAAAGACA ACGTATTTTA GATAAACAAC GTGCTTTAAA 3420 

TGAAGCACAA GAAGTTACGA AATCATTGC C TATTAAAGAC AACATCATTA CTGATAGTGG 34 8 0 

TGT CTATGTA GAAGGTTTAG AAAATGTACT TATCAAGTTG TCAAAATGTT GTAATCCTAT 3 54 0 

ACCaGGTGAT GATATTGTAG GTTATATCAC CAAAGGTCAC GGTATTAAAG TACATCGCAC 360 0 

TGATTGCCCA AATATTAAGA ACGAAACTGA ACGACTAATT AATGTTGAAT GGGTAAAATC 3660 

AAAAGACGCA ACTCAAAAAT ATCAGGTTGA TTTAGAGGTA AtGCGTATGA CCGAAATGGC 3720 

25 TTGTTGAATG AAGTACTACA AGCTGTTAGC TCGACAGCCG GCAATTTAAT TAAAGTTTCA 3 7 SO 

GGACGTTCAG ATATTGATAA AAATGCAATA ATAAATATTA GTGTCATGGT GAAAAACGTG 3 84 0 

AATGATGTTT ATCGTGTGGT AGAAAAGATC AAACAACTTG GTGATGTTTA TACAGTAACA 3 900 

AGAGTTTGGA ACTAGAGGTG CAAAATATGA AAGTAGTTGT ACAAAGAGTT AAAGAAGCAT 3 96 0 

CGGTGACGAA TGATACATTA AATAATCAAA TCAAAAAAGG ATATTGTTTA TTAGTCGGTA 4 02 0 

TCGGTCAGAA CTCTACAGAG CAAGATGCAG ATGTAATTGC AAAGAAAATT GCTAATGCAA 4 080 

GATTATTTGA AGATGACAAT AATAAATTAA ACTTTAATAT CCAACAAATG AATGGTGAAA 414 0 

TACTATCAGT TTCACAATTT ACTCTCTATG CAGATGTAAA AAAAGGTAAC CGTCCAGGTT 42 00 

TCTCAAATTC TAAAAATCCT GATCaAGCGG TAAAAATTTA TGAGTATTTT AATGcaTGCG 4260 

CTACGAGCGT ATGGTCTTAC TGTGAAAACA GGTGAATTTG GAACACACAT GAATGTTAGC 4320 

ATAAATAATG ATGGTCCAGT CACTATTATT TATGAAAGTC AGGACGGCAA AATTCAATGA 43 80 

45 AAAAAATAGA GGCATGGTTA TCTAAAAAGG GTCTTAAAAA TAAACGTACT CTAATAGTAG 44 4 0 

TGATTGCCTT TGTCTTATTT ATCATCTTTT TATTTTTATT GCTGAATAGC AATAGTGAAG 450 0 

ATAGTGGGAA CATCACGATA ACTGAAAATG CTGAATTACG TACAGGTCCA AACGCTGCGT 4 560 

ATCCAGTCAT ATATAAAGTT GAAAAAGGTG ACCATTTTAA AAAGATTGGT AAAGTAGGTA 4 62 0 

AATGGATTGA AGTTGAAGAT ACATCCAGTA ATGAAAAAGG TTGGATAGCT GGATGGCACA 4680 
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ATTAGCTACT GGTGGTACGA TTGAAGCAGC AATAAAATTA GTTGAAAAAT TAGGCGGTAT 120 0 

CGTAGTAGGT ATTGCATTTA TAATTGAATT GAAATATTTA AATGGTATTG AAAAAATTAA 126 0 

AGATTACGAT GTTATGAGTT TAATCTCATA CGACGAATAA TAAATAATAT AATTTTATCA 132 0 

AATGAAATCC TTCATCAAAT GTATAAGAAC CAATGACTTA ATTAAAAAAG TTGTTTAAGT 138 0 

TTTCTTAACA TGAGATGTTA GGATTTTTTA TTTACTGAAA ATGTTAGATG ATTGAGCATT 144 0 

ATACCTTAAT AACATCGTTT ATTTATTTCA TAAATTGTAG TATCATAGAA CTAATATTTA 1500 

AAAAATGAAA CAGTAGATTT AGGTCGAATT TTTGTAAAAG TTTTAAAAGT AGGAATAGTA 156 0 

TACAAATTAA ACTCGCTCAA GTAAAATTAA TATTACGATT AATGACGACA GGATAAATAT 162 0 

TTATCGTCGA CGGACGTATG ATTGGTGTGG GACAAATACT ATTCAACAAG AGTACCTAAA 1680 

TCATTGTTTA AGGCGAAGTA ATAAATATGA ATGGGGTGTA TCATATAATG AACAACGAAT 174 0 

20 ATCCATATAG TGCAGACGAA tTCTTCACAA AGCAAAATCA TATTTGTCAG CAGATGAATA 1800 

TGAGTATGTT TTAAAAAGCT ATCATATTGC TTATGAAGCA CATAAAGGTC AGTTCCGAAA 1860 

AAACGGATTA CCATACATTA TGCATCCTAT ACAAGTTGCA GGTATTTTAA CAGAAATGCG 1920 

ATTAGACGGA CCGACGATTG TCGCAGGTTT TTTGCATGAT GTAATTGAAG ATACACCGTA 198 0 

TACATTTGAA GATGTAAAAG AAATGTTCAA TGAAGAAGTT GCTCGAATTG TTGATGGTGT 2040 

GACGAAGCTT AAAAAAGTAA AATACCGCTC AAAAGAAGAA CAACAAGCTG AAAATCATCG 2100 

CAAGTTATTT ATTGCGATTG CCAAAGATGT ACGCGTAATT TTGGTGAAAT TAGCAGACAG 2160 

ATTACATAAT ATGCGTACCT TGAAAGCCAT GCCGCGCGAA AAACAAATTA GAATTTCTCG 2220 

AGAAACATTA GAAATTTATG CACCATTAGC ACATCGTCTT GGTATTAATA CAATCAAATG 228 0 

GGAACTAGAA GATACGGCTC TTCGTTATAT TGATAATGTG CAATATTTTA GAATAGTCAA 234 0 

TTTAATGAAG AAGAAACGTA GTGaACGTGA AGCGTATATC GAAACGGCTA TTGATAGAAT 24 00 

ACGTACTGAA ATGGACCGAA TGAATATCGA AGGCGATATA AATGGTAGAC CTAAACATAT 24 6 0 

TTACAGTATT TATCGGAAAA TGATGAAGCA GAAAAAACAA TTTGATCAAA TTTTTGATTT 2 52 0 

GTTGGCGATA CGTGTTATTG TCAATTCTAT TAATGATTGT TATGCGATAC TTGGGTTGGT 2 58 0 

GCATACGTTA TGGAAACCGA TGCCAGGACG TTTTAAAGAT TATATTGCAA TGCCTAAACA 264 0 

AAATTTGTAT qj^qjqpj^qq ATACTACAGT AGTAGGCCCA AATGGAGACC CGCTCGAAAT 2 7 00 

CCAAATACGA ACGTTTGATA TGCACGAAAT TGCTGAGCAT GGTGTTGCAG CACACTGGGC 276 0 

TTACAAAGAA GGTAAAAAAG TAAGTGAAAA AGATCAAACT TATCAAAATA AGTTAAATTG 2 82 0 
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AAGACTAAAT TTTTTGTAGC ATCGTATGCT AAGCCACCAG GTACTAATGG AATGATACCC 1764 0 

GTTACCATAA AAATGATGGC AGGTTCTTTT TGTTTACGAG CCATATAATG ACTTAACAAG 17700 

CCTAATGCTA AACTACCAAA GAAACTAGAG TATATAGTGT GCACATTAAA GCCGTTGAAG 17760 

AATAAGGTGT AAACCATCCA TCCACACGTA CCAACGAAAC CACATGATAG ATATAATTTT 17820 

CTAGGTGCAT CAAAAATGAC GCAGAA 17 846 
(2 J INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

ATTGACACTT GGTGAAAGTA ATATCGCCGC GCTATTTTGG CAAAATGGAC ACTTAGAACC 60 

TGAGTTACAA GATGAACAGC CAATTAATAT ATTAGGATCT GkTCAAATCA ACGAATGGAA 120 

25 TGGTAATCAA TCACCGCAAA TAATTATTCA AGATATTGCG ATGAATGAAC AGCAAATATT 180 

AGATTATAGA AGTAAGCGAA AAAGTTTACC TTTTACAGAA AATGATGAAA ATATTGTCGT 240 

GCTTATTCAT CCTAAAAGTG ATAAAGTAAA TGCGAATGAA TATTATTATG GTGAAGAAAT 3 00 

TAAACAACAA ACTGATAAAG TAGTATTAAG AGATTTACCA ACGTCAATGG AAGACTTGTC 3 60 

TAATTCCTTG CAACAACTGC AATTTTCTCA ACTTTATATA GTTTTGCAAC ATAATCATTC 420 

GATTTACTTC GATGGTATAC CTAATATGGA TATTTTTAAA AAGTGTTATA AAGCATTAAT 4 80 

AACTAAACAA GAAACAAATA TCCAGAAAGA GGGTATGTTA TTGTGTCAAC ATTTAAGTGT 54 0 

GAAACCAGAT ACACTTAAAT TCATGTTGAA AGTTTTCTTA GACTTAAAAT TTGTAACACA 600 

AGAAGATGGT TTAATTCGAA TCAATCAACA ACCTGATAAA AGATCGATTG ATTCCAGCAA 660 

AGTATATCAA TTAAGACAAC AACGTATGGA TGTTGAAAAG CAATTATTAT ATCAAGATTT 72 0 

TTCAGAAATA AAAAATTGGA TAAAGTCACA ATTGTCGTGA GCAATTTAGG AGGAAATATT 78 0 

45 AATGGATTTA AAGCAATACG TATCAGAAGT TCAAGATTGG CCGAAACCAG GTGTTAGTTT 84 0 

CAAGGATATT ACTACAATTA TGGATAATGG TGAAGCATAT GGCTATGCAA CAGATAAAAT 90 0 

TGTAGAATAC GCAAAAGACA GAGATGTTGA TATCGTTGTA GGACCTGAAG CGCGTGGCTT 96 0 

TATCATTGGC TGTCCTGTAG CTTATTCAAT GGGGATTGGC TTTGCACCTG TTAGAAAAGA 102 0 

AGGGAAATTA CCTCGTGmAG TCATTCGTTA TGAGTATGAC CTAGAATATG GTACAAATGT 1080 



55 



EP0 786 519 A2 



TGGGTATAGA ATACCTTCGA GGTGAGTTTT TATTTATGGA AAAAAAGAAT AAG CAAAT AG 15840 

ATAGAGGCGA TTTAAAACAA AACCTATCTG AAAAGTTTGT ATGGGCGATT GCATATGGTT 15900 

5 

CATGTATCGG ATGGGGCGCA TTCATCTTAC CAGGAGACTG GATTAAGCAG TCAGGTCCGA 15 960 

TTGCAGCATC AATTGGTATA GTTATTGGTG CATTATTAAT GATATTAATT GCGGTTAGTT 16 020 

ATGGCGCATT AGTAGAGAGA TTTCCAGTAT CAGGGGGCGC GTTTGCCTTT AGTTTCTTAA 16 080 

w 

GTTTCGGCAG ATATGTGAGT TTCTTCTCAT CATGGTTTTT AACTTTTGGT TATGTCTGTG 1614 0 

TCGTTGCTTT AAAtGCGACC GCATTCAGTT TACTAGTTAA ATTCTTATTG CCAGATGTCT 162 0 0 

15 TAAATAATGG GAAACTATAC ACCATTGCGG GCTGGGACGT TTATATTACG GAAATCATTA 16260 

TTGCGACCGT ATTACTACTT GTATTCATGC TAGTAACGAT TCGTGGCGCA AGTGTATCTG 16320 

GATCATTACA ATATTATTTC TGTGTGGCGA TGGTAATCGT CGTATTATTG ATGTTCTTTG 163 8 0 

20 GTTCATTCTT TGGTAATAAT TTTGCACTTG AAAATTTACA ACCGTTAGCT GAACCTAGCA 16440 

AAGGATGGTT AGTGTCTATT GTGGTTATTG TATCCGTGGC ACCATGGGCA TATGTTGGAT 16500 

TTGATAATAT TCCACAAACA GCAGAAGAGT TTAACTTTGC ACCAAACAAG ACATTTAAGC 16 560 

25 TTATCGTGTA CAGTTTATTA GCAGCATCAT TAACTTATGT TGTCATGATT TTATACACTG 16 620 

GTTGGTTATC AACAAGTCAT CAAAGTTTAA ATGGGCAGTT GTGGTTAACA GGTGCTGtTA 16 68 0 

CACAAACAGC ATTTGGTTAT ATTGGATTAG GTGTATTAGC AATTGCAATT ATGATGGGTA 16740 

30 

TATTTACTGG TTTAAATGGA TTCTTGATGA GTTCAAGTCG CTTGTTATTT TCTATGGGAC 16 800 

GTTCAGGTAT TATGCCAACA ATGTTTAGTA AATTACATAG TAAATACAAA ACACCATATG 16 860 

TCGCAATCAT ATTCCTAGTA GGAGTGTCGT TAATTGCACC TTGGCTAGGA AG AACTG CAT 16 920 

35 

TGACTTGGAT TGTAGATATG TCATCTACTG GTGTATCCAT TGCCTACTTT ATTACATGTT 16 980 

TGTCTGCAGC GAAATTATTC AGTTATAACA AACAAAGTAA TACGTATGCA CCGGTTTACA 17040 

40 AAACGTTTGC TATTATCGGC TCATTTGTAT CATTCATTTT CTTAGCGTTG TTATTAGTGC 17100 

CAGGTTCTCC TGCAGCACTG ACTGCACCGT CTTATATTGC ATTACTTGGA TGGTTAATCA 17160 

TCGGTTTAAT ATTCTTTGTG ATTCGATATC CTAAATTGAA AAATATGGAT AATGATGAAT 17220 

^ 5 TAAGTCGCTT GATTTTAAAT AGAAGTGAAA ATGAAGTTGA TGATATGATT GAAGAACCTG 17280 

AAAAAGAAAA AACTAAATAA TAAAAGAATC GCACAATAAA CCTTCTTCAT TCGGAGGCGT 17 340 

ATCGTGCGAT TTTTTGTATT ATAAATTGAC ATTTAAGACG AGGCAGCTGA ACCTTATATA 17400 

50 

TAATTGCTAA GAGTTAGGGC TGAGCCATTT CTAACAAATA TTTATAATCG TTTAAAAGAT 17460 



EP0 786 519 A2 



10 



20 



25 



35 



40 



CACTCCTACA 


TAATAATATT 


GTATTCATCA 


TATCA1TTTT 


AACCTAATTG 


AAAAATATTA 


14040 


AGCATTCAAT 


ATTTGATGAT 


TGTTGAAATG 


AATCATTCAT 


ACTATTGTAA 


CTTTTGAAAA 


14100 


TGTCATTCAC 


TTTAGATAAG 


TGTGATATGT 


TAAAATATGT 


CCTGAGGTGA 


GATTGAATGG 


14160 


AATGGTCGCA 


AATTTTTCAT 


GACATAACAA 


CGAAACATGA 


CTTTAAAGCT 


ATG CATGATT 


14220 


TTTTAGAAAA 


AGAATATTCG 


ACTGCAATCG 


TATACCCTGA 


TAGGGAAAAT 


ATATATCAAG 


14280 


CGTTTGATTT 


AACACCGTTT 


GAAAATATCA 


AAGTTGTTAT 


ATTAGGACAA 


GACCCGTATC 


14340 


ATGGTCCAAA 


CCAAGCACAT 


GGATTAGCAT 


TTTCAGTGCA 


ACCTAACGCA 


AAATTCCCTC 


14400 


CATCTTTACG 


TAATATGTAT 


AAAGAATTAG 


CAGATGATAT 


TGGATGCGTT 


AGACAAACAC 


14460 


CGCATTTACA 


AGATTGGGCA 


AGAGAAGGCG 


TCTTGTTATT 


GAATACAGTT 


TTAACCGTAA 


14520 


GACAGGGTGA 


AGCAAATTCT 


CATCGTGATA 


TTGGTTGGGA 


AACATTTACT 


GATGAAATTA 


14580 


TTAAAGCAGT 


GTCTGATTAT 


AAAGAACATG 


TTGTCTTTAT 


TTTGTGGGGG 


AAACCTGCAC 


14640 


AGCAAAAAAT 


AAAGCTTATC 


GATACATCTA 


AACATTGTAT 


TATAAAATCA 


GTGCATCCTA 


14700 


GTCCACTGTC 


TGCATATAGA 


GGATTCTTTG 


GATCAAAACC 


GTATTCCAAA 


GCGAATGCCT 


14760 


ATTTAGAGTC 


AGTAGGAAAA 


TCACCAATTA 


ATTGGTGTGA 


AAGTGAGGCG 


TAGATGTTGA 


14820 


ATAGAGAAAC 


TTTAATAGCA 


CGAATTGAGC 


AAGAATTAGT 


ACAAGCAGAG 


CAGGCACAGC 


14880 


ATGACCATGA 


CTTTGAAAAA 


CATATGTATG 


CCATACATAT 


ATTAACATCT 


TTATATG CTT 


14940 


CAACATCAAA 


TACACCACAT 


ATTGGTGAAC 


AACAAATGAA 


TCGTCGTATT 


GCTAACCATA 


15000 


ATCAAATGCC 


ACAATCACAA 


ATAACGCAGC 


CAACTCATCA 


AGTGACAGTT 


GCTGAAATTG 


15060 


AAGCGATGGG 


TGGTAAAGTA 


AATACGCATT 


CAGCACATCA 


TCATAATAAG 


TCATATTCAC 


15120 


AACCTTCAAA 


CCAACAACAA 


AGATTAGCGA 


CAGATGATGA 


CATTGGCAAT 


GGTGAATCCA 


15180 


TATCtGATTT 


TTAAAAAGCA 


ACAATGAAAC 


ATAATTACTT 


AATAGCTTGT 


TAAGTATGTA 


15240 


GGTTAATAAT 


CAAGACGCAT 


ATACTTTTAT 


TCGAGTGTTC 


GGATTTAAAC 


ATTTATTAAT 


15300 


ACTGAATTAT 


ATAAGGAGAG 


GTAGCAATGA 


AATTATTTAT 


TATTTTAGGT 


GCATTAAACG 


15360 


CGATGATGGC 


TGTCGGTACA 


GGTGCATTTG 


GTGCGCATGG 


TTTACAAGGA 


AAAATAAGTG 


15420 


ATCACTATTT 


ATCAGTATGG 


GAAAAAGCAA 


CGACGTATCA 


AATGTACCAT 


GGCTTAGCAT 


15480 


TATTAATTAT 


AGGTGTAATT 


AGTGGTACAA 


CTTCAATCAA 


TGTTAACTGG 


GCTGGCTGGT 


15540 


TAATATTTGC 


TGGTATTATT 


TTCTTTAGTG 


GATCATTATA 


TATTTTAGTA 


TTAACTCAAA 


15600 


TTAAAGTTTT 


AGGTGCGATT 


ACGCCAATTG 


GTGGCGTATT 


GTTCATCATT 


GGATGGATAA 


15660 


TGTTAATCAT 


TGCGACATTC 


AAATTTGCTG 


GTTAAATTTT 


AAAACTTTAG 


ATT AC CT ATG 


15720 
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ATTCATTGAA CAACACGAAC ACGAAATTAT AGCAATTAAT GACGATGGAG AGATTAAAAT 12240 

AAAAATTTCT TTGAGCACAA AAAAATAACC GATATTAGCT GCATGAACGC ATATTAATTA 123 00 

5 

GGAGATGAAA GGACAGCTAA TATCAGTTAT GTATTGTTAT TATTATTGGG AACAGAGATG 12 3 60 

AATATAGGTT ACGTTTCTTT CTTTGCACGG GGATGCATTA ATCTAAAATA ATAATAACAA 12420 

CTATATCAAT GTTTAATAAA TTCTGGATTA TTGGAACGAT TAGTCAATTT AACTAACTTT 124 8 0 

10 

CATATGATCT ATATCGTCTT GTAATAAAGA GAGCAATTTG AATATTTCAG TATCACTAAA 12 540 

TGAATCGTCA CATTTAATTG AAACATGCTG AAACGTTTTG GTTATAATTT CATAAACTGG 12 6 00 

, 5 TGCGCCTTCA TGGTGATACT GTCGATAAAT AATCATAACC TATATTACCT CCTTTGCTAC 126 60 

TCTATGGTTA TATTATAAAT AACATTTTTA TGTGTGACAT CAACCTTAAG TATCAACTTT 12720 

TTATCAGACA TAGAACGTAT GATTTACTAA GACTATTTAT GTATAAAAGT TCTAAATAAA 12780 

20 TATATATTTA TAGAGTCGCC TGGCAGTCAT TTGGGaAATA TAACATATAT GATTAGAGAG 12840 

GCATCTATCG CAAAAGAATG ATAATGATAG AGGTATTGAG CATATAGATG AGTTTAAGTT 12 900 

CATCTTGAAA ATAAAGGGTT ATTTAGTCAT AGATGTAGAT GTATAGGAAA TATTTGTATG 12 960 

25 TATTGTTCGA TATGTATGAA ATTTTCAATA AAAGCTAATA ACGCTTATAT GTAACTTTCA 13 020 

AATTTAAATT ATATACAGAG CATGATGATT ATAAAAAAAT AACCACATCA CATAAATTGA 13 0 80 

GTTCATACCC AATTTAAGTG GTGTGGCTAA TAATGTTGAT TTATAGATGA ACCGCCTAAT 1314 0 

30 

CGTTAAACCT CTGTTACTTC AACATCGATA TGTTCAATAC GGTTGTATGC ACCGTGATCC 13 2 00 

ACAGGACCAA CAAAATCATT CATTTTCCAA CCGTTTTTAA TAGCAGAAGC GACGAAAGCT 132 6 0 

TTCGCGCTAA TCACAGCTTC TTTCGGTGAC TTACCGTTAG CTAAATATGC AGTTGTTGCC 13 320 

35 

GCAGCAAATG TACAACCAGC ACCATGGTTA TAACTTTGTT GGAACATGTC TGTTGTTAGT 13 3 80 

TGATAAAATG TTTGACCATC ATAGTATAAG TCATACGATT TATCTTGATC TAAAGCTTTG 13440 

40 CCACCTTTAA TGATGACATG CTGTGCGCCT TTATCAAAGA TAATTGTTGC AGCCTTTTTC 13 500 

ATATCTTCAA TTGAATTTAA TTTACCTAAT CCTGATAATT GACCCGCTTC AAATAAGTTT 13 560 

GGTGTCACTA CCGTTGCTTT AGGTAGTAAA TATTTAATCA TCGCCTCAGT ATTTCCAGGA 13 62 0 

45 TTAAGCACTT CATCTTCGCC TTTACAAACC ATGACAGGAT CTACTACAAA ATATTGTGCA 13 6 80 

TTAGATGCCT CATATACTTC TCCAGCACGT TTGATTATCT CCTCAGTACC TAACATACCT 13 74C 

GTTTTAATAG CATCAGGTCC GATTGATAAA GCCGTTTCAA GTTGTTTTTC AAATACATCC 13 8 00 

so 

ATTGGTAATG GTGTAACATC GTGTGACCAT GTATCTTTAT CCATAGTAAC GATGGCAGTT 13 8 60 
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CAATTTTTAA AAAAGAAAGT GAAaCgnTaT 
AAGATGTATT ACACTGCAAG CGGTaAAATT 
5 AGAGGTGAAT TATAATATGA ATCAAGCAGT 

GAAATATGGT GGCACTTTAA AACATTTAGA 
ACATTTTAAA GAGAAGTATC CAGAGGTAAT 

10 

TGTTGTTGGG AATGGTGGCA ATATTGCAAG 
TTCAATACCT GGCGTCACAA TCGATCGGCA 
TGCATGTCGC ATGATCCAAG CCGGAGCTGG 

15 

TACAAGTCGA GCACCTTGGA AAATCAAACG 
TGAGTTTTAT GAGCGTGCAT CATTTGCACC 

20 TGCTGAAAAT GTGGCCAAGA TGTATGATGT 

TCGAAGTCAT CAATTGACAG CGGAAAATGT 
ACCTATAACC GTTAAAGGAG AAATATTCAA 

25 GAAAGATAAC TTTGGCCGAT TTAAGCCCGT 

TAGTTGTATG AAAAATGATG GTGCAGTTTT 
CGAATTAGGT TTCGAGCATG GTTTATTATT 

30 

TAATTTTCCT GGCATTGGTC CAGTACCAGC 
AACGATAGAA AATATTGAAG TCATTGAAAT 
CTGCCAACAA GCTTTAAATA TTTCAAATAC 

35 

ATCAGGTCAT CCATACGGTG CAAGCGGTGC 
TGACAAAGAG ACTATGATTG CATCTATGGG 

40 ATTTACTCGA TTCTAACCAG CGATTAAATG 
ATTATCAGTG TTTTAACCAA CCTTATAGAA 
TATGGCCAAA ATTTGATTTA TTTAAAAAAT 

45 CAGCAATTAA TCAAACTCAA AAGATAGAAG 
ATATTGAATG CCGACAGACT CGCAATATCA 

AAAATGATCA ACATGTCATA ACGGTtACAC 

50 V 

GGAGTTTAAT GAGATATGGA TAAATGAATA 

ACATAATGAG ATTGTGCCAG GACAATTAGT 



GAAATTCCAT CGATGATTCA TCATGTAGAA 10440 

GCTAGAGAAA AAATGATGTC GATGTATTTG 10500 

CATAGTTGCA GCTAAACGAA CTGCATTTGG 10 560 

GCCaGAACAA TTGCTTAAAC CTTTATTCCA 10620 

ATCTAAAATA GATGATGTAG TTTTAGGTAA 106 80 

AAAAGCATTG CTTGAAGCGG GGCTTAAAGA 10740 

ATGTGGGTCT GGACTTGAAA GTGTTCAATA 10800 

CAAGGTATAT ATTGCAGGTG GTGTTGAAAG 10860 

ACCGCATTCT GTGTACGAAA CAGCATTACC 10 920 

TGAAATGAGC GACCCATCAA TGATTCAAGG 10 9 80 

TTCAAGAGAA TTACAAGATG AATTTGCTTA 11040 

AAAGAATGGA AATATTTCTC AGGAAATATT 11100 

CACTGATGAA AGTCTAAAAT CACATATTCC 11160 

GATCAAAGGT GGGACCGTTA CCGCTGCGAA 11220 

ATTGCTTATT ATGGAAAAAG ATATGGCATA 112 80 

TAAAGATGGT GTTACGGTAG GTGTTGATTC 11340 

CATTTCCAAC TTACTAAAAA GAAATCAATT 114 00 

TAACGAAGCG TTCAGTGCAC AGGTAGTTGC 114 60 

GCAATTAAAT ATATGGGGTG GTGCATTAGC 1152D 

CCAATTAGTG ACTCGATTAT TTTATATGTT 115 80 

GATAGGGGGA GGTCTAGGAA ATGCAGCATT 1164 0 

TGTCATTTTC TAAGGATAGT GTGGCTGCAT 11700 

AAGAAGTACC AC CATTAATG TGTGCGTCAT 11760 

ATGCAAATAG CGAACTGATT TTAACAAAAT 11 820 

TAGACACAAT ATATGTAGGG CATTTAGAAG 11880 

CACGTTATAC AATGGCTTTA ACATTAACTA 11940 

AAACTTTTAT TAAGGCGATG AAGTAGAGAT 12 000 

TTTGGCGCTC GTAAATGATG ATAATCCAAT 12060 

GAGTCAAATG ATGCTGATGG CTATGTCATT 12120 
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GAACGTAAGA ATTAGAGATT 


TTAATaAAAA GTATAAATCA ATCGTATATA 


AGCACTTTAA 


8640 


AGCTAGTAGG 


TTCTGCTAAC 


TTTAAAGTGC 


TTTTTAAATT 


GAGAACTGTA 


ATTAGCCGTA 


8700 


ATAAAGTTTT 


TGTATATACA 


TAAACCCCCA 


CTGCAATGAT 


TATCGCAATG 


GGGGAAAGAG 


8760 


GGGACTTAAA 


GCATATGTTT 


AGCTTTGAAT 


ACTTAAAATT 


CTCTTGCTAT 


TGAAATGTTA 


8820 


GGATGTAAAT 


ATGTCTTAGA 


GTATTTTGTC 


CAACGCAATT 


AATATTGAGA 


CTCTAACCTT 


8880 


CAATATTATT 


ATAGAGAACA 


CAAACTTAAA 


TAGATTGGGT 


GACTTATTTG 


TGTCAGTTAT 


8940 


TGCGATTGCG 


ATAACTTCTT 


TTCTCTATAT 


ACATATAGTA 


ACGTCTTATC 


TAATAAAAAA 


9000 


CATGGTACTA 


CAGTATCAAA 


TTTATCTAGG 


GCTTAAGTTT 


GATTTTTATA 


ATAGGCAGGT 


9060 


TTACCTGATA 


AAAATACTTA 


TTCATTATAT 


AATGTTAACA 


ATATGTATTT 


TAAAGTTTAC 


9120 


ATTGAGTGAG 


GGATATTGAT 


GAACGTAATT 


TTAGAACAGT 


TGAAAACACA 


TACTCAAAAT 


9180 


AAACCTAATG 


ACATAGCATT 


ACATATCGAT 


GATGAAACAA 


TTACATATAG 


TCAACTAAAT 


9240 


GCCCGCATCA 


CTAGCGCAgT TGAATCTTTG CAGAAATATT CACTTAACCC 


TGTCGTTGCT 


9300 


ATTAATATGA 


AATCACCGGT 


GCAAAGTATT 


ATTTGTTATT 


TAGCTTTGCA 


TCGTTTACAT 


9360 


AAAGTGCCTA 


TGATGATGGA 


AGGTAAATGG 


CAAAGTACTA 


TACATCGTCA 


ATTGATTGAA 


9420 


AAATATGGTA 


TTAAAGATGT 


AATTGGAGAT 


ACAGGTCTCA 


TGCAGAATAT 


AGACTCACCG 


9430 


ATGTTTATTG 


ATTCAACGCA 


ATTACAGCAC 


TACCCCAATT 


TATTACATAT 


TGGTTTTACT 


9540 


TCAGGGACAA 


CTGGACTGCC 


AAAAGCATAT 


TATCGTGATG 


AAGATT CATG 


GTTGGCTTCT 


9600 


TTTGAAGTTA 


ATGAAATGTT 


GATGTTAAAA 


AATGAAAATG 


CAATAGCAGC 


CCCTGGACCA 


9660 


CTATCGCACT 


CGTTAACATT 


ATATGCGTTA 


TTGTTTGCTT 


TAAGTTCCGG 


TCGTACTTTT 


9720 


ATAGGACAGA 


CCACTTTTCA 


TCCTGAAAAG 


TTACTTAATC 


AATGTCATAA 


AATATCATCA 


9780 


TACAAAGTTG 


CTATGTTTCT 


TGTTCCAACG 


ATGATTAAAT 


CATTATTGTT 


AGTTTACAAC 


9840 


AATGAACATA 


CAATCCAATC 


ATrrmAGC 


AGTGGAGATA 


AGCTGCATTC 


TTCTATTTTT 


9900 


AAAAAGATAA 


AAAATCAAGC 


AAATGACATA 


AATTTGATTG 


AATTTTTTGG 


TACATCGGAA 


9960 


ACCAGTTTTA 


TCAGCTATAA 


CTTGAATCAG 


CAAGCACCAG 


TTGAATCAGT 


AGGTGTGCTA 


10C20 


TTTCCAAA7G 


TGGAATTGAA 


AACAACGAAT 


CACGATCACA 


ATGGTATAGG 


AACTATTTGT 


10080 


ATAAAAAGTA 


ATATGATGTT 


TAGTGGCTAT 


GTAAGTGAAC 


AATGTATAAA 


TAATGATGAA 


1014 0 


TGGTTTGTTA 


CTAATGATAA 


TGGCTATGTA 


AAAGAGCAGT 


ATTTATATTT 


AACGGGACGT 


10200 


CAACAGGATA 


TGTTAATTAT 


TGGTGGTCAA 


AATATATATC 


CAGCACATGT 


TGAACGCCTT 


10260 


T m 7\ A^^Aa"" 
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ATTTAAAATA AATATTTATT AAACATTATG 
ATAATTTATT TTTGTAAAAA TAAATTAAAG 

5 

ATAAACATTT AAATACGATG TCGAAAATGG 
GctATCGCTA TTTTTAGTTA TAATTCCAAA 
TATTGTTTAA TTCAAATGTA TGAGGGTATA 

10 

ATTTTTGAAC AAACATACTT TTGTATTTAT 
AAACTAATTA ACTCCGTATA ATTATGAAAC 

15 TTTAATAAAG AGAATATTAA CATGGTGGAT 

ACCGGTATCG GTAATGCAAT GGAATGGTTC 
TACATTGGAG CGAACTTCTT CTCTCCAGTA 

20 TTCGCAGCAT TAGCCATTGC GTTTTTATTA 

ATTGGTGACA AATATGGACG TAAAGTTGTA 
TCAACATTAA CCATTGGATT ATTGCCAAGC 

25 

CTATTATTGC TTGCAAGAGT ACTACAAGGG 
ATGACATATG TTGCCGAATC ATCTCCAGAT 
GAAATTGGGA CATTATCAGG TTACATAGCT 

30 

TTTTTAACAG ATGAACAAAT GGCATCATTT 
TTCCTAGGAT TATTCGGCTT ATATTTACGT 

35 AATGATGTTG CAACACAACC AGAAAGAGAT 

TATTACAAAG ATATATTTGT ATGTTTTGTA 
ATGGTAACTG CATATTTACC AACCTATTTA 

40 ACAAGTGTAT TAATTACTTG TGTCATGGCA 

AAGTTAGCGG ATAAAATAGG TGAAAAGAAA 
TTATTCAGTA TCATCGCATT TATGTTATTA 

45 

GGTATATTTA TATTAGGATT TTTCTTATCA 
CCAACGATGT TTTACAGTCA TATAAGATAT 
GTTTCGATAT TTGGTGGTaC GaCGCCATTA 

50 

GATCCATTAG CmCCTGCGTA TTATTTAACA 
ACATTCTTAC ATTTAAGTAC AGCAGGAAAA 



AATTTTTAAA GAGTAATGTC TGACTCGTTG 6 840 

TAATGACAAA GTTATTGAAG TAAATTGAGT 6 900 

CGATAGCATA TCACTTACAT GAAGTTGTGT 6 960 

AAGTTAATCG TTCGATGATT TAAGAATTAT 7 02 0 

AAATCATTGA ATTTAATTCG ATAAAGCGAA 7 080 

ATAAAAGTTT AAATTCTTAT AAATTTGACA 714 0 

ATACAAGAGG GAGTGTATGA ATTCATGGAT 7200 

GCAAAGAAAG CTAAAAAAAC CGTTGTTGCA 7260 

GATTTTGGTG TCTATGCATA TAcAACTGCG 7320 

GAGAATGCAG ACATTCGACA AATGTTGACT 73 80 

AGACCAATTG GTGGTGTCGT ATTTGGTATT 744 0 

TTAACATCTA CAATTATTTT AATGGCATTT 7500 

TATGATCAAA TTGGACTTTG GGCACCAATA 7560 

TTTTCAACAG GTGGAGAGTA TGCGGGGGCA 7620 

AAGCG7CGTA ACTCATTAGG TAGTGGACTA 76 80 

GCTTCAATTA TGATTGCTGT ATTAACATTC 7 74 0 

GGTTGGAGAA TCCCATTCTT ACTCGGTTTA 7 800 

CGTAAGCTGG AAGAATCACC AGTTTTCGAA 7 860 

AACATTAACT TTTTACAAAT CATCAGATTT 7 920 

GCTGTTGTAT TCTTCaATGT TACAAACTAT 7980 

GAACAAGTTA TTAAATTAGA TGCAACGACA 804 0 

ATAATGATTC CATTAGCATT AATGTTTGGT 8100 

GTATTTCTAA TTGGTACTGG TGGGCTAACA 816 0 

CATTCACAAT CATTTGTTGT AATAGTAATC 8220 

ACTTACGAAG CGACAATGCC AGGGTCGTTA 828 0 

CGAACTTTAT CAGTAACATT TAATATCTCT 834 0 

GTkGCAmCaT GGTTaGTTAC GAAAACTGGA 84 00 

GCAATCAGTG TTATTGGCTT TTTAGTTATT 84 6 0 

TCTCTAAAAG GTTCGTATCC AAATGTAGAT 8 52 0 
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U(j L Cj<j ATGT A 


ATTACAATAC 


TAGGTGTTGC 


AGAAGATGCA 


TCAATTAAAG 


CAGCTATTGA 


5040 


AtjAAbOTI CAT 


AAAAATAATA 


AACAATTACT 


AGTTGATATG 


ATTGCTGTTC 


AAGATTTAGA 


5100 


AAAALLjiXjCA 


It to to r*^ to « ^wv* to 

AAAGAACTAG 


ATGAAATGGG 


TGCTGATTAT 


ATTGCAGTAC 


ACACTGGTTA 


5160 


T'f™* J\ 'T T*I' TV f~* TV TV 

1 III ALAA 


G CAGAAGGGC 


AATCACCATT 


AGAAAGTTTA 


AGAACCGTTA 


AATCTGTTAT 


5220 


TAAAAATTCT 


AAAGTTGCAG 


TAGCAGGTGG 


AATTAAACCA 


GATACAATTA 


AAGATATTGT 


5280 


C G CTGAAAG T 


CCTGATCTTG 


TTATTGTTGG 


TGGCGGAATC 


GCAAATGCAG 


ATGATCCAGT 


5340 


AGAAGCTGCG 


AAACAATGTC 


GCGCTGCAAT 


CGAAGGTAAG 


TAATATGGCT 


AAATTTAGTG 


5400 


ACTAT CAATT 


AATTCTAGAT 


GAATTAAAGA 


TGACTTTGTC 


ACATGTTGAA 


GCGGATGAGT 


5460 


TTTCAACTTT 


TGCATCCAAA 


ATACTACATG 


CTGAACATAT 


ATTTGTAGCT 


GGCAAAGGAC 


5520 


GTTCAGGATT 


CGTGGCGAAT 


AGTTTTGCAA 


TGCGCTTAAA 


TCAGCTCGGC 


AAACAGGCAC 


5580 


ATGTTGTTGG 


AGAATCAACG 


ACACCTGCGA 


TTAAGTCGAA 


TGATGTATTT 


GTAATTATCT 


5640 


CTGGTTCAGG 


TTCCACGGAA 


CATTTAAGAT 


TATTAGCAGA 


CAAAGCAAAA 


TCAGTAGGTG 


5700 


CTGACATCGT 


ATTAATTACT 


ACAAATAAAG 


ATTCTGCAAT 


AGGCAATCTA 


GCTGGGACGA 


5760 


ACATCGTTTT 


GCCTGCAGGT 


ACAAAATATG 


ATGAACAAGG 


CTCGGCACAA 


CCATTAGGAA 


5820 


GTTTGTTTGA 


ACAAGCATCT 


CAATTATTTT 


TAGATAGTGT 


TGTAATGGGA 


TTGATGACTG 


5880 


AAATGAATGT 


TACGGAACAA 


ACGATGCAAC 


AAAATCATGC 


TAATTTAGAA 


TAAAATAAAG 


5940 


ATAGTCGATA 


ATATGATGCC 


TAGGCAGAAA 


TATTATCGAT 


rp ^ T*n M | M f M | I* | M | t T\ 

ini j. x x x x i. n 


TTTAAATAAT 




AAATTATAGT 


ATAATATCAA 


TAATAAACGA 


ATAGGGGTGT 


TAATATTGAA 


GTTTGACAAT 


6060 


TATATTTTTG 


ATTTTGATGG 


TACGTTGGCA 


GACACGAAAA 


AATGTGGTGA 


AGTAGCAACA 


6120 


CAAAGTG CAT 


TTAAAGCATG 


TGGCTTAACG 


GAACCATCAT 


CTAAAGAAAT 


AACG CATT AT 


6180 


ATGGSAATAC 


CTATTGAAGA 


ATCATTTTTA 


AAATTAGCAG 


ACCGACCATT 


AGATGAAGCA 


6240 


GCATTAGCAA 


AGTTAATCGA 


TACATTTAGA 


CATACATATC 


AATCTATTGA 


AAAGGACTAT 


6300 


ATTTATGAAT 


TTGCGGGTAT 


AACTGAAGCC 


ATTACAAGTT 


TGTATAACCA 


AGGGAAAAAA 


6360 


L~l riTCGTGG 


TGTCTAGTAA 


GAAGAGTGAT 


GTATTAGAAA 


GAAATTTATC 


GGCTATTGGA 


6420 


TTAAATCACT 


TGATTACCGA 


AGCTGTTGGA 


TCCGATCAAG 


TAAGTGCATA 


TAAACCAAAT 


6480 


CCTGAAGGCA 


TACACACAAT 


TGTGCAACGC 


TACAATTTAA 


ATAGCCAACA 


AACGGTGTAT 


6540 


ATTGGTGATT 


CAACGTTTGA 


TGTTGAGATG 


GCACAACGTG 


CTGGTATGCA 


ATCTGCAGCT 


5600 


GTCACTTGGG 


GTGCACATGA 


TGCAAGGTCA 


TTACTTCATT 


CAAATCCGGA 


TTTTATTATT 


6660 
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TAATTTGCTA ATGTGCCTGC AGATGAGAAG GTTTCATCAT CAGGATGTGG AAATATTACT 3 24 0 

AATACATGTC TTTCGTCAGT CATGTTGATG CCTCCTCTAT AAATTAAATG GTCGCTCACT 3300 

5 AATTTGAAGT GCTGCAGCGA GTTGACCTTC GTAATTAAAA CCTGCAATTA AAAATTCATC 33 6 0 

ATGCTCATTG ACCTCAAAAT GCGTTAGACC TTGTACATAA ACCCAACCAC CATTTGATAG 34 2 0 

TTTAAGACCA ATGCGATAAG GTTCTTTATT ACCACCTTTT AGTTGTGCAT GCGTATATGT 34 8 0 

w 

TATTTGTATG TTTCTTAAAA AAGTACCAGC ATTAAAAACA CGTTGATCGA AATGGTTCGC 354 0 

ATAGGCCCCA TTTGTCGTTT CAACATGCAG ATACACAGGT TTATGTTCAA AAGAAGCAAG 3 600 

15 TAAATCTATA ACTTCTTGTT CTTTAATTGG TTCCAACACG TTCACTCCTT ACACTATCAA 3 660 

TGTGTTTATC TTTCTATTTT ACTAAAAACT ATTCGATAAT TGTATACGAT TGCTCAATTA 3 720 

TTTATAAATT AATTTTCATG AAGGGTAATT ACTCAGGATT ACGTAATCAT ACAGCATTAG 3780 

20 TTTTTTACTT TTAAAAATCA AAAATTTGTT GGAATTTGAA AAGTGTTAAA CATTAAAAAT 3 84 0 

GATGCTATAT TAATGGTGTA TGAATGAATT CATAAGTTTT TAAAATGTAT TAAATTTGTG 3 900 

GAGGCATGTA AACAATGAAA GTATTAAACT TAGGATCGAA AAAACAAGCA TCATTCTATG 3 96 0 

25 

TTGCATGTGA GTTATATAAA GAGATGGCAT TTAATCAGCA CTGTAAACTA GGTTTAGCAA 4 020 

CTGGTGGTAC AATGACAGAT TTGTATGAAC AACTTGTTAA GTTGTTAAAT AAAAATCAGT 4 080 

TAAACGTAGA CAATGTATCC ACGTTTAATT TAGACGAATA TGTAGGTTTA ACCGCATCAC 4140 

30 

ATCCGCAAAG TTATCACTAT TATATGGATG ACATGCTTTT CAAACAATAT CCTTATTTTA 4200 

ATAGAAAGAA C ATT CAT ATT CCAAATGGAG ATGCCGATGA TATGAATGCG GAAGCGTgCA 42 60 

35 AAATATAATG ACGTTTTAGA ACAACAAGGT CAACGTGATA TTCAAATTTT AGGTATTGGT 4320 

GAAAATGGTC ATATTGGATT TAATGAACCT GGTACGCCGT TTGATAGCGT TACTCATATC 43 80 

GTTGATTTGA CTGAAaGTAC TATTAAGGCT AATAGTCGAT ATTTTAAAAA CGAaGATGAT 444 0 

40 GTTCCAAAGC AAGCCATTTC GATGGGACTT GCTAATATTC TTCAAGCCAA ACGTATCATT 4 500 

TTACTCGCAT TTGGTGAAAA GAAACGTGCT GCTATTACAC ATTTATTAAA TCAGGAAATT 4560 

TCTGTTGATG TTCCAGCCAC ATTACTTCAC AAACACCCGA ATGTTGAGAT ATATTTAGAC 4 620 

45 

GACGAAGCTT GCCCGAAAAA TGTTGCGAAA ATTCATGTCG ATGAAATGGA TTGATTGCAA 46 80 

TGTTTAATTA AGAAATGCCT CGGGAAAGGT TCCAATAGAA AGATAAAAAG CATTGGAAGG 4 74 0 

ATGATTTTTA GTGGAATTAC AATTAGCAAT TGATTTATTA AACAAAGAAG ACGCGGCTGA 4 800 

50 

GTTAGCAAAT AAAGTAAAAG ATTATGTAGA TATCGTAGAA ATCGGTACGC CAATCATTTA 4 8 60 

CAACGaAGGT TTACCAGCAG TTAAACATAT GG CAGACAAC ATTAGTAATG TAAAAGTATT 4 920 
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LLL-Ab 1 i AAA 


TTAACACCTA 


AACTATTACC 


TACAAAATAA 


TTCATTTACA 


ACACCACTTA 


1440 


1 A ILinl 1 I J. 


TTATAATTAT 


ATCACATAAT 


ATTTAATTAC 


TTCTTTTAAC 


TGGAAGATGT 


1500 


GTTTATTTAT 


AAAACAACAA 


ATTTTGATAT 


TTATAATGAT 


AGTAGTTATT 


CAATCAcTAC 


1560 


GACCcAATAT 


ATCATkGTAG 


AG CTT AGGAT 


ATTGATTTAT 


GACTCAGGCA 


CATCAAATGa 


1620 


GAgGATTTAT AAArGAGATA TACAACTCTA 


GAAGGTATAA 


TAAAAACGCG 


CAACTAATGT 


1680 


TACGCGTTTG 


AATTAATCAT 


ATGATATTAT 


TTGCGATACT 


TTAATTTAGC 


GAAAgcATCA 


1740 


TGTTGATGGA 


TAGACTCTTC 


ATTACGACAT 


TCGATATCGA 


AACCGTCTAA 


CCAATCAAAT 


1800 


TCAACTAAGT 


CCGCGGCAAT 


TAAACGAATT 


AAGTCTTCGA 


CAAAACGTGG 


ATTTTCATAT 


1860 


GCACGCTCTG 


TCACACGTTT 


TTCATCAGGA 


CGTTTTAAAA 


TAGGGTATAG 


AATTGAACTT 


1920 


GCATTAGCTT 


CCATTGCATC 


TAAAATTTTA 


T1T1TATAGT 


CATCAACTAT 


GTCTTGATCT 


1980 


TTATTAATAT 


ATGTTTTAAC 


AGTGACAACA 


CCACGTTGGT 


TGTGCGCTGA 


ATACTCACTT 


2040 


ATTTCTTTTG 


AACAAGGGCA 


TAGCGTTGTG 


ACAGTTGCTT 


CAATAGTAAG 


TTCTTTACGT 


2100 


GTAnCTTTAT 


CACCGTCAAT 


TGCTAATCCA 


TAAGTGACAT 


CGGCATTACC 


AACTGCTTTA 


2160 


ATATTTGTGG 


TTGGACTATA 


GCGATCAAAG 


AACCATTTCC 


CAGAAACATC 


AACGCCTGCC 


2220 


GCATTTTGTT 


TCATATTCGT 


TTGTAAAGTG 


CGTAACACCT 


GATAAAGTGT 


ATTAAATTCA 


2280 


AGTTCAATAC 


CATTATCATA 


GTGCTTTTCA 


ACACTTTCGA 


TTATACGGCT 


CATATTAATA 


2340 


CCTTTTTCGT 


CTTTTGTTAA 


ACTTGTTGAA 


AAACTAAATG 


TGCCAGCTGT 


TTGATACTGG 


o /i n n 

^ ~t w w 


TCAACAAGTA 


CAGGGTACAC 


TAAGTTTTTA 


ATACCAACTT 


CTTCTATTTC 


AAATAAAAAA 


2460 


TCTTTATGTG 


TACTTTGTAA 


ATCTGTCATT 


TCGTTCTTAG 


TAGTAGGTTT 


CGTGCCTTCA 


2520 


ATAGGATCTA 


CGGAACCAAA 


GTGTTTCCAA 


CGACCTTCTC 


GTGTCGATAA 


ATCAAATTCA 


2580 


GTCATTTTTT 


TCCTCCGTTA 


AGATTTAAAG 


TGATATGTCC 


AATATGGTTC 


GACTGTTAAA 


2640 


AAGCTGTGTT 


GTTTACCATC 


GATTTCAGGA 


CTTGCTAATT 


GTTTTAAAAA 


TGGACCTGTT 


2700 


TGAGAAGCAT 


GTGCTTCAAA 


TGCCTTAATT 


TTAAGTTCTT 


TAAAATCTGT 


AATATCATTT 


2760 


TGAATATCAG 


GTTCTCCAAG 


AGCTTCGGTT 


GCATCATTAC 


TGAACGCAAC 


TAAAGTTAAA 


2820 


CGAGGGCGTV 


CTTCTTTAGG 


CATGCGTTCA 


ACCGTTCGAA 


TTACAGCGTC 


TGCTGTTGCT 


2680 


TCGTGATCAG 


GATGTACTGC 


ATAT CCAGGA 


TAAAATGAAA 


TAATCAATGA 


TGGATTTGTA 


2940 


TCATCGATTA 


AAGATTTAAT 


CATACCATCT 


ATATGTTCAT 


AGGGTTCAAA 


TTCGACAGTT 


3000 


TTGTCACGTA 


AACCCATTTT 


TCTTAAATCA 


GTAATACCGA 


TAACTTTACA 


AG CTT CTT CT 


3060 
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CTTnCCGGTG TTT 4 0 93 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
TGCCAAACTA CCTTTTGACA GTCGTTGCTG TACTTCAGGA TGATCAATCA CATATnTTAC 60 
TTTATCAAAT AGGGCATCTT CATCATTTTT AGTAATTAAA TAACCATTGA AATCTGAAGT 120 
AATCAGTTCG TTAGGTCCAT ATTTAATATC ATAACTAATA ACTGGAACAC CATGTGCTAA 180 
AGATT CAAGT AGCGCTAAAG AGAAACCTTC CATGTTACTT GTTATTAAAC TCAAATAGGC 240 
ATCGCTATAT TCTTGGTCTA GATTGCTTAA AAAGCCGCGT AAGTAAACAT GATTTTCCAA 300 

TCCATATTTT TGTATCAATT CATTTAATTT TTTACTTTCA GAaCCAAAAC CATACATATG 360 

AaGCTCTATT TTTGGGACAT ACGATACTAA GCGTTTAATT AATTCAATTT GTTGATGTAA 420 

TTGTTTTTCA GGTGAATAAC GAGCAACGGA AATTAATTTA ACACTGCGCT GATCTAATGT 4 80 

TTGGACTGGT GTATCAATTG TTTCACTATA GCCGACAGGA ATATTAACAA CTGGAATAGT 540 

ATGGTTAATA CGTTTTTCAA CATCTAATTT TTGCTGCTCA GTAGAAACGA TAATTGCACG 600 

ATATCGAGAT AAATTTTCAA ACATCGCTTT ATATACATTT TTAAATGGCG ATGAATCTAA 6 60 

35 TGCATCAATA TTTTTAATGT GTGTACTGTG AAGCACAGCT ACTACTGGGA TTGACTCAGG 72 0 

CGTTAAGTTG AAAATAGGTG CTGTGTACAC ATTACGATCA CTGAAAAATA AATCCCCATG 7 80 

TTGATATAGT TGTTTAATGA AAAATGCGCC TAATTCCGTT TCATTATTAA AGAAATATTG 84 0 

TTTGTTAGCA TAGTAAACAA TAATTTTTTG TACTTCTGGT TTGCCATCCT TGTAAGAAAA 900 

ATACTTTTCT AATTTTGTGT CACCTTCTGG ATTATAGAAA AATTCACATA ATGTTTGTTG 9 60 

TTTATCAACA AGAATCCTAC TACAACTTAA AAAGCCACGC ACATCATAAA AATCACGTTT 10 20 

TACTTtTCGT CTTTGACTAT CAAAATGATT TACATAATCT AATATACGAT ATTTAGGATC 10 80 

TTGAAAATGG GCATACATTA AGAAACGCTC TTGATCATAT ATTCTAAAGT CATGACTATT 1140 

TTCAACATGT TTTAAAGTAT AATGACATTC ATCAGTCCAA TACGACAACC AGTCAAATGG 12 00 

TTCATTGCGT TCTAAATATG TTGCTTCTTG GAAGAAATCA TACATATTAA TATAGTCAGA 12 60 

ACTAGTAATA TAATTTTGGG CATTTCTATA TAAATATCTA TTCCATGACA GAAATACACA 13 20 
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TAATAATCAT TTCAATGACA CGCGATGTTT TCTCACTAGC AATTTCCATA GCTATTTGAG 234 0 

ATGCATAATT TAAAACAATG AAGAACATTA GAAAGATAAT GCCATmaGcT AAAGCATAGT 24 00 

TGAAAATCTT TTGTCCTTCT GATACTTTAT CGACTTCATC ATTAGAAATC ACCTTATTAT 24 60 

CAACTTTACT TTGTGCTTGT AATTTTTGTA AGTCTTCTTT GTTGATATTT AATTCCCCGG 2 52 0 

CTACCATATT TGTTTGAATA GCTGTAAGCA GTGCTTGTAC TTTTTGTGAA TCTTCATGAC 2 5 80 

TTACTCGCTT CTCACTAATG ATTGTCCCTT GTAACGTGCG ATTTTGATTC ACCTTGATAA 2 64 0 

TATAAGCTTT ATCAAGTTTA TGTTTTTTTA CTTCTTTTTC AGCATCTTCT ATAGAAACTT 2 7 00 

TAGTAAACTT AGCATCACTA TGAAATGTAT TCGCCTGTTG CTTGAAAACC TTATAGATTT 276 0 

GTTCATTCGG TGCTGCTACA CCAATTTTAT CTGGACCATC ATCAAACATG TTAATAATCT 2 82 0 

TATCAATGTT AGATAGGCCA ATCATTAAGG CAGCAATAAT AATCATAAAA ATTACAAATG 2 8 80 

ATTTAGCTTT AATTTTTTTG ATATATGTCA AAGTAAATGT CGCCCAAAAC TTATGCATCC 2 94 0 

TTGCCACCAA CCTTCTCAAT GAATATATCT TGTAATGATG GTTCTACAAC TTGGAATCGT 3 000 

TTAACATAAC CTTGATGTGC CACAACTTGA TAAATATCTT TGGCTACGTC TTCATTCTCA 3060 

ATCGTCAACT GAAGACCTTG CTTCATGTTT TCACTATGAA TGATGCCTCT AATGTTTGTT 312 0 

AAATCTGGTA GTGTTGTTTC TGATTCAATG ACAACTTTCT TGTTACCATT AGATGCACGT 3180 

ACATGATTGA TATCACCAGA AACAACAAGT TGACCTTTAT CTAAAATACA AACATCATCA 3 24 0 

CATAATTCTT C AACATG CTC CATACGGTGA GAACTATAAA CGATTGTACT GCCCCAATCA 33 00 

TTTAAGTCTT TAACTGCTTc TTTTAATAAC TCAACATTAA CTGGGTCTAG ACCACTGAAA 3 3 60 

35 GGCTCATCTA ATATTAGTAA TTCTGGTTTA TGTAACATAC TTGCTAACAG CTGAATTTTT 34 20 

TGTTGATTCC CTTTTGATAG ACTATCAATT CGTTTTTTGC GGTTTTCAGT AATATCAAAA 34 8 0 

CGCTCAAGCC AATACGATAT TTGCTGTTGT ATTTCTGTTT TTGACATTCC CTTTAAAGTT 3 54 0 

40 

GCCAAATATT TCAATTCTTC TTCAACTGTC AATTTCCCAT GTAAACCGCG TTCTTCCGGT 3 60 0 

AAATAACCAA TACGATTGTA CATTGTTTTA TCTAGTTTTT TACCGTTATA CGTrrTGTGT 36 6 C 

CCTTCAGTTG GTTCACTTAA GCCTAAAATC ATACGAAATG TCGTTGTTTT ACmTGCACCA 3 72 0 

45 

TTTCTTCCTA GAAAACCTAA CATTTT AC CT GATTCTAACT TTAATGAAAT ATCATTTACT 37 8 0 

GCCGTCATCT TGCCAAAACG TTT CGTAACA TGTTCAATTA CAAGTCCCAT ACTTTGCCTC 3 84 0 

50 CTAAAAAnAT ATGTATTTAT CTTAATATAA CATTTCCATT CTCTATAAAT GCAATATTTT 3 90 0 

TAAAATGAAT TTATTTTTAA AATTTCTGAA ATTGAAAAAT TTAAATAGTG CCATTTTTGC 3 96 0 
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TATCTTCAAA CACTTCATGT AAATCTAGAA TATCACCTGT AACAATATTT CGCTCATCTA 54 0 

ATACATACAT ATCTAATTGA TTACTTGAAA TGCGATGATT ATCAACGACA TTATTATCTC 60 0 

GATTCAAATT GAAGTACACA TGATTCGTAG GACTAAACAA TGTGTCTTCT GATGCAACTG 660 

CTTCGTATTC AATCGACCAT TGGTGATCCG CATCATAAAT ATGTGTAATC GTCACATCGA 72 0 

TATCACCCGG GAAATGATCA TCAGCTGATT TCAACACCGT CTTAAATATA ACTTTAATTT 78 0 

GAGCAATTTC ATTTCTAATT TCATAATCAA ATAACTTATT GTCCAAACCA TGACATCCAC 84 0 

CATGTAAATG ATGTTCACCG TTGTTTTTTT CTAACTGATA TTCTTTACCT TTCAACTTAA 900 

75 ATTTAGCATT ATCAATTCTA CCGCTATATC TTCCTATAGA AGCACCAAAT TTAAAAGGAT 96 0 

TACTATGATa AAATTCATCC GC TTCAACAA CATTTCCAAG AACAATATTA TTATCATGAT 102 0 

ATTTCCAAGA CACTACTCTT GCTCCATAAT TCGTAAAAAT AATTTTAGTT TCATCATTAT 1080 

CAATTTTGAT TAAATCTACA CCTTGTCTTT GGTGCTCAAC TTCAACTATC ATTTTTACTT 114 0 

CTCCCTTCTA ACCACAAGTG TTCAAGCTCT GCTGGGTAGC AACATTACTA AAACACCTAC 120 0 

AATACAAATG ATTGCACCGA TAACATCATA TTTATCTGGC ATTTGTTTAT CTACGACCAT 126 0 

CGCAAAAATC AAACTCATGA TGATAAATAC GCCACCATAT GCTGCATATA CTCTTCCGAA 132 0 

TGATGGAAAT GATTGAAATG TCGCAATGAC ACCATATAAC ATGAGTATCG CACCGCCTAT 138 0 

TAGCCCAACA AGTGAAGACT GTCCTTCCCT AAGCCACAGC CAAATCAGGT ATCCCCCACC 14 4 0 

TATTTCACAT AAGCCAGCTA ATATAAATAT AAAAATCGGA TATAACATGA AATCACTCCA 150 0 

TCACACATTT GCTATCAATA ATCTATCGGC TACATATCAT TTGTTTACAT TTCTTCTTAC 156 0 

35 TTCACATTCC CATTTTAAAA AGTTCGTTTT CACATTCATA TTGTACACTT TTTTAGACAT 162 0 

TATTCTATAG CTAAATATAA AAAAATAAGA GTAACACGCT TTCATCATCA TTTTATATGA 16 8 0 

TAAATGTGTG TCACTCTCAT CAATTTTATT TTTTAAATAC ACGTTTCATT GAATTAAATA 174 0 

AGCCACGTTC AAATGTAAGT ACTGAATCTT TATATGTTTT AATTGCAATC CATATCAAGA 18 0 0 

CAGCTACCAT TACAATTGAG ATTAAAGAAC TTAAGATGAC CT CAT AT ATT TGAAGCCCTG 186 0 

AAGTTTGAGC GCGTACAACT AATTGAAATG GCGCTAAAAA CGGAATATAA CTTGTGATTA 1920 

AAGCAAGTTG TCCATCAGGA TTATTTATCG TGAATATCGC GATATAAAAT GCAATCATAC 1980 

CAAGTAATGT CAGTGGCATC AAAGATTGAT TTAAATCTTC TATTCTAGAT GTTAATGATC 2 04 0 

50 CGAGGATGGC TGCAAGTAAT ACATACGCCG TAATTCCAAC AATACTACTT ATAATTCCGA 210 0 

CAATAATAAT TTGCCAAGAC AATTGATTCA TTTCCACGTT AAAACCTTGT AGCAAGTCTT 2160 

TTAAGTCAAA GGCAAAAATG CATATAACTG CCATCAATAC AATTAAAATA ATCTGAGTCA 22 2 0 
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ATTATTTATA TAGATGACAT TCAAAAATGG TTTAACCAAT ATACCGATAA ATTGACACAA 162 0 

AATCATAAAG GACAAGGACA CTCAAAATGG GAAGACTTTT TTAGAGGGAG TCGGATTACT 16 8 0 

GAGACTTTTG GTAAATATCA ACATTCACCA TTTGATGGTA AGCATTATGG CATTGATTTT 1740 

GCATTGCCAA AAGGTACACC AATTAAAGCG CCGACGAATG GTAAAGTAAC ACGTATCTTT 18 0 0 

AATAATGAAT TGGGCGG CAA GGTATTACAG ATTGCCGAAG ACAATGGAGA ATATCACCAG 186 0 

TGGTATCTAC ACTTAGACAA ATATAATGTC AAAGTAGGTG ATCGAGTCAA AGCAGGTGAT 1920 

ATTATTGCAT ATTCAGGCAA TACAGGTATA CAAACGACAG GCGCACATTT ACATTTTCAA 198 0 

AGAATGAAGG GTGGCGTAGG TAATGCATAT GCAGAAGATC CAAAACCGTT TATCGATCAG 204 0 

TTACCTGATG GGGAACGTAG CCTATATGAT TTGTAGTTAT AGAAGGGTGC CCGCAGTCTA 2100 

AAAAATTAAG CAATCATTGT GTGAGTATGA TACTTACATA ATGGTTGCTT TTTTCAATGA 216 0 

AAATCGTAAT GCTAAGTCAT ACTTGTTTGA TTTAGATATT ACTTAAAATG TAAGACAAGG 222 0 

TTGTTAGCAT TGGCAGTGAA ATATCGCACA TAAAAAACAT TATTGTCACA CTAGAAAATA 22 8 0 

GTTGTGCACT ATATCAATTT TCTGTATAAA AGTTTAATTC TGACAGTAAT G TAAACGTTT 234 0 

ACAATTTATG ATTGACATTA ATAATGACTG AATATATGAT TTATGTAAGT ATTTGTGCAA 24 00 

CGTTTTCACA AAGTGTATTG CACaAyCAAA CTGtAAACaA a GTATGGGGg GCCATAACAT 24 6 0 

GGCAGAACTA AGTTAGAGCn TATTAAAA 24 8 8 
(2) INFORMATION FOR SEQ ID NO: 108: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4093 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

TTTTCTTTAT TTCAAmCTGT ATATTaATGA TGTCACTTCA TTTGATACGA TTCTTGATAA 6 0 

CCTATTCAAA ATTCCGCCAA ATAACATAAA TATTATATAA ATGCCGATAC TTTTAATCAT 12 0 

TTTCTACTTT TTCTTCGATA CGG AAACTTG TTTTCGAATT GAACACTTCA CCAGCTTTTA 180 

AAATTG A CGG TGCTTTTTCA C CAT ATAAAT TAATATCATT TGGTAAAAAT TGTGTTTCTA 24 0 

ATGTAAAGCC AGAATGTGGT TTATAAATAT TAAATGGACT ATCCCACTCA TCAGGCTGGT 3 CO 

TAAAAGTAAA GAACACAACA TGAGGCATAT CTGTATCGAO CTCTAACATA AATTrATCAT ^ n 
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(A) LENGTH : 2488 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
CD) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

70 ATCAAAAATT GATTGTTTTC nATTTTTTGT TTCAGCGCGG GATCTTTTAC GTCTTTTGTG 60 

AAAACGaTTT TATTATTAAC TACTTTTACT GGATAACTTT TGTATGTCGA GTCAGTAGCA 120 

TTTTTTCTAT CGTTTGTAGT TGTGTCATAT TCACCAgTTA TTTTATGTGT GTTCTTATCT 180 

ACCTTTAACA ACATACGGTC TTCTTTTAAA AGCTCATCTG ATCCAACAAC TGAATAAGAG 240 

GATTCTATAT ACCATGTGTC TTGATCATTA TTTTCATAAT GGGGATTATC GTGACCATCA 300 

ATTTCATAAA GCGTTTCTAA GTTTTTAATA GGATACGTAC TTAGTACTTT TTTAAGACCA 360 

TCTTTCAAAT GAATTTGTTC CCACTTCATT GCCAAAAACA TATCGCCACT GACTACAATT 420 

GAAATAATAA TAATTGCTGC TAAGTTTAAC CAGAAAATTT TATGTGCTTT CATACATTCC 480 

CACCGTTTCT CAAAATACTT CATTAACACT ATAATAATAT ATTTTGAAAA ATATTTACAT 54 0 

CAGTATTAAA GTGAATATCA AATTTTAAAT TTATGAAAAT AATAGATATT TATAAAAAGC 600 

GGAAAAGAGA TACAATAAAA AACTGCATGA CGTTTGAGAC GTCACACAGT GTAACTAAAA 660 

ATTTAAAAAG TTGTTGCTAA TTTTTCAGCA TTATTAATAC TAGTTGCTTT AATTTCTTCA 72 0 

GTCTTATGAG GTTCAGCATT GTGTCCTTCA ATAATGATTG TTTCATATGA TGGCACACCT 780 

AAGAATGTCA TAATTGTTCT TAAATAACGG TCACCCATTT CAAAATCAGC AGCAGGTCCT 84 0 

TCAGTATAAT ATCCACCACG TGATTGAATG TGTAATACTT TTTTGTCAGT TAGTAAACCT 900 

TGTGGTCCTT CAGCAGAATA TTTAAAAGTT TTACCTGCAA TTGAAATAGC ATCAATATAT 960 

GCTTTAACTA CAGGTGGGAA AGAAAGGTTC CACATAGGCG TTACAAATAC ATATTTATCT 1020 

40 GCACTTAAAA ATTCTTCTAA AATGTCACTC AATCTTGAAA CTTTCATTTG TTCATCATCA 1080 

GTTAACGTTT CGCCATTACT CATTTTTCCC CAACCAGTTA ATACATCTTT GTCAATAACT 114 0 

GGAATATAAG TTTCArATAA ATCAATATGT TTCACTTCAT CATCAGGATG TTGTTGTTGA 1200 

TATGTTTCGA TAAATGCTTT ACCAGCCGCC ATAGAATTTG ATACCAGTTC ATTAAAAGGG 1260 

TGTGCTGTAA TATATAATAC TTTTGCCATT TGAAAATTCT CCTCTGkTTC TGTTATTTTC 1320 

TTAAGTATAA TTATTATACT CGATATAAAA TTTAATATCA ATCAAAATAT TCAAATTACC 13 80 

ATCATTTTCT TCATCTATAT nTGGCAGTAC TACTAAAGTA TGAGTGCATT TAATTATGAa 144 0 

ATAGTTGATT TaGAATAtAT ACTTAATACC CAAAATATAT GAAGGATGGA TGCCACTATG 1500 
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AGCGCCAAAA 


CAAGTTCAAA 


TCATTCCAGT 


TAACGTTGAT 


TTACATTATG 


ATTATGCGCG 


3240 


CCAATTACAA 


GATGAATTGA 


AATCTCAAGG 


CGTTCGTGTA 


AGTATTGATG 


ACCGTAATGA 


3300 


AAAAATGGGT 


TATAAAATCA 


GAGAAGCTCA 


AATGCAAAAA 


ATACCTTATC 


AAATCGTAGT 


3360 


TGGGGATAAG 


GAAGTTGAAA 


ATAATCAAGT 


GAATGTGCGT 


CAATATGGAT 


CGCAAGACCA 


3420 


AGAAACAGTT 


GAAAAAGATG 


AATTTATCTG 


GAATCTAGTT 


GATGAAATTC 


GTTTGAAAAA 


3480 


ACATAGATAG 


ACAGTTGTCG 


CAATAAAATG 


CTTTAAAACT 


TTTATTGCGT 


ATCAAGTTTT 


3540 


ACAGGGTTGA 


TTATGCGTGA 


TGAATCCTGT 


ATATTACAAG 


TTAGTTAAAA 


TATTAAATTG 


3600 


AGTTAGAGGT 


TGCATGTTTA 


ATTAGTAACT 


TGTCAGAAGT 


ATTTATGGTA 


CATAAGTTGA 


3660 


ACAAGTGAAA 


GGTAAAGATG 


CCGAAATAGA 


TATAAACCAT 


AAATTATATC 


TATTGGGACA 


3720 


GTTTTCGAAT 


AGGAACTGTA 


CTGTCACAGA 


ATGTGATGTG 


CTACCTTATA 


TAGATAATTG 


3780 


CCAAAGTGGT 


TGCATATCTT 


AAAGGTATGT 


AGCCACTTTT 


TTACTTTTAA 


TATCACTATG 


3840 


TTCTGTAAAA 


AAGGG TATGA 


AAGTGAATAA 


AGGTTATTTA 


TTTCTTGGCC 


TCTAAAACAT 


3900 


GGAAAGGGAG 


CTTATATGTC 


AAAAGTTCAA 


AATGAAAGTA 


ACAATGTTGT 


CAAAAGGGGA 


3960 


CTTAAAGATC 


GTCATATTTC 


TATGATTGCG 


ATTGGGGGTT 


GTATTGGTAC 


AGGTTTATTT 


4020 


GTAACTTCTG 


GTGGAGCAAT 


TCATGATGCA 


GGTGCTTTGG 


GTGCATTAAT 


AGGATACGCA 


4080 


ATTATCGGAA 


TAATGGTATT 


TTTCTTAATG 


ACGTCACTTG 


GCGAAATGGC 


TACGTATTTG 


4140 


CCAGTATCAG 


GTTCATTTAG 


TACATATGCT 


ACAAGATTTG 


TTGATCCATC 


TTTAGGGTTT 


4200 


GCGCTTGGTT 


GGAACTATTG 


GTTTAACTGG 


GTAGTGACTG 


TAGCAGCAGA 


TATTACGATT 


4260 


GCAGCACAAG 


TCATTCAATA 


TTGGACACCA 


TTGCAAGGCA 


TACCCGCTTG 


GGCATGGAGT 


4320 


GCGTTGTTCT 


TAGTTATAAT 


TTTTAGTCTG 


AATTCGTTAT 


CAGTTCGCGT 


CTATGGTGAA 


4380 


AG7QAATACT GGTTGGCATT GATAAAAGTG 


GTTACAGTTA 


TTGTTTTCAT 


TGCAATTGGT 


4440 


TTATTAACGA 


TTGTCGGAAT 


CATGGGTGGT 


CATGTTGTAG 


GATTCGAAAT 


ATTTAATAAA 


4500 


GGTGAAGGTC 


CAATTCTTGG 


TGGCAACTTA 


GGAGGAAGTT 


TGTTATCAAT 


TCTAGGTGTA 


4560 


TTCTTAATCG 


CTGGTTTCTC 


ATTCCAAGGT 


ACTGAGTTAA 


TTGGTATTAC 


GGCTGGTGAA 


4620 


TCAGAAAATC 


CTGAACGTGC 


TGTGCCGAAA 


GCAATTAAAC 


AAGTATTCTG 


GAGAATTTTA 


4680 


TTATTTTACA 


TTTTAGCCAT 


TTTTGTTATC 


GGTATGTTAA 


TTCCTTATGA 


TAGTAGTGCA 


4740 


TTAATGGGGG 


GTAGTGATAA 


TGTAGCAACG 


TCTCCATTCA 


CATTAGTGTT 


TAAAAATGCT 


4800 


GGATTTGCGT 


TTGCAGCATC 


ATTTATGAAT 


GCAGTCATTT 


TAACGTCTGT 


GTTA 


4854 


(2> INFORMA 


TION FOR SEQ ID NO: 107- 
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CTCGTCCCTT GTATAGGGGC GGGATTTTTT GTTTTTTTCA GACATAAATG TTTGTTGGTG 144 0 

TCATAAATTC CCTGTTTATT GTTAATAGGT TTAATGTTAA AACGATGATT GTTGTTCAAT 1500 

TTTTTAACGA GGTCAGATAA AAGTATTTAT AAAGCAAATA GGAGGGTTTA ACATGGAACA 156 0 

AATTAATATT CAATTTCCAG ATGGTAATAA AAAGGCGTTT GATAAAGGTA CTACTACTGA 162 0 

AGATATAGCA CAATCAATTA GTCCTGGATT ACGTAAAAAA GCTGTTGCCG GCAAATTTAA 16 80 

CGGGCAACTT GTAGATTTAA CTAAACCGCT TGAAACTGAT GGATCAATTG AAATTGTGAC 174 0 

ACCAGGTAGT GAAGAagcGT TAGAGGTATT ACGTCATTCT ACTGCACATT TAATGGCACA 180 0 

CGCGATTAAA AGGTTATATG GTAATGTTAA ATTTQGTGTA GGTCCTGTAA TAGAAGGTGG 1860 

ATTCTACTAT GACTTCGACA TTGACCAAAA CATCTCATCT GATGACTTTG AACAAATTGA 1920 

AAAAACAATG AAACAAATCG TTAACGAAAA TATGAAAATC GAACGAAAAG TGGTTTCACG 198 0 

AGATGAAGTG AAAGAGTTAT TCAGCAATGA TGAATACAAA TTAGAATTAA TCGACGCGAT 2 04 0 

TCCTGAAGAT GAAAATGTAA CATTATATAG TCAAGGTGAT TTTACTGATT TATGTCGTGG 2100 

AGTTCACGTT CCATCAACAG CTAAAATTAA AGAGTTTAAA CTATTATCTA CAGCAGGTGC 2160 

ATACTGGCGT GGAGATAGTA ACAACAAAAT GTTACAACGT ATATACGGTA CTGCTTTCTT 2220 

TGATAAAAAA GAATTGAAAG CACATTTACA AATGTTAGAA GAGCGTAAAG AACGTGATCA 22 80 

TCGTAAAATT GGTAAAGAGT TAGAACTATT CACAAATAGC CAATTAGTTG GTGCTGGTTT 234 0 

GCCATTATGG TTACCTAACG GTGCAACAAT TAGACGTGAA ATTGAACGTT ACATTGTTGA 24 0 0 

TAAAGAAGTT AGCATGGGAT ATGACCACGT TTATACACCA GTACTTGCTA ATGTTGATTT 24 60 

ATACAAAACA TCTGGTCACT GGGATCACTA TCAAGAAGAT ATGTTCCCAC CAATGCAGTT 252 0 

AGATGAAACT GAATCTATGG TATTACGTCC AATGAACTGT CCACATCATA TGATGATTTA 25 80 

TGCGAATAAA CCACATTCAT ATCGTGAATT ACCTATCCGT ATCGCTGAGC TAGGAACGAT 264 0 

GCATAGATAT GAAGCAAGTG GTGCTGTATC AGGATTACAA CGTGTTCGTG GTATGACTTT 2700 

AAATGATTCA CATATCTTTG TTCGACCTGA TCAAATTAAA GAAGAATTCA AACGCGTTGT 27 60 

AAACATGATT ATTGATGTGT ATAAAGACTT TGGTTTCGAG GATTATAGCT TTAGATTAAG 2820 

TTATAGAGAC CCTGAAGATA AAGAAAAGTA CTTTGATGAT GATGATATGT GGAATAAAGC 28 80 

TGAAAATATG CTTAAAGAGG CAG CGGATGA GCTTGGCTTA TCGTACGAnG AAgCGATTGG 294 0 

TGAAgCGGCA TTCTATGGTC CGAAACTAGA TGTTCAAGTT AAAACAGCGA TGGGTAAAGA 3 000 

50 AGAGACATTA TCAACAGCAC AACTTGATTT CTTATTACCA GAACGTTTTG ATTTAACTTA 30 60 

TATTGGTCAA GATGGTGAAC ATCATCGTCC AGTTGTTATT CATCGTGGTG TTGTATCAAC 312 0 
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CTTGCCTAAC ATTTCT 26 56 
(2) INFORMATION FOR SEQ ID NO: 106: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO; 106: 

15 AAAATGAGGG TTCTAGCGGA AATTACCAAA AGCGTGGTTC ATACTATGGG CAGCGTAATC 6 0 

GTATTTCAAA AGAAAAAACA CCTAAATGGT TAGaAAATAG AGATAAACCT AGTGAAGAAG 12 0 

ATTCGGCTAA AGATAATAGC GTAGATGATC AACAATTAGA GCAAGATCGA CAAGCATTTC 1B0 

20 TAGATAAATT ATCTAAAAAA TGGGAGGAGG ACAGTCAATA ATGAAGCAAT TTAAAAGTAT 24 0 

AATTAACACG TCGCAGGACT TTGAAAAAAG AATAGAAAAG ATAAAnCAGA AGTAATCAAT 3 00 

GACCCAGATG TTAAGCAATT TTTGGAAGCG CATCGAGCTG AATTmACGAA TGCTATGATT 36 0 

25 GATGAAGACT TAAATGTGTT ACAAGAGTAT AAAGATCAAC AAAAACATTA TGACGGTCAT 42 0 

AAATTTGCTG ATTGTCCAAA TTTCGTAAAG GGGCATGTGC CTGAGTTATA TGTTGATAAT 4 80 

AACCGAATTA AAATACGCTA TTTACAATGC CCATGTAAAA TCAAGTACGA CGAAGAACGC 54 0 

30 

TTTGAAGCTG AGCTAATTAC ATCTCATCAT ATGCAACGAG ATACTTTAAA TGCCAAATTG 6 00 

AAAGATATTT ATATGAATCA TCGAGACCGT CTTGATGTAG CTATGGCAGC AGATGATATT 66 0 

TGTACAGCAA TAACTAATGG GGAACAAGTG AAAGGCCTTT ACCTTTATGG TCCATTTGGG 72 0 

35 

ACAGGTAAAT CTTTTATTCT AGGTGCAATT GCGAATCAGC TCAAATCTAA GAAGGTACGT 78 0 

TCGACAATTA TTTATTTACC GGAATTTATT AGAACATTAA AAGGTGGCTT TAAAGATGGT 84 0 

TCTTTTGAAA AGAAATTACA TCGCGTAAGA GAAGCAAACA TTTTAATGCT TGATGATATT 90 0 

40 

GGGGCTGAAG AAGTGACTCC ATGGGTGAGA GATGAGGTAA TTGGACCTTT GCTACATTAT 96 0 

CGAATGGTTC ATGAATTACC AACATTCTTT AGTTCTAATT TTGACTATAG TGAATTGGAA 102C 

45 CATCATTTAG CGATGACTCG TGATGGTGAA GAGAAGACTA AAGCAGCACG TATTATTGAA 108 0 

CGTGTCAAAT CTTTGTCAAC ACCATACTTT TTATCAGGAG AAAATTTCAG AAACAATTGA 114 0 

ATTTTAAAAT GATTGGTGTA TAATGAATAC AAATCTAAAT CGTTTAAATG ATTGAAGACA 120 0 

50 AGATGATCTA ATCAATATTA CACAGAAAGC CATTGTTTGA TGAGAATATG GTTAATAAAT 12 6 0 
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TAGTTACCCC GATTAGAAGT GCTTTACGTC CTGTTTCTAG ATCGTAATAC ATATCTAGAC 900 

CCTCAGCCTC TTGGAAATCT CCTTTAAAGT TGTTATTCAC ACCGCCTATA TCGATGCGAC 960 

GTTTAAATAA CAATTCTTTC GTTTTGATAT CGAAGCCTTG TAAGTAGTTA GGGTTGGCTG 1020 

TATTCGAATC ACCTGTATAC CAATATAAGA TACCTGCATC ATAAGTGATA CCTTGCATAG 1080 

GTTGTGTATC TGAAGTGTAT TCCATAGGTA TATCCATTTG ATACAATACT TTGTCTATAC 114 0 

CTTTATCAAT ATCGT CAGCA CTTCTAACCT CAACAAAGTT CAACGAATTC TTAAGTTGTC 1200 

TTTCAGTGGG TTTATATTCA CGTCTAAAAA TCATTAAATT TTCTACCGGA TTATAAATCG 1260 

CTGACGTATA TCTGTCGTTA AATATATTCG GCATGACATC TTGCATTTCA TTACCATAAG 132 0 

TTATTTCTCC AGTTCTATAT TGGAAACGTA CAAACTTGTT GTTTTTGTTA CTGTCCAATA 1380 

CAGCTGAATA AATCCATAAT TCTCCATCAA TGTATCTATA CGCATTGTGT GTACCGTGAC 14 4 0 

OQ CGCCGTTTTT AACAAGCAAT CTATCAATAA ATTGTCCGTT GGGCTTCAAT CTAGATAACA 1500 

TGTAATGATT ACCTGGACGA GCTTGCGTCA TATAAATAAT TTTCGTTCTA GGGTCTACCC 1560 

AAAATGATTG CATTACTGCA TTTGTATATG GCGATAAATC AGTGATAAAT TCCGGTTCTT 1620 

25 GCTCTTTTGG TTCGAATCGG TATTCTGTCG CTCGATATTC TTTATAGTGT TCAT CTACAG 1680 

CTTTCTCAAC CTTTTTAGTG AAAACATCTA GTGTTGAATA ATCATGATAC AAACGATCTT 174 0 

GCAATGTCTT ATGACCATAA CCTGTATTAT CAACGCGCGC GTCTTTTAcT TCGTTGATAC 1800 

30 CGTCGCCGTT ATGACCTAGT AC CATGTTGC TAAATCGACC GTTTAAATAT GTTAAAAAGT 1860 

CAGAGACGTT ACTTGTAACA TTTAAATGTT CATACTTTAT TTGTTCTCCA TCATGTGCGA 1920 

ATACCTCTTT ATTTCTGTGG TATTCAAGAG AGAAATTAAA ATCCGTCAGC ATGTCTGAAA I960 

TAAGTTTAAA GTTATACTCA TTTTCATCTA CATATCTGTA GTCAAAGACT CTACTTAAAT 204 0 

CTGTAATTAG TTTATTACTC ATGTTTTCCT CCTTTACTAT CCATAAAACT GATmATAATT 2100 

TTTAATAAGC TCATACATAA TAACTTCATG ACCTCTTTCA TTAGGATGTA ATC CATCAGG 2160 

CATGCTAGAT TTTCTAAATG CTGGATTATA TGGTTTGAAA TAATCTGTGT GATAAGCATC 2220 

ATATACTGGT ACATCCAATT CACTACAAGC CAATATCTGA GCATTGACAT AATCCTCTAA 22 80 

AGTTAACCCT AGTTTGTTTT TGTCCGTATC TTTACGGCGT ATCGTTGTAC CACTCATAGG 2 34 0 

GCATTGCCTA GTAGCTGTCA TTACAAGTAT TTTTGAAGCT GGATTATTTT TCCTGATAAC 24 00 

TTCAATTGCA GAACAAAAGG CGCCGTAAAA CGTTTTAGTG TCGGTTTTAT CAGTGCCTAT 24 60 

50 CGGTACGCCT GCCCAATAAC CATGTAACCA GTCATCATCT GTACCTTGTA ATATGATTAG 2520 

GTCTCCTCTT ATTTGCTCTG CTTGTCTaTA AATGCTGTTT TCTaCCGCTT CTTTACCTAT 2580 
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10 



TTGAGGTGTC AAGAATTTGA AATTTATGAA TATAGATATT GAAACATACA GCAGTAACGA 13 80 

TATTTCGAAA TGTGGTGCCT ATAAATACAC AGAAGCTGAA GATTTCGAAA TTTTAATTAT 144 0 

AGCTTATTCG ATAGATGGTG GAGCGATTAG TGCGATTGAC ATGACTAAAG TAGATAATGA 1500 

GCCTTTCCAC GCTGATTATG AGACGTTTAA AATTGCTCTA TTTGACCCTG CTGTAAAAAA 156 0 

GTATGCATTC AATGCTAATT TCGAAAGAAC TTGTCTTGCT AAACATTTTA ATAAACAGAT 162 0 

GCCACCTGAA GAATGGATTT GCACAATGGT TAATTCAATG CGTATTGGCT TACCTGCTTC 16 80 

GCTTGATAAA GTTGGAGAAG TTTTAAGACT ACAAAGCCAA AAAGATAAAG CAGGTAAAAA 174 0 

TTTAATTCGT TATTTCTCTA TACCTTGTAA ACCAACAAAA GTTAATGGAG GAAGAACrAG 18 00 

AAACCTACCT GAACATGATC TTGAAAAAtG GCAACAATTT ATAGATTa CT GTATTCGAGA 1860 

TGTAGAAGTA GAAATGGCGA TTGCT 1885 
2o (2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



15 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

30 TAATCCTTAG TTCACTGnCA AATTTCAAAA CACCAGTTCC CTCTATCTGC ATCCATAGAA 60 

ACTGnATGTT TGTGTCAATA ACCGGATTAT ATTGTGATGn TGTTTGTAAC TCGATTAAGT 12 0 

TATCATCTTT CGAAAAATTA TCTACTACCA TTATTCAACC ACCTTTCCTT CGAATAAACT 180 

35 

CCATTTACCA ACkCCACCAG TACCAAAGTT TCTAACTAAA AATTGATGTG CAGACGGGAA 24 0 

GTTATTACGT CTTAATACTT GTGTTGTATT ACCTGGTGTA TTCGATTTTA CTTCTAATAT 300 

CCAACCTGCA ATACCTTTAA AGTCTTTAGG AAAATCAGTA AATCGGTTTG ATTCTTCAGT 36 0 

40 

AGTGATATAG AAATCTAAAC CAACGATTTT TAAATCTGAT AATTTTGTAA TACTCTTAGG 4 20 

GATATGTTCC CAATAACCGG CGTTTTGCGG GCAGAAATTC CATGCTCCGT TGTTTTTCTT 4 8C 

45 ATTGAAAATG TCAATGACAC GTTCGAATTT AAG CAT ATTT CTACCTGTGC TGTTTCTGGt 54 0 

AAGTACTTGT CTTAGAGCAC CATTATAGTG TCCAGGCAGT ACATCCAAGA ACCACCCTGC 600 

ATCTCTAAAC GCTTTCGGTA ACGGGAAATC TAATGCATTT TGTGTGTCTT GaCGTATAGA 66 0 

50 TATAGTAATG ACCAACTTCC GTAATATCAC TTAGATATGC TGGGTTCTGT ATTGGTAACG 72 0 

ITTTAACAC^ T^CG^CTGAA TCAGT^ATTH A T A CTTO AGG ^CGATGTTT TTGAAGAA'^r ^ ° " 
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CCAATCAATA CCTAGTTCTC TTAGAgGCGT AAATGCTTCA TGCATGAGTT CTTGCAATTT 14 040 
TTCTGCATCT T 14 051 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

TAATCCTCAA CTTnGATTAT ATGGCTTGGG CGCATATGAA CTGCTTAGTT TAGTGTATGA 6 0 

CATTCATACA GTTCGCATGA CTATCATACA ACCTCGAATA GATAACTTTT CTACTGAAGA 120 

GTTACCAATC TCAAGATTAC TTCAATGGGG AACCGATTTT GTTAAACCCT TAGCCAGACT 180 

TGCTTATAAC GGTGAAGGTG AGTTTAAAGC AGGTAGTCAT TGTAGATTCT GTAAGATAAA 24 0 

GCATTCATGT AGAACACGTG CAGAATACAT GCAAAATGTG CCTCAAAAGC CACCACATTT 300 

25 GTTGAGTGAT GAAGAGATTG CAGAACTTTT ATATAAACTG CCTGATATCA AAAAATGGGC 36 0 

TGATGAAGTA GAGAAATATG CGTTAGAACA AGCGAAAGAG AATGATAAAA CGTATCCAGG 420 

TTGGAAGCTA GTCACGGGAC GTTCAAGGAG AGTGATAACT GATACAAAAG CAGTCCGAGA 480 

CAGGTTAGTT GAAGCGGGTT ATAAACCTGA AGATATTACA GAAACCAAGT TACTTAGCAT 54 0 

TACGAATTTA GAAAAATTAA TCGGCAAAAA AGCATTTTCT AAAATTGCAG AAGGCTTTAT 600 

AGAAAAGCCG CAAGGTAAAT TAACACTTGC TACCGAGTCT GATAAACGAC CAGCTATAAA 660 

GCAATCTGCT GAAGATGATT TTGACAAACT ATAAAAATTA AAAAGGACGG TATATAAACA 720 

TGAAAGCAAA AGTATTAAAT AAAACTAAAG TGATTACAGG AAAAGTAAGA GCATCATATG 780 

CACaTATTTT TGaACCTCAC AGTATG CAAG AAGGGCAAGA AGCAAAGTAT TCAATCAGTT 84 0 

TAATCATTCC TaAATCAGAT ACAAGTACGA TAAAAGCCAT TGAACAAGCT ATAGAAGCTG 90 0 

CTAAAGAAGA AGGAAAAGTT AGTAAGTTTG GAGGCAAAGT TCCTGCAAAT CTGAAACTTC 96 0 

CATTACGTGA TGGAGATACT GAAAGAGAAG ATGATGTGAA TTATCAAGAC GCTTATTTTA 1020 

TTAACGCATC AAGCAAACAA GCACCTGGTA TTATTGACCA AAACAAAATT AGATTAACGG 1080 

ATTCTGGAAC TATTGTAAGT GGTGACTATA TTAGAGCTTC AATCAATTTA TTTCCATTCA 1140 

50 ACACAAATGG TAATAAGGGT ATCGCAGTTG GATTGAACAA CATTCAACTT GTAGAAAAAG 1200 

GCGAACCTCT TGGCGGTGCA AGTGCAGCAG AAGATGATTT TGATGAATTA GACACTGATG 1260 



55 



EP0 786 519 A2 



w 



15 



25 



GAATGCTTTA GAACCTGTCG CAAAAATCAA TTTATCGTAT GATACTTCAA TACCATTTGC 12240 

AGTAGTAACT GATTGATTTG CTCTATCTAC TTCAATTACA GGATCATTTG TAATTAACTC 123 00 

GATACCATGT TCCTCATACC ACTCATATGG ATTCATAATT GTTTCTTCAA CTGTCATTTT 123 6 0 

ATTTTGTAAA ATATTTGAAA GCATGATGCG GTTATAGTTT GGATAAGGTT CTTTACCTAT 12420 

TACCGTAATA TCATATAAAT CGTTGGCGCG CTCTAATATT TCTTCGATTG TTCGAATGCC 12480 

CGCCATACCG TTACCAATCA TTACTAGTTT TTGCTTTGCC ATAAAATATG CCCCTTTACT 12 540 

CCATAATATT TATTTCAAAA AAAGGTATTA ATTTTTCGTT AGTGCTTTTA TATTTTCATT 12600 

GGAATCATTA AGCTTTCTAA TCTATCGTTA ATGATTTGCT TTAAAATTGG GTCGAAGTTA 1266 0 

ATTGAAGGTG TGAAGTGTAT AT CTGTATTA ATAACCATGT CATTCATTTG CTGCTTCACT 12720 

TTGTTAACAA GTCTTCCGTC ATATAAAAAT AATGGTACGA CAATCAATTT TTGATACCGT 12780 

20 TTCGAGATGC TTTCTAAATC ATGTGTAAAA CTAATCTCTC CATATAGCGT TCTCGCATAT 12 840 

GTCGGCTTGC TAATTTGCAA ATTTTGAGCG CATATTTGTA ACTCTTCGTG TGCCTTAGTA 12900 

AACTTTCCAT TAATATTGCC GTGTGCAACA ACCATAACTC CAACTTGTTG TTCGTCACCT 12 960 

GCTAATGCGT CACAAATACG TTGTTCAATT AATCGTCTCA TTAAAGGATG TGTGCCAAGT 13 020 

GGCTCGCTTA CTTCTACCTT TATGTCTGGA TACCGTCGTT TCATTTCATG AACGATATTC 13 0 80 

GGTATATCCT TGAGATAATG CATTGCACTA AAGATTAGCA ATGGTACAAT TTTAAAATGG 1314 0 

TCAACCCCAC TTTGAATCaA CGTCGTCaTT ACCGTCTCTA AATCCtGATG CTCACTTTCt 13200 

AAAAACGCAA TATCATAGTG ATGTATATCA TCTTTTACTA ATTCAGAAAT AAATGCTTCT 13 260 

AACGCTTGaT TCTGTCGTCC GTGCCTCATG CCATGTGCAA CAATGATATT CCCATTCACA 13 320 

TTTACCAACC CTTTCACACG TATTGTATAC CAAATCATTT TGTTTTTGTG AAAAGAATCA 133 80 

CATTATAATG TAAAATCAGG GAATTCCCTG ATGCCTGTAG TCATGCATAT TCCTTATACA 13440 

TTTTCCCTTT TTGTTAAATC AAAAAAAGCG ACCGATATAT GAATCCCTAC TCAACATTTA 13 500 

TTTGAGCAAG CATTAATATA TCGGTCGCTT GTAGTGTATA TTATTATCTT AAAATGGTGG 13 560 

TTGGCCTAAT ATTGTTTCGT CAAAGCGCTC GGGTATCAAT ACTTTGCGCA TGATCACACC 13 620 

TAAATCGCCA TCATCATTTT CATGTTCGCT GTATATTTCA TAACCTCTTT TTTCATAAAT 136 8 0 

TTTAAGTAAC CACGGATGCA ATCTTGCAGA TGTACCTAAA GTAACTGCCG CTGACTTTAA 13 740 

CGTATCTCGC AAAAATGCTT CTTCAACATA AGTAAGTAAT TGGCTACCAT AGCCTTTCCC 13 800 

TTCATACTCA GGATTTGTCG CAAACCACCA GACAAAAGGA TAACCCGAAA TACTTTTCAC 13 360 
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AGTTGTTAAA AATTCAGCTT TTTCAACTTC TGTACCACCA TTACCACCGA TATAGATTTG 1044 0 

GAATCCATTT TCAACTGAGA TAATACCAAA ATCTTTAACA CCTGATTCAA CACAACTTCT 10500 

TGGGCAGCCT GATACACCCA TTTTGAATTT ATGAGGTGTA TCGATGTATT CAAATGTTTT 10560 

TTCTAAACGA ATGCCAAGTC GTGTCGTGTA TTGCGTACCA AATCGACAAA ACTCTTTACC 10620 

AACACAGCTT TTAACTGAGC GTGTTTTCTT ACCATAAGCT GATGcTGAAC GCATACCTAG 10680 

GTCTTCCCAT ATATTTGGTA ATTCTTCTTT TTTAACTCCA TACAAACCAA CACGTTGTGA 10740 

ACCTGTCACT TTAACTAGTG GCACATGATA TTTCTTAGCC ACTTCTCCTA GACGAATCAG 10800 

TTGGTCTGCA TCTGTAACAC CCCCACGCAT TTGAGGTATA ACAGAAAATG TACCATCATT 1086 0 

TTGAATATTC GCATGGTAAC GTTCGTTAGC AAATCTTGAT TCTCTTTCAT CTTCATGATC 10920 

ATGTGGATAA ACCATGTTTA AATAATAGTT GATTGCTGGT CGACATTTTG GACATCCACC 1098 0 

TTTATTTTTA AAGTTTAAAA CATGTCGAAC TTCTTTAGAT GTTTTTAAAC CTTTCGCTCT 11040 

TATTTGCGTT ACTATTTGAT CGCGTGTCAA ATCAGTACAA CCACATATAC CAGCAGGTTT 11100 

TGCGGCAACA AAGTCATCTC CTAAGGTGTG CTGCAATATT TGAGCAATTT GCGGTTTACA 11160 

25 TTTACCACAT GAATTCCCCG CTTTTGTTTT AGCCGTTACT TCTTCAACTG TTGTAAAGCC 11220 

ATTTTCCGTA ATCGCATTTA CTATAGTACC TTTATCAACA CCATTACAAC CACAAATTGT 11280 

TTCATCATCA GCCATATCAG CAATTGATAG CGATGCCTCT TCTCCACCTT TAGTAAGCAA 1134 0 

TGATACAAGT GTGTAATCTT CAGTGGATTC ACCTTTTTTC ATCATGTTAT AAAAGCGTGA 114 00 

ACCATCATCG ATATCACCAT ATAGTACTGC ACCAACTACA TTACCGTCTT TTAAAAAGAT 11460 

TTTTTTATAG TTATTATCAA CACTATTAAA TATTTCAATA CCTTTAATTT CTGCATTTTC 11520 

TACAATTTGA CCAGCACTAT ACAAGTCACA CCCAGAAACT TTTAATGACG TAAATGTTGT 1158 0 

TGATCCCTTG TATCCGTTCG TTTCTTTATT TGTTAAATGA TCAGCTAATA CTTTACCTTG 1164 0 

TTCATATAGT GGTGCAACGA GTCCATAAAC TTTGCCGTTA TGTTCTGCAC ATTCACCAAC 11700 

TGCATATACA TTGCTATCAC TTGTTTGCAT CACATCATTG ACAACAATAC CACGATTAAC 11760 

ATCTAGACCT GATTCTTTGG CTACTTCTGT GTATGGTCGT ATACCTACTG CCATAACAAC 11820 

45 TAAGTCTGCC GGAATCTCGC GTCCATCAGC CAATTTAACA CCCTCAACAT CATCTTCTCC 11880 

TAAGATTTCA GTTGTGTTGG CTTGCATTTC AAACTTCATA CCTTGCTTTT CTAGATCTGC 11940 

TTTAAGCATA TTTCCAGCTT TACGGTCTAG TTGCATTTCC ATCAACCATT CAGCTAAATG 12000 

TAACACCGTT ACTTCCATAC CTTGATCTAA TAAACCACGT GCACACTCTA AACCTAGTAA 12060 

TCCTCCACCA ATTACAATTG CTTTCTTTTT AGTCTTAGCA ATGTTCATCA TTTGTTCAGT 12120 
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AGGGG TG TTA 


ATTCATCTAT 


10080 


AP-TTPTPA PT 


TTa a TTTTTT 
1 lAAi 1 1 I 1 1 


L I 1 I ILs I 1 1 L 


P A TPT A H^T' A P 

LA I G I ATT AL 


ACCTTCTCCA 


L rrCAAAAAT 


10140 


TCTACGTGCT 


TGAGCATTGC 


TAGTTATTGC 


TTCCCAAGGT 


TCAGCTTCGA 


CTGCTTTTTT 


10200 


AG CAT CCA TA 


ATGCGTTCAA 


ATAGTTCATT 


TTGTCTTTCT 


GGGTCAAGTA 


AGACTTCTTT 


10260 




AA^PAAP'^ 
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AATGCGACAG TACT C CAT CC TTCAATCGGA 
CAACCGCCAC CATTCACACC TTGACAGCCA 
5 TAAATCGTAT CTGAGTTAAA CCAATGGTTA 

CCTTCAGTAT CGATAGCGTT TTGCGCAAAT 
TTTACGCCTG AAATGGCTTC TTGCCAAGCA 

w 

CCTTTTGATT CTAATTTATG ATCAAAACGA 
AAAATTGTAG CAATACGGAC TTTGTCACCA 
GGACGATTGA ATATCCCATC TCCATCACTA 
TCGTATCCAC CTTCTGTCAT TGATAATGTA 
AGTTTTAAGT TCCACTTCTT ACCTTCTTCC 

20 ACTAAACTAT CGCTGATTGC AT CATGAAT A 

TGACCTAAGT CACTCGCTCT TAAAAATCGA 
AGCATGATAA GAAACGGCAT ATCTGTATAT 

25 GGTTGATTAA CATAATGTTC TTGTAAAATA 

TCTGAACCAG GATTCGGTGC TAGCCAGTTA 
GGTGCTACTG AAATGACTTT TGTACCTTTA 

30 

GGAGTACGTG TTAAAGGTAC ATTAGAGCCC 
TCACTTGATT CAGGCACATC TGTTTGCTCT 
TCTGCATACC AGTCATAAAA ACTAAGCATT 

35 

CCTGCTGCAT AACTAATCAT TGACATCGCT 
GG AC CAT ATT TTTTTATTGT ATACAGTAAT 

40 CAATTTGAAC GCACGTGCCC TCCCATACCT 

TCATTTTCAA CAATAGACGC CCATGCAGCA 
TCAGTCCATA AATCCCAGAG TTTTCCACGA 

45 TATTCATACC AAGAGAATGA CGCACCTCGT 

TCCGGACCAC AACTTGGATA GTCAGTTTGT 
ACAAATACTT TCCAAGAACA TGAGCCTGTA 

50 

TTATCGTGGC TCCAACGTTC TCTGTACATT 
ATCGACCAAT TCCCATTAAA TTTTTCTGTT 



CGACATTTTT CTTGTCCCAC ATAGTGAGCC 6 84 0 

CATAACATAA CTAAGTTTAA GATTGAACGA 6 900 

ATACCCGCAC CCATGATAAT CATTGAACGC 6 96 0 

TCTTTCGCTA CTTGAATGAC AACACTTTGT 7 02 0 

GGTGTATATT TTGATTCTGC ATCGTCGTAT 70 8 0 

CGCACGCCAT ATTGACTTGC CATTAAGTCA 714 0 

TTTGCTAAAG TGACTTGTCG AGTTGGAATT 7200 

TCAAAGTATG GGAATTGAAT TGTTTCTAAT 7260 

GGGTTAATTT TAGAACCATC TTCTGTTTCT 7320 

CAACGTTGAC CCATTGTGCC ATTAGGTACT 73 80 

ACTGGCTTCC ATTCGCCTTG CTCTGTTGTT 744 0 

CCCGCTTTAT ATCCATTTTC ATCTTCATCC 7500 

TGTTTAGCGT AATTTATAAA GCGTTCATTA 7560 

ACATGCGTCA TTGCTTGTGC AATTGCAGCA 7620 

TCTGCAAATT TCACATTTTC TGCGTAATCT 7680 

TAGCGGACTT CAGTCATAAA ATGTGCATCC 774 0 

CACATAATAA TGTATGATGC GTTATACCAG 780 0 

CCCCAAATTT GTGGAGAGGC AGGTGGTAAA 7860 

TCACCACCAA GCAAATTGAT GAATCGAGCA 7920 

GGAATAGGTG TAAATCCTGC GATTCGATCT 7980 

TGTGCTGCGA TTATCTCTGT AACGTCTTTC 804 0 

CGGGCTTGCT TATATTGTTT GGCTTTGTCT 8100 

ACGCGATTAC CATTGTTTTC TTCTAATGCT 8160 

ATATATGGAT ATTTGATTCG AAGCGGACTG 8220 

GGACATCCTC TCGGTTCATA TTCAGGCATA 82 80 

TGATTTTCCC AGGTAATCAC ACCATTTTTC 834 0 

CAGTTAACAC CATGTGTTGT TCTTACTTCT 84 00 

TTTTCCCATT CTCTACTTTT ACTTTCTAGG 84 60 

GGCTTAAAGA AATTCAATCC AAATTTTCCC 852 0 
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GGCGCGTTGT 


GTGAACCACC 


A CGTG TAT PT 


PIT ft R TTTPTP* 
\J X AA IX Itiu 


ALL LALAjLLj X 


TTP A A T TPT 
1 1 VjAA 1 A 1\j I 


5040 




TTATCTTGTG 


CATGATACAT 


AAACATTGTA 


p pttt a cin p a 

^^.11 1 AvjOLA 


1 ALVjA 1 VjLoA 


AA 1 AAL 1 LtL 1 


5100 


5 


CTTGCCGTTA 


CAACACCATT 


ACGGTTATAC 


rtV- lit lAOV.^ 


A A TP ATT A TP 
AA1 LA 1 1 A 1 L 


1 IoAjAI Al LVj 


5160 




TGTTTTTCAG 


CATCTTCATT 


TGATATCCAA 


ACCGTTGGAC 


UHLL 1 L inAn 


1 Atj 1 1 LftAL 


522C 


10 


ATATGCTTAT 


TATCTTGATA 


CATTGAGTGT 


ATATTCCATT 


ttpp a tp ft. rip 


PPTTR ft ft T ft ft 

L<jX XAAAlAA 


5280 


CGCAgTACCA AAGCATCTGT ACCACCTTTA ATTTTCTTAT 


ptpt ft ■ t w i *ppp 


ft ft ftTAPPA ""Pr 


534 0 




GGCGGCAATG 


TCGGTTTATA 


TACTGGTAAG 


CTCTCCCCAA 


» ttpttp-P & a 

A 1 ILa 1 1 L*d AA 


ft ft /T'T 1 r^/~»TV^ 7\ 
AAL 1 I L(j I OA 


54 00 


15 


TCCACATAAT 


AACTTTGACG 


TCCTGTTAAT 


GTTCTAAAAG 


ulAL 1 AL*ALo 


TT CT AT ATT C 


5460 




GTTGTAAATG 


GTGAATATCG 


TCGACCTTGT 


TTATTTGAAC 


c* r n~if~ir* a ft T* a 
L AuyVjAA 1 AL 


I VjL I V_j I LVjCj X 


5 520 




ATTACTTCTC 


GTGGTTGTGA 


AGTTATATTT 


AAAAACGAAA 


lillLl LALjL 


ALjLo-LLjX XLLj 


C C O A 


20 


CTAGAAATAT 


CTTTTAACGG 


CATTCCAGTT 


TGTTCTTCGA 


oAl HI LAX A 


1 1 X i rTCj x 


5640 




GATAATTTAC 


CATTCGTAGC 


AGATGAAATA 


CTTAGTATTG 


p ft tp ft p a p 
LAX LALjL 1 AL 


AX XALXjXCjLI 


5700 




GTATCAATAC 


GTGGACGATT 


CGCTCTCACA 


GAATCATCAT 


1 1 vj 1 A 1 LAL I 


L lACGTAC CT 


5760 


25 


AACATACTTT 


TTAATTCTTC 


ATATTGTTCA 


CTGACACCGA 


AAL 1 lALALL 


A XL» 1 LrLTLCA 


5820 




ACTTTCCCTT 


TTTCAAGTAC 


AGGACCAAGC 


GTGACATATT 


TPTPPTA A ft T 
1 Vj 1 L\j 1 AAA 1 


TTTAGTGTAG 


5880 




TCGCGTTCTA 


CAATTGCAAA 


GTTAGGCATT 


GTACGTCCAG 


(jI ALUjL 1 XL 


AAI X 1LALLL 


5940 


30 


TTCGACCAAT 


CTTTCACTAC 


GCCGTATGGT 


GTTGAAATTT 


L 1 lLrL 1 1 lljl 


AILAHjALXA 


6 000 




AGTGGAGTTG 


TCACAACATC 


TTTAAACGTT 


CCAGGTAAAT 


ft P TPTT'lTP P" 1 
AL» 1L1 1 1 1 LL 


CATTT CTG AA 


606 0 


35 


AATGCTTTTG 


CCAACGTTTT 


ATAAATATCC 


CAGTCTGAAC 


P.PP.ATTPPPA 


T A A PP_P A TP A 
X AALVjVjA 1 LA 


blzU 


ATGGCAGGAT 


TGAAAGGATG 


TACATATGGA 


TGCATATCCG 


1 IvsAloAlAA 


J'PPS TP '1 *T'T" 1 1 
Al LAXLtX 1 1 X 


6180 




- 

TCATACCAAG 


TCGCTGCCGG 


CAAAACAATG 


TCAGAATATA 


a pp^stp. tto p 


L\j I LA X 1 L i L7 


bz4U 


40 


AAGTCTAAAG 


AGACCACTAA 


ATCTAACTTA 


CCTGTTGTTT 


PTTPAPPrPPA 


PP.T A ATTTPT 
t_VJ X AA X. 1 1 L 1 


DJUU 




TCTGGCTTTT 


CATCTTCATT 


TGGTGTAGCT 


AATAACCCTG 


ATTTTOTP.PP 


A APT A A ATP.P 


O J D W 




TTCATAAAGT 


ATTCTTGACC 


TTTTGCAGAA 


CTTGAAATTA 


AP^TTTH A A PP» 


PP AT ATA A AT 
LLA1 A 1 AAA 1 


£ A 0 Pi 




AATGATTTTG 


GATGATTCTT 


TTTCAAATCA 


GGATCTTCTA 


TTT3PAAATTP, 
X IvjLnnnl IVj 


TflTTTT TTTT 
lull 1\j1 111 


e a q n 

D 4 0 U 




GATTTCACTT 


CATCAATTGC 


ACGTTGCAAA 


ATCGCTTCAT 


1 ILsAAlL 1 Al 


ALL 1 1 LA 1 L 1 


6 5 4 0 




TTAGCTTCTT 


CTGCAAACAA 


CAAACTATTT 


TTATTAAATT 


GTGGATATGA 


TGGTAACCAA 


6600 


50 


CCAAGTCTAG 


CTGCTAAAAC 


ATTATAATCA 


GCTGG ATGTT 


GATGCTTTAA 


CTCCTCTGTT 


6660 
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TGTTTCTTCA AAAGTTTTTG GATGAAAAGT 
CATATATCCA AAACTTTCTT GATATTTTTT 
5 CTCCATAGAA ATTCTCATTA TAAATTTCTT 

CACAGCCTTC ACAGTTATCT CCAAAATGCT 
GTGCGTGATA CGTATCTAAA TAGGTTT CTT 

10 

CATATTTGGC TAGTCCTAAT AAACGATACA 
ATCGCTCTAA TCGAGACGTG TCAAATGGCT 

15 TCATCATTGC CATACGTTGT AGGGCTCCTT 

TATTAGCTAA GTATTCAATA GGTAAACGCA 
GATTTTGAGT TGTATTTTTA CCTTCAAAAT 

20 ACCAAACCAT CGGCATCGTT CTAAATTCAG 

TTGCTAACTT ATAAATTGGA GAGTTTTGTG 
CTTTTTCAGC TTGAGCAATG ACTTCTTCGT 

25 TTTCATATAA ATCTTTCTCG TCTACTGCTG 

ATAATAAAAC ACCTAAGTAA CGCATACGTC 
TACCCGCCTC GATTCTCGGG AAACAGAAAG 

30 

TGAAGTAAAC TTTCTTATAT GGACAACCTG 
CTTGGTCAAC TAATACAATG CCATCTTCAT 
ATGCAACGCA ACTTGGATTC AAGCAATGTT 

35 

TTTCGTCAAA TTGGAATTTA ATATCTTCTT 
CTGfAACATG ACCACCTGCT AAGTCATCTT 

40 TATCCCCCGT AATTTCTGAA TACGCTCTAG 

TTGTTAAATG TTCATAATTA TAGTTCCATG 
CTGGGTTATA AAAAATTTTA CCTAAAGCAA 

45 CAAGTTTCCC TTTACGATTT AGTACCCAAC 

GTTTCGGATA CCCTACACCT GGCtTCGTTT 
CTGGACGATT TGTCCaAGTG TTTTTACATG 

50 

TATCTAAATT TAATACCATC GCAAcTTGCG 
ATCTTTCTAA CTGCTACATA TAAATCCCTT 



TAATTTTTCT GGAAAACATA ACTGTTGTGC 3 24 0 

AAAATTATCG AAATTAATCA CGGAAAATCC 3 3 00 

GACCAGTTTT CCCTGAACCT ACTGCAACGC 3360 

CGCCGCCGTA ATTGTATCCT GTACTACCTT 34 20 

TGTGTGATGT TGGAATAACA AATCGATCTT 34 80 

TGTCTTTAGT TTGGCGCTCG GTTATACCTA 354 0 

GTTGAGTAAC TTGAGATCTC ATATAACTTC 3 600 

TTACTGGCTC TGTATCTCCT GCAGTGAAAA 3 660 

TTTCTTCAAT GGCTGGGAAA ATCGCATCTG 3720 

AGCTCATAAT TGGGCTAAGT GGTGGGCAAT 3780 

GATGTAACGG AAATG CAAGT TTATATTCAA 3 84 0 

CAGCTTCAAT CCAATCGTAA CCAATACCAT 3 900 

CAAATGGGTT TAAGAATATA T CT AATTGTT 3 96 0 

AAGCTGCTTC ATGAACTCGA TCTGCATCAT 4 02 0 

CTGTACAAGT TTCAGAGCAT ACCGTAGGCA 4080 

TACACTTTTC AGCTTTGTTC GTTTTCCAAT 414 0 

TCATACAGTA ACGCCATCCA CGACATGCGT 42 00 

CACGTTTATA CATAGCACCT GAAGGACACG 42 6 0 

CACATAAACG TGGTAAATAC ATCATAAAAG 43 2 0 

CTATTTTTTG GATGTTAGGA TCTTTTGGAC 43 80 

CCCAGTTAGG TCCCCATTCA ATTTCAATGT 444 0 

CAACTGGCGA ATGCTTCCCT GATTTCGCAG 4500 

GCTCATAATA ATCTTTAATT AATGGCATAT 456 0 

TTTTTGAAAT TCTACTTCCA GATTTTAATT 4620 

CACCTTTGTA GTGTTCTTGG TCTTCCCAAC 46 80 

CTACGTTGTT GAACCACATG TACTCAGCAC 474 0 

TCACACTACA CGTATGGCAT CCTATGCATT 4 800 

CTTTAATCTT CAAGCCAATT AACCTCCTTC 4 860 

TGGTTCCCAA TTGGTCCATA ATAATTAAAG 4 92 0 
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ATCGCATTCG CCACAGCACT GTAATTATCT TCTTCAGATA ATATATCTTT AGCAGCATCA 14 4 0 

TTCATTGCAA TAATTTTACC GTTATCATCA GCAAAAACTA TCTTTTCGAT TGAATGCTCA 150 0 

TAATATTTTT TCAATAAAGT ATCTAACTGT ATACTGTCCT CATTAATCAT GACTTACACC 156 0 

CTAATTCATC TCATTATTTA TCATCATTGA AAATACCAAA CTTACGTTGA ATATCATCAT 162 0 

TATCAAATAT TTTTGGTAAA GGACGACCAT CTCTTTGACC AAATAATAGT ACGCCATACA 1680 

CTTGATTCTT ATACCAAAGC GGCACTGCTA AAACTGCTGT TAATGATTCG CTCAATAAAA 174 0 

TTGGATAGTC AATCTTTTCT TCAGGCCCTA AAGCTAAACC AACATTGGCT ATT AC CAT AC 1800 

GCTTTCCTGT TTTCATAACA GTTCCAGCTA ATCCACGACC TTTTCTTAAA ATAATCAATT 1860 

TAAATCGATT ATTTTTATTA CCTGAAACAT AGTGCCATTT TATTGGAGAT GATGGTTTGT 192 0 

TAGATTCATA GAAAGCGATT GCCGCAAAAT CATAACCCTC TTCTTTGCGT ATTTTATCTA 1980 

ATGTCTCTTG AAATCTACGA TCTTCAATTA TTGCTTCTGG TGTCAAATCC TTTCACCTCT 2 04 0 

TATGCTTACA CTTTATTCTT ACGGTAAATA ATATATCTGC GATTTATATA TGTCAAAGGT 2100 

ACACTCCAAA CATGCACCAA ACGTGTAAAT GGCCAACAAG CCATAATAGT GAAACCTAAC 2160 

25 AATATATGCA TTTTAAATGC AATCGGCACA CCACTCATCA ATGACGCATC TGGTTTTAAC 2 2 20 

ATAAATAATT GTCTAAACCA AATTGATAAT GAAGTTCTGT AGTTAAAGTC TGGATGTTGT 22 80 

ATATTTGTTA CTAATGTTGC GTAACATCCC ATAAATACGA TAAGTAATAA TAAGAAATTT 234 0 

ACAAATATAT CCGACGCTGA ACTTAATCTT CGAATACTTT TCGTAGTAAC ACGTCTCGCT 24 00 

GTTAATAAAA ACATCCCTAT CAAAGTTATT ATACCAAAGA TGCTACCAAT ATAAACAGCG 24 6 0 

CCTATATGAT ATAAATGCTC AGACACACCC ACTGCATCCA TCCATGGTTT CGGTATTAAC 2 52 0 

AATCCAACTA CGTGTCCAAA AAACACTGGA ATAATACCTA AGTGAAATAA TAAACTTCCC 2580 

CACATCAACC TTTTTCTTTC TATTAATTCA CTAGATTTAG CTGTCCAAGA AAATTTATCA 2 64 0 

TAACGATAAC GTGCAATATG ACCTGCGACA AAGACAACTA AACATAAATA CGGAAATATA 2 7 00 

ACCCATAAAA ACTGATTAAG CATGATGTTT CACTCCTTTT GGTGATGTCA AACATAATTT 2 76 0 

CAATGTTTTT CTAAGTGCTT GAATCACATA GGCATATGGA TTGTTATCTT CACCAAGTGC 2 32 0 

ATTCGCCATC ACATATGTTC CATCCTCAAT AATCATAATG ATTAATTGAA TATTCTCTTC 2 8 80 

AGCTCTTGGA TCATTTCGCC ATTCTGCCAC TTGCAAAAAT TGAAGCATCA ACGGTAGATA 2 94 0 

ATCAGAAAGT TCATTATCTA CCATTTCTAG TCCAAACATT T CAT AT AAT A CCTTTAATTT 3 0 00 

AGCTAACATT TGCCCACGTT CTTTTTGCGT ATCAAATTTG TTATACGTCA TATATAATGG 3 06 0 
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GTGGAACTT 1524 9 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GTGGCAATAT TTCTAGTTCT CGTTTTGATA AGATTTTAAA AGGATCTGTT GTGTTTGCAG 60 

TGTCCTGATT TGAATTAGAT ACAAATTCAT TCACTAAAGA TGTTGTAAGT TTCATATCTA 120 

CATATGTTTC ACCTTTATAT ACAGTTCGAA TAGCTAACAA TAATTGTTCA TCAGGTGCAT 180 

TTTTCAATAT GTAACCTTTC GCACCATTAC GCAACACATG GAACAAATAC TCCTCATCAT 24 0 

CAAACATTGT TAATATTAGT ATTTTAGTTT CAGGAAAACT GTCAGCAATT TTACTCGTAG 300 

CGATAAGACC TGACTCACCT GGTGGcATAC TTAAATCCAT TAGTAACACA TCAGGTTTAt 360 

25 ATTC.CATTAC TTTTTGGTAA GCTTCGACGC CATCTGCAGC CGTTGCAACA ACTTCCATAT 420 

CATTTTGATA ATTTAAAATC ATAGAGAACC CCGTACGGAC AACAGCGTGA TCATCGGCAA 480 

TGACTATTTT CAATTTTATT CCCCCAATGT ATGTTTCAAA TTGGAATGTT CAATGTAACA 540 

TTGGTACCCT CACCAATTTT CGTTTCAATA TTGACGCTAC CGCTGACTAA CTCAGCTCGC 600 

TCATTCATTC CATATAAACC GAGTCCAGAA CCTTTAGGCT TAGAACTTGG ATCAAAACCA 660 

TTTCCCGCAT CTATCACTTC TGCTACCAAA TGGCGCCCAG TTTGACGGAT ACCTACATTT 720 

ATTTCATTTA CATCAGCGTA TTTCAACGCA TTTAAAATAG CTTCTTGCAC TACTCGATAA 780 

ACAACCGTTT CAATATCACT ATCAAAGCGA GTATTTTTAA TATTTGATGT ATATATGATT 840 

TTTATTCCAT AATTTTCTTC AAACTGTTTA AAATATGATT TAAAAGCTGC TTCAAGGCCT 900 

AG AT CATC CA AAGAAGCGGG TCTTAATTCA ACCGACATAT TACGTATATC ATCAATTAAT 960 

TTAGCGACAA TATATTCAAT ATTTTCTGCG TCTTCCAAAA GCTTAGTTGT ATCTTCTTGA 1020 

45 TATTTTAATA ATCTCAATTG AACATCTACA TTGAGCATTT CTTGAATCAC ACTATCATGT 1080 

AACTCTCTAG AAATTCGCTT TCTTTCATTT TCTTGGGCTG AGATTGTTTT ACGCATCATA 1140 

CGTTGTTGAT GCAATTTCTC TTGCTGTTCA ATTTGTGATG AAACATTTTG AAGCGTAAAT 1200 

GCATGAATTC CCCTGTCTTG ATCAATCAAC TGATATGTTG CTGTAAATGG CATCACTTTT 1260 

TGATCTTTCG TCTTCATAAA TACTTGGAAA TTCGTAGCTT GTACTTGCAT CGATTCTAAG 1320 
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40 



AAA PPtPTP, TT 


nitl 1 1 A 1 (jL 


CAGTCACTAC 


ACCTTTACAA 


AAGGTAAATC 


AAGTAAAGTT 


13500 


CTTTC^AAAT 


ap.ta A rPTTP 
ny 1 AALo lib 


AAGTTGTACT 


CACTGGTGAT 


ACATTTGATC 


ACTGTTTAGC 


13560 


TGAAGCTTTA 




GTGAACATCA AATGAACTTT 


ATAGATCCAT 


TCAATAATGT 


13620 


a ^ — TV 1 Av-/lrt 1 X 


1 L 1 LjOAlAAL 


GTACGCTTGC 


TAAAGAAATG 


CTAGAACAAG 


CAAAGTCTGA 


136B0 


PAATP» TT A A P 


111 VjA 1 1A1L 


TATTTGCCGC 


AATTGGTGGT 


GGCGGTTTAA 


TTTCAGGTAT 


13740 


TAGTAPTTAp 


X X X/\n>vr4LL 1 


ATTCACCTAC 


CACGAAAATT 


ATAGGTGTTG 


AACCTTCAGG 


13800 


TGPAAGTAfST 


A x \J 1 A 1 La AA X 


CTGTTGTGGT 


AAATAATCAG 


GTAGTCACAT 


TGCCTAATAT 


13860 


PGATAAATTT 


P.TPP a ppptp 

vj 1 \j\jAHjO 1\j 


CATCTGTAGC 


TAGAGTTGGC 


GATATTACAT 


TTGAAATTGC 


13920 


AAAAPAAAAT 


v? 1 A(jA 1 viA 1 1 


ACGTTCAAGT 


AGATGAAGGT 


GCAGTTTGTT 


CTACGATTTT 


13980 


AGATATGTAT 


Tpa a a fir* a 


CAATTGTAGC 


AGAACCTGCT 


GGCGCATTAA 


GTGTAAGTGC 


14040 


nrTTfiAAAAr 
1 VjnnrtrtL 


1 A 1 AAAvJA 1 L 


ATATTAAAGG 


TAAAACAGTG 


GTTTGTGTCA 


TTAGTGGTGG 


14100 


T A &. T A A TP a T 


All AA X LLAA 


TGAAAGAAAT 


TGAAGAACGT 


TCATTACTAT 


ACGAAGAAAT 


14160 


p. A & p. p a. tt* a p 

oAA<jLA1 1AL 


1 I I AIL 11 AA 


ATTTCCCTCA 


ACGTCCAGGT 


GCATTGAGAG 


AATTTGTAAA 


14220 


HjALVjIAI 1A 


Vj(jALL I LAAvj 


ACGATATTAC 


TAAATTTGAA 


TACTTAAAAA 


AATCTTCTCA 


14280 


A A ATAPAPPT 
M/\A 1 ALALO 1 


A PT^^T/"*?* TTH 

AL IKj 1 LAI x A 


TTGGTATTCA 


ACTTAAAGAT 


CATGATGATT 


TAATACAACT 


14340 


LHAALAALLj 1 


U I AAAC LATT 


TCGATCCTTC 


CAATATTTAT 


ATTAATGAAA 


ATAAGATGTT 


14400 


RTfl TT P A TTP 
nlHi 1 LA 1 1 Vj 


1 * AA I 1 1 AAL 


ACATAGTAAG 


AAAAACAGTC 


ATAAATTGAT 


TTCTAATTGA 


14460 


A A T P A T PTT A 


1 LjAL IVjL III 


TTATTATACT 


TTACATTTCT 


CGTTTCGTCA 


GATTCAAACG 


14520 


TTTTP A f^flT 


PPPA APPfAT 

IjLLAAVjLLAI 


CTTTCTTTGT 


GTTTGCTTTT 


aTTTTGACGT 


TTTAGACATA 


14580 


A A A AAaP.AP.A 


L L I 1 vjUjVj 1 L 


TCAATGCGGC 


TCATCGCATC 


CAC w I*ri"i'*i'GC 


CTGGCAACGT 


14640 


T PT APT PT A P 


ppp a a pp t a a 

Lvj^jAALvj 1 AA 


GTTCGaCTAC 


CATCGACGCT 


AAGGAGCTTA 


ACTTCTGTGT 


14700 


T PGGP A TClCin, 


A A P fx PPTP TP 


ACCTCCTTGC 


TATAGTCACC 


AGACATATGA 


ATGTAATTTA 


14760 


TAPATTPAAA 

i — r\X X l/\rtrt 


APT AP.ATAP.T 
r\L x nun 1 no 1 


AAGTAAAAGT 


GATTTTGCTT 


lCtLAAAACAT 


TTATTTTGAT 


14 820 


TAAnTfT^rr 

1 -rT-r\VJ lul i LO 


nil \jA 1 1 AO 1 


ATTCGTCAGC 


TCCACATGTC 


ACCATGCTTC 


CACCTCGAAC 


14 880 


fTATTAArrr 

^-Inl I nn^_ L X 


LA I LA 1 L 1 1 1 


GAGGGATCTT 


ATAACCGAAG 


TTGGGAAATC 


T CAT CTTG AG 


14 940 


\jVj<j<jIjL 1 i LA 


TP PTT A P A TP 

I L»L 1 1 AOA 1 


CTTTCAGCAC 


TTATCCCGTC 


CACACATAGC 


TACCCAGCTA 


15000 


TGCCGTTGGC 


ACGACAACTG 


GTACACCAGA 


GGTATGTCCA 


TCCCGGTCCT 


CTCGTACTAA 


15060 


GGACAGCTCC 


TCTCAAATTT 


CCTACGCCCA 


CGACGGATAG 


GGACCGAACT 


GTCTCACGAC 


15120 
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ATGTTTTCGC AACTCAAACG CTATGGCAAA CAAAACCCAA AAACTTAAAA ATCGATATTA 117 00 

ATGGTACCTT ACCAACAGGC GTCTATGCTA AGGACATTAT TCTGCATTTA ATTAAAACGT 11760 

ATGGTGTTGA CTTTGGTACA GGCTATGCTT TGGAATTTAC TGGCGAAACA ATTAAAAACC 11820 

TTTCAATGGA TGGTCGAATG ACTATTTGTA ACATGGCTAT CGAAGGTGGT GCCAAATACG 118 BO 

GCATAATCCA ACCTGATGAT ATAACATTTG AATATGTTAA AGGGAGACCA TTTGC CGATA 11940 

ACTtCGCTAA ATCAGTTGAT AAGTGGCGTG AgCTATATTC TGATGACGAC GCGATATTTG 12000 

ATCGTGTAAT TGAACTTGAT GTTTCAACAT TAGAACCACA AGTGACATGG GGAACTAATC 12060 

CTGAAATGGG TGTTAATTTC AGTGAACCAT TCCCTGAAAT CAATGATATC AACGATCAAC 12120 

GTGCGTATGA TTATATGGGG TTAGAACCAG GTCAAAAAGC TGAAGACATC GACTTAGGGT 12180 

ATGTTTTTCT CGGTTCATGT ACAAATGCTA GACTATCAGA TTTGATTGAA GCTAGTCATA 12240 

TTGTTAAAGG AAATAAAGTT CATCCAAATA TTACAGCTAT TGTCGTACCA GGTTCTCGTA 12300 

CAGTAAAAAA AGAAGCAGAA AAATTAGGTC TAGATACTAT CTTTAAAAAT GCAGGATTTG 12360 

AATGGCGTGA ACCAGGATGT TCAATGTGTT TAGGCATGAA TCCTGACCAA GTACCTGAGG 12420 

25 GCGTACATTG TGCATCTACA AGTAATCGAA ACTTTGAAGG ACGACAAGGC AAAGGTGCAA 1248 0 

GAACACATTT AGTATCCCcT GCTATGGCAG CAGCAGCAGC TATTCATGGT AAATTTGTGG 1254 0 

ACGTAAGAAA GGTGGTTGTT TAAATGGCAG CAATCAAACC TATTACAACA TATAAAGGTA 12600 

AAATAGTCCC TCTCTTCAAC GACAATATCG ATACAGACCA AATCATTCCT AAGGTACACT 1266 0 

TAAAGCGTAT TTCAAAAAGT GGCTTTGGTC CATTTGCTTT TGATGAATGG CGGTACTTAC 12720 

CTGATGGTTC AGATAATCCT GATTTCAATC CTAACAAACC ACAATATAAA GGGGCTTCTA 12780 

TTTTAATTAC TGGAGATAAT TTTGGATGTG GTTCAAGTCG TGAACATGCT GCTTGGGCTC 12340 

TTAAGGACTA TGGTTTTCAT ATTATTATTG CAGGAAGTTT CAGTGACATA TTTTATATGA 12900 

ATTGCACTAA AAATGCGATG TTGCCTATCG TTTTAGAAAA AAGTGCCCGT GAACATCTTG 12960 

CACAATATGT TGAAATTGAG GTCGATTTAC CAAATCAAAC TGTGTCATCA CCAGACAAGC 13020 

GTTTCCATTT TGAAATTGAT GAAACTTGGA AGAATAAACT TGTAAATGGC TTAGATGACA 13 08 0 

TTGCAATCAC CCTACAATAT GAATCATTAA TAGAAAAATA TGAAAAATCa CTTTAAGGGA 1314 0 

GTTGAATATT ATGACAGTCA AAACAACAGT TTCTACGAAA GATATCGATG AGGCATTTTT 13200 

AAGACTTAAA GATATTGTCA AAGAAACACC TTTACAATTA GACCATTACT TATCTCAAAA 13260 

50 GTATGATTGT AAAGTCTATT TAAAACGAGA AGATTTACAA TGGGTACGTT CTTTTAAATT 13320 

AAGAGGTGCT TACAACGCTA TTTCTGTTTT ATCAGATGAA GCTAAAAGTA AAGGTATTAC 133 80 
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AAGAGGGTCA TATTTACCAG GATTCAAGTA TTGGTACTGG TTCAATCGTA GCAATTTACA 9900 

ATGCAGTTGA T CGTATTTT C CAGAAAGAAA CAGAATTAAT TGATTATCGT ATTAATTCTG 996 0 

TCACTGAAGG TACTGATGCC CAAGCAGAAG TACATGTAAA TTTATTGATT GAAGGTAAGA 10Q20 

CTGTCAATGG CTTTGGTATT GATCATGATA TTTTACAAGC CTCTTGTAAA GCATACGTAG 100 8 0 

AAGCACATGC TAAATTTGCA GCTGAAAATG TTGAGAAGGT AGGTAATTAA TTATGACTTA 1014 0 

TAACATTGTT GCCCTACCTG GTGATGGAAT CGGTCCAGAA ATTTTGAACG GATCTCTATC 10200 

ATTGCTTGAA ATTATAAGTA ATAAATATAA CTTTAATTAT CAAATAGAGC ACCACGAATT 10260 

TGGTGGTGCC TCTATTGATA CATTCGGCGA GCCTTTAACT GAGAAAACCT TAAATG CGTG 10320 

TAAAAGAGCA GATGCTATTT TACTGGGTGC AATCGGTGGA CCTAAATGGA CAGATC CTAA 103 80 

CAATCGACCA GAACAAGGAT TATTAAAATT GCGTAAATCC TTAAATTTAT TTGTAAATAT 1044 0 

20 ACGCCCCACT ACCGTTGTCA AAGGCGCTAG TTCTTTATCA CCTTTAAAGG AAGAACGCGT 10500 

TGAAGGCACA GATTTAGTTA TAGTCCGTGA ATTGACAAGT GGTATTTATT TTGGAGAACC 10560 

TAGACATTTT AATAATCACG AGGCCTTAGA TTCTCTTACT TATACAAGAG AAGAAATAGA 10620 

ACGCATTGTT CACGTAGCAT TTAAATTGGC CGCTTCAAGA CGAGGAAAAC TAACATCAGT 106 80 

TGATAAAGAA AATGTATTAG CTTCTAGTAA ATTGTGGCGC AAAGTCGTAA ATGAAGTAAG 10 74 0 

TCAATTATAT CCAGAAGTAA CAGTAAATCA CTTATTTGTT GATGCTTGTA GTATGCATTT 108 00 

AATCACAAAT CCAAAACAAT TTGACGTCAT CGTATGTGAA AACTTATTTG GCGATATTTT 10 860 

AAGTGATGAA GCTTCAGTGA TTCCTGGTTC ACTTGGTTTA TCACCTTCTG CTAGTTTTAG 10 920 

TAACGATGGT CCAAGATTGT ATGAGCCTAT TCATGGATCA GCACCAGATA TTGCAGGTAA 10980 

AAACGTTGCC AATCCATTTG GAATGATTCT ATCTTTAGCG ATGTGTTTAC GTGAAAGCTT 11040 

AAATCAACCA GATGCTGCAG ATGAATTAGA ACAACATATT TATAGCATGA TTGAACATGG 11100 

GCAAACGACA GCAGATTTAG GCGGCAAATT GAATACTACT GATATTTTCG AAATTCTATC 11160 

TCAAAAATTG AATCACTAAG GGGGAGATGT AAATGGGTCA AACATTATTT GACAAGGTGT 112 2 0 

GGAACAGACA TGTGTTATAC GGGAAATTGG GCGAACCGCA ACTATTATAC ATTGATTTAC 112 BO 

ACCTTATACA TGAAGTTACT TCTCCTCAAG CATTTGAAGG ACTTAGGCTT CAAAACAGAA 113 4C 

AATTAAGACG CCCAGATTTA ACATTTGCAA CACTCGATCA CAATGTTCCT ACTATTGATA 114 00 

TATTCAATAT TAAAGATGAA ATTGCAAACA AACAAATCAC AACATTACAA AAAAACGCCA 114 6 0 

TAGATTTTGG GGTGCATA7T TTTGATATGG GTTCTGATGA ACAAGGTATT G7TCACATGG 11520 
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CAAGCACGTA ATATTGCTTT AAGTTATGCA AAAGGTATTG GTGCAaCTCG TGCAGGTGTT 8100 

ATTGAAACAA CATTTAAAGA AGAAACTGAG ACAGATTTAT TTGGTGAACA AGCAGTACTT 8160 

TGCGGTGGTG TATCGAAATT AATTCAAAGT GGCTTTGAAA CATTAGTAGA AGCGGGTTAT 8220 

CAACCAGAAT TAGCTTATTT TGAAGTATTA CATGAAATGA AATTAATCGT TGATTTGATG 82 80 

TATGAAGGCG GTATGGAAAA TGTACGTTAC TCAATTTCAA ATACTGCTGA ATTTGGTGAC 834 0 

TATGTTTCAG GACCACGTGT TATCACACCA GATGTTAAAG AAAATATGAA AGCTGTATTA 84 00 

ACTGATATCC AAAATGGTAA CTTCAGTAAT CGCTTTATCG AAGACAATAA AAATGGATTC 84 6 0 

AAAGAATTTT ATAAATTACG CGAAGAACAA CATGGTCATC AAATTGAAAA AGTTGGTCGT 8520 

GAATTACGCG AAATGATGCC TTTTATTAAA TCTAAAAGCA TTGAAAAATA AGATAGACCT 85 BO 

ACAATGAGGA GTTGTTAAAT ATGAGTAGTC ATATTCAAAT TTTTGATACG ACACTAAGAG 864 0 

ACGGTGaACA AACACCAGGA GTGAATTTTA CTTTTGATGA ACGCTTGCGT ATTGCATTGC 8700 

AATTAGAAAA ATGGGGTGTA GATGTTATTG AAGCTGGATT TCCTGCTTCA AGTACAGGTA 87 60 

GCTTTAAATC TGTTCAAGCA ATTGCACAAA CATTAACAAC AACGGCTGTA TGTGGTTTAG 8 820 

25 CTAGATGTAA AAAATCTGAC ATCGATGCTG TATATGAAGC AACAAAAGAT GCAGCGAAgC 88 80 

CGGTcGTGCA TGTTTTTATA GCAACATCAC CTATTCATCT TGAACATAAA CTTAAAATGT 8940 

CTCAAGAAGA CGTTTTAGCA TCTATTAAAG AACATGTCAC ATACGCGAAA CAATTATTTG 9000 

ACGTTGTTCA ATTTTCACCT GAAGATGCAA CGCGTACTGA ATTACCATTC TTAGTGAAAT 9060 

GTGTACAAAC TGCCGTTGAC GCTGGAGCTA CAGTTATTAA TATTCCTGAT ACAGTCGGCT 9120 

ACAGTTACCA TGATGAATAT GCACATATTT TCAAAACCTT AACAGAATCT GTAACATCTT 9180 

CAAATGAAAT TATTTATAGT GCTCATTGCC ATGACGATTT AGGAATGGCT GTTTCAAATA 924 0 

GTTTAGCTGC AATTGAAGGC GGTGCGAGAC GAATTGAAGG CACTGTAAAT GGTATTGGTG 93 00 

AACGAGCAGG TAATGCAGCA CTTGAAGAAG TCGCGCTTGC ACTATACGTT CGAAATGATC 93 60 

ATTATGGTGC TCAAACTGCT CTTAATCTCG AAGAAACTAA AAAAACATCG GATTTAATTT 942 0 

CAAGATATGC AGGTATTCGA GTGCCTAGAA ATAAAGCAAT TGTTGGCCAA AATGCATTTA 94 80 

45 GTCATGAATC AGGTATTCAC CAAGATGGCG TATTAAAACA TCGTGAAACA TATGAAATTA 954 0 

TGACACCTCA ACTTGTTGGT GTAAGCACGA CTGAACTTCC ATTAGGAAAA TTATCTGGTA 9600 

AACACGCCTT CTCAGAGAAG TTAAAAGCAT TAGGTTATGA CATTGATAAA GAAGCGCAAA 966 0 

TAGATTTATT TAAACAATTC AAGGCCATTG CGGACAAAAA GAAATCTGTT TCAGATAGAG 9720 

ATATTCATGC GATTATTCAA GGTTCTGAGC ATGAGCATCA AGCACTTTAT AAATTGGAAA 9780 
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GATACACTAT TTTTAGGTAT GGGAGGAATG CATGGTTCTT ATGCTAGTAA CATGGCATTA 63 00 

ACTGAGTGTG ATTTACTCAT TAATTTAGGT AGCCGCTTCG ATGATAGATT AGCAAGCAAA 63 60 

5 

CCTGATGCCT TTGCACCTAA CGCCAAAATT GTACATGTAG ATATTGATCC TTCAGAAATC 64 20 

AATAAAGTTA TTCATGTAGA TTTAGGTATT ATTGCAGACT GTAAAAGATT TTTAGAATGT 64 8 0 

TTAAATGATA AAAATGTTGA GACTATAGAA CACAGTGACT GGG TTAAACA TTGTCAAAAT 6 54 0 

10 

AATAAGCAGA AACACCCATT TAAACTTGGT GAAGAAGATC AAGTATTTTG TAAGCCACAA 66 00 

CAAACAATCG AATATATCGG CAAAATTACA AATGGTGAAG CAATTGTTAC TACAGACGTG 6660 

15 GGACAACATC AAATGTGGGC AGCTCAATTT TATCCATTTA AAAATCACGG ACAATGGGTT 672 0 

ACAAGCGGTG GTTTAGGAAC AATGGGATTC GGTATTCCTT CGTCAATTGG TGCCAAATTA 6 78 0 

GCTAATCCTG ATAAAACAGT CGTATGTTTC GTCGGTGACG GTGGTTTCCA AATGACAAAC 6 84 0 

20 CAAGAAATGG CACTTTTACC CGAATATGGT TTAGATGTCA AAATCGTACT AATCAATAAT 690 0 

GGAACATTAG GTATGGTTAA ACAATGGCAA GATAAGTTCT TTAATCAACG CTTCTCACAC 6 960 

TCAGTATTTA ATGGTCAACC TGATTTTATG AAAATGGCAG AAGCATATGG CGTCAAAGGT 7 02 0 

25 TTCTTAATCG ATAAGCCAGA ACAACTGGAA GAACAATTAG ATGCAGCGTT TGCTTATCAA 7 080 

GGACCAGCTT TAATTGAGGT TCGTATTTCC CCTACTGAAG CTGTAACCCC AATGGTTCCG 714 0 

AGTGG CAAAT CAAATCATGA AATGGAGGGC TTATAATGAC AAGAATTCTT AAATTACAAG 72 0 0 

30 

TTGCGGATCA AGTCAGCACG CTAAATCGAA TTACAAGTGC TTTTGTTCGC CTACAATATA 72 6 0 

ATATCGATAC ATTACATGTt ACACATTCTG AACAACCTGG GATTTCTAAC ATGGAAATTC 732 0 

AAGTCGATAT TCAAGATGAT ACATCACTTC ATATATTAAT TAAAAAATTA AAACAACAAA 73 8 0 

35 

TTAATGTTTT AACGGTTGAA TGCTACGACC TTGTTGATAA CGAAGCTTAA TTTTAAGACA 74 4 0 

AAGGCAATGA TGCGCTAATT AGTTATAGAT ATATCATAGG CTGCTAGTTA ACATCTGCCA 750 0 

40 CT ATT AC AAA GTTATATTTC AGAATTTTCG AAACACAAAA TATTTAATTA TTTGGAGGAA 756 0 

TTTATTATGA CAACAGTTTA TTATGATCAA GATGTAAAAA CGGACGCTTT ACAAGGCAAA 7 6 20 

AAAATTGCAG TAGTAGGTTA TGGATCACAA GGTCACGCGC ATGCACAAAA CTTAAAAGAC 76 8 0 

^5 AATGGATATG ATGTAGTCAT CGGCATTCGC CCAGGTCGTT CTTTTGACAA AGCTAAAGAA 7 74 0 

GATGGATTTG ATGTGTTCCC TGTTGCAGAA GCAGTTAAGC AAGCTGATGT AATTATGGTG 7 80 0 

CTATTACCTG ATGAAATTCA AGGTGATGTA TACAAAAACG AAATTGAACC AAATTTAGAA 786 0 

50 

AAACATAATG CGCTTGCATT TGCTCATGGC TTTAACATTC ATTTTGGTGT TATTCAACCA 7 920 
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TATCGTTACT CGCGAAgCAA TTGATGATGC ATTTGCACTT GATATGGCTA TGGGTGGTTC 4 500 

AACAAACACG GTACTGCATA CGTTAGCCAT TGCCAATGAA GCTGGTATTG ATTATGACTT 4 560 

AGAGCGCATT AATGCTATTG CCAAACGCAC GCCATATTTA TCAAAAATAG CACCTAGTTC 4 620 

ATCGTATTCA ATGCATGATG TGCATGAAGC TGGTGGCGTC CCAGCAATTA TTAATGAATT 4680 

GATGAAGAAA GATGGCACGT TACACCCAGA TAGAATCACA GTTACTGGCA AAACGTTACG 4 740 
TGAAAATAAC GAAGGCAAAG AAATTAAGAA CTTTGATGTC ATTCACCCTC TTGATGCACC . 4 800 

ATATGATGCA CAAGGCGGTT TATCTATCTT ATTTGGTAAT ATCGCCCCTA AAGGCGCAGT 4 860 

TATTAAAGTT GGCGGCGTTG ATCCATCTAT CAAAACATTT ACTGGGAAAG CAATTTGTTT 4 920 

CAATTCGCAT GATGAAGCTG TTGAAGCAAT AGACAATCGT ACCGTTCGTG CAGGCCACGT 4 98 0 

CGTTGTCATT AGATATGAAG GACCTAAAGG TGGACCAGGT ATGCCTGAAA TGTTAGCACC 504 0 

20 TACTTCCTCT ATTGTTGGTC GCGGCTTAGG TAAAGATGTT GCATTAATTA CTGATGGGCG 5100 

TTTTTCCGGT GCCACAAGAG GTATTGCAGT TGGTCATATT TCCCCTGAAG CTGCATCTGG 5160 

TGGACCAATT GCCTTAATTG AAGATGGTGA TGAGATTACT ATTGATTTAA CAAATCGTAC 5220 

25 ATTAAACGTA AACCAGCCTG AAGATGTTCT AGCGCGTCGC CGAGAATCTT TAACACCATT 5280 

TAAAGCGAAA GTAAAAACAG GTTATCTAGC TCGTTATACT GCCCTAGTAA CTAGCGCAAA 534 0 

TACAGGTGGC GTCATGCAAG TCCCTGAGAA TTTAATTTAA TTTATTTTTA TATTGGAGAT 540 0 

GGTTAAAATG TCTAAAACTC AACATGAAGT AAACCAAAAT ATTGACCCTT TAAAAATGGC 54 60 

TGAATCACTT GAACCTGAAC AACTAAATGA AAAAACTTTA AATGATATGC GTTCAGGATC 552 0 

AGAAGTGCTA GTAGAAGCTC TACTTAAAGA AAATGTGGAT TATTTATTCG GTTATCCTGG 558 0 

TGGTGCCGTA CTACCTTTAT ATGACACGTT TTATGATGGT AAAATCAAAC AT ATTTT AG C 564 0 

AAGACACGAA CAAGGTGCTG TTCATGCTGC AGAAGGTTAT GCACGTGTAT CTGGTAAamT 5700 

GGCGTCGTTG TAGTTACAAG CGGTCCaGGT GCAACTAATG TAATGACAGG TATTACGGAT 5760 

GCACATTGCG ACTCTTTACC TCTAGTTGTA TTCACTGGAC AAGTTGCTAC ACCAGGCATT 5820 

GGTAAAGATG CATTCCAAGA AGCGGATATT CTATCTATGA CTTCACCAAT TACAAAACAA 5880 

45 AATTATCAAG TGAAACGTGT TGAAGATATC CCTAAAATCG TACACGAAGC TTTCCATGTA 5 94 0 

GCTAATTCTG GACGCAAAGG TCCTGTAGTG ATTGATTTTC CAAAAGATAT GGGTGTTTTA 6000 

GCTACAAATG TGGATTTATG CGACGAAATC AATATTCCAG GTTATGAAGT TGTTACAGAA 6060 

CCAGAAAATA AAGACATTGA CACTTTCATC TCACTTTTAA AAGAAGCGAA AAAGCCTGTC 6120 

GTATTAGCCG GCGCAGGTAT TAATCAATCA AAATCAAATC AATTATTAAC AGAGTTTGTT 6180 
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TTGTTTAGAA 


TCCGTAGTTA 


TTTCAGCTAA 


AACTTCATCG 


TTTTGCATCA 


ATGCTACTGA 


2700 


TAATGGTTGA 


TTCGATGTAT 


CAATGAGCAG 


CGAATTCATG 


GATAATTGCC 


TCCTTAATTT 


2760 


(jI I CAIAAIG 


TTCTCCTTGC 


GCGAACAACT 


CAATTTGTCT 


TGTATTTTCA 


GATATTGTTG 


2820 


AAATG TTAAT 


AGATAAATGC 


GTCGCTGGAA 


GTAAATCTTT 


TATAAATTGA 


CTCCATTCAA 


2880 


TAACAGTAAT 


TGCCTGATCT 


TCGAAAAATT 


CATCAAATCC 


TAAATCTTCA 


TCAGAATCTT 


2940 


CTAAGCGATA 


ACAATCCATA 


TGATGCAATT 


TTAAATTTT1' 


ACCCCTATAT 


GATTTAATGA 


3000 


TGTTAAATGT 


CGGGGAATTA 


ATCGTACGTC 


TTACACCAAG 


AGCTTTTCCT 


ATAAATTGCG 


3060 


TTAACGTTGT 


TTTACCTGCT 


CCTAAATCTC 


CGTTAAGTAA 


AATCAAATCA 


CCACTTTTCA 


3120 


ATTGCTCAAC 


TAAAAATATA 


GCAAATTGAT 


TCATTTCATC 


TAAATTATTT 


ATCTTTATCA 


3180 


ATGTTGATTC 


TCCTATATTA 


TGCTTTTCAT 


TCATAAAAAT 


GATTATCCAT 


TGTTCAATCG 


3240 


TATCTAACTT 


TATATTTAAC 


CTTTATATTG 


TAACAAATTT 


CAACTTAAAT 


TTCTTATCTT 


3300 


TGAAACAGAT 


TATCTATTCA 


AAGTTAATTG 


TAAGAAAATT 


TAAAATATTT 


GTTGACATAC 


3360 


TAAAGCAGAT 


ATAGTAAATT 


AAATTT AT CA 


AATTTTTAGA 


CAATTCTAAC 


TATTAAAGTG 


3420 


AT AT AT AC CA 


TTCACGGAAG 


GAGTATAATA 


AAATG CTTAA 


TCAATATACT 


GAACATCAAC 


3480 


CGACAACTTC 


AAATATTATT 


ATTTTATTAT 


ACTCTTTAGG 


ACTCGAACGT 


TAgTAAATAT 


3540 


TTACTAAACG 


CTTTAAGTCC 


TATTT CTGTT 


TGAATGGGAC 


TTGTAAACGT 


CCCAATAATA 


3600 


TTGGGACGTT 


TTTTTATGTT 


TTATCTTTCA 


ATTACTTATT 


TTTATTACTA 


TAAAACATGA 


3660 


TTAAT CATTA 


AAATTTACGG 


GGGAATTTAC 


TATGCGAaCG 


AgcATGATCA AAAAAGGAGA 


3720 


TCACCAAGCA 


CCAGCAAGAA 


GTCTTTTACA 


TGCCACGGGC 


GCGCTAAAAA 


GTCCAACTGA 


3780 


TATGAACAAA 


CCATTTGTAG 


CTATTTGTAA 


CTCTTATATT 


GATATTGTTC 


CTGGACATGT 


3840 


TCACTTGAGA GAGCTTGCAG ATATAGCTAA AGAAGCAATT AGAGAAGCCG GTGCCATTCC 


3900 


ATTTGAATTC 


AATACAATTG 


GTGTTGATGA 


TGGAATAGCT 


ATGGGACATA 


TCGGAATGCG 


3960 


ATATTCTCTA 


CCATCACGTG 


AAATTATTGC 


AGATGCAGCT 


GAAACTGTAA 


TTAACGCTCA 


4020 


TTGGTTTGAC 


GGCGTATTTT 


ACATTCCTAA 


TTGTGACAAG 


ATTACACCCG 


GTATGATTTT 


4080 


AG CA GC CAT G 


AGGACAAACG 


TACCAGCTAT 


CTTTTGCTCT 


GGTGGACCAA 


TGAAAGCTGG 


414 0 


CTTATCTGCA 


CATGGAAAAG 


CATTAACACT 


TTCATCAATG 


TTTGAAGCAG 


TCGGCGCATT 


4200 


TAAAGAAGGA 


TCGATTTCTA 


AAGAAGAATT 


TTTAGATATG 


GAACAAAATG 


CCTGCCCTAC 


4260 


TTGTGGTTCA 


TGTGCTGGGA 


TGTTTACTGC 


AAATTCAATG 


AACTGTTTGA 


TGGAAGTTTT 


4320 
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AACAATTAAT CGCTGAACAC CATATTCTTT 
CTCTACAACA CTGTTTTGAA AGCTCGTTGC 
5 TTGTCGTTGA TTGTGAAGTT GATTGATTAC 

ATAACTATCT TTATCCAACC AAACACGAGG 
CAACCGATCA ACTTGTGGAC CACCTGGATA 

w 

ATAAGCCTCA CCTACTGCGT CATCTCGTGT 
CTTCATATAA ACTAATTCAG TATGTCCACC 
TAATGGCTCT TCTATGTGAT TAGCATATAT 

15 

TGGCTnATCG TAAGCAAATG CCAATGCTTT 
AATTAGTCCA GGGCCTTCTG TAACCGCTAT 

20 ATCCCCTAGA GCCTCGTTTA TTGTTGCTGT 

TTCGGGAACG ACACCGCCAA ATCGTTTATG 
TAAAATATCT CTGCCATTTT TTATAACACT 

25 AGCTAGTATT AATATATCTT TAGTCATTTA 

CTTCACCATA ATAATTTTTA CGTTTACCAC 
GTTGTGCCAC TTTATTATTA ACT CTTACTT 

30 

CATAGTTTAT TCCGTATTTT AAAAGCATTT 
CGATTGCAAC TGTTGTAATT TGAGCTTGAT 
TAATTTGTTG TTCAAATTCt AAGACAAAAT 

35 

GATAAAATGC GTCAATTGTC CAAGAACTGT 
CTTQTGGCAC ATCTTCTTTA GTCATCTCTC 

40 CCAATTTCGT TCCGCCTCAG CTAATTTATG 

AGGTTTATCT AGCAATTGAT ACATGACTGA 
TTGTAATTCA TCTTGTAATT TTACAGTATC 

45 ATCTTCTAAA AAAGCTCGCA ATGCCTCTAT 

TAATTGACCA TTTTGCCACT GGAATATGCC 
CACAGGAACC AATAATTTAT CAGTATGATC 

50 

AACACCATAT AATTTAACAT CTAACGCATA 
ACGTAAGCCA GTATATGAAC CAGGACCTTC 



ACAAGCTTGA ATAGCTTTAA ACGTGAGCAC 900 

TACGTTAGCT TCAATGATTG GaATATTTTT 960 

GGCACTTTTC AACCCACTAA AACTAAAATC 102 0 

GAATGAATAA GTATCTTCAC CTTCAGCAGC 108 0 

ATTTAAACCA ATTGTTCGTG CCACTTTATC 114 0 

TTCACCAATG ACTTCAAATG ATAAATGATC 12 00 

TGAAACAATA AGTGCAATTA GCGGGAATGT 1260 

ATGTCCTGCA ATATGATGAA CAGGAATAAG 1320 

GGCTGCATTA ACACCTATTA GTAACGCACC 13 30 

GGCATCAATA TCTTCTATTG ATACATCGGC 144 0 

TATACCTTCA ACGTGATGTC TACTTGCCAC 1500 

ACTTTCAATC TGACTTAAAA CTGTATTTGA 1560 

AACGCTTGTT TCATCACAAC TTGTTTcAAC 162 0 

AATTCACCCA CATAACCATT GCGTCCTCAC 1680 

CATATTGAAA TCCTAAATTT TCATATACAT 174 0 

CTAAACTCAT CACATCACAA GTGTGACTTG 18 00 

GACCTAAACC ATAGCCTCTA TAATTATCAT 1860 

CGATAACAAT CCATAAACCT AAATAACCAA 192 0 

ATTTCGCAAA GTTATTTTGC TCTATTTCAT 1980 

CATTGAAACT CCGACGOTCA AGATCAAAGA 204 0 

TAATGTTTAA TTGTTCTTTT GACTGTTGAT 2100 

GTATTTAGGA GTAAATGTAT GTACGTCTGA 216 0 

TGCATTTGGT AGctGCGCAA TCACTTCACC 2220 

TTTCCCAATA TAAATAAATG GTTGGTTTAA 2280 

CGACATATAT TGATCTTCTA AAATAGTCAC 2 34 0 

TGTATAAACT GCTTGTCGTC TTGCATCAAA 2400 

GATTGTTGCT GCCAATGCCT TTAATGATGA 24 60 

CGCTAATGTT TTAGCAACAG TAACACCGAT 2 52 0 

AGCAACAATA ATCGCATCTA ATTGCTGTTT 2580 
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TAATACAAAT AATAAAGTTA CTAGAAAAAT GAGTGTCGCT AAAGTTGTCA TCATTAGCAT 13 80 

TCACCAGTCT TAAGGTTATG ACAAATACAT CGTTGGTTAG AGGTATGAAC CTTAGACAAG 144 0 

TTATTAATTA CGGACTCAAA AATATTATGA TTgAGCTGGT ATAAATGTTT ATTTCCGATT 15 0 0 

TTTCGTGTCG TAACTAAGTT GGTTTTTACT AATGCTTTCA TATGrTAGCT AAGTGTAGGT 156 0 

TGAGAGAATT GAAAATGTGC TAACAAATCA CAAGCGCATA ACTCTCCACA AGAAAGTAAA 162 0 

TCTAGTATTT CTAATCTGCT TGAATCTGAT AAAACTTTTA AAAATGTTGC TAGTTCTTTA 16 80 

TACGTCATAA CATACCTCCT AGACGTTAAA TAGATTATCA TCTATATAGA TGAATGTCTA 174 0 

TGTTCCTTTG GTATATTACA CGATATGACT ATGTAATTTA AATTTGGTTT TAG T ATT AAA 1800 

AGGGTATTAA AGATAAATTA TAGATATTGA TTTTGCAAAA TATACTCTTT GTTCTGCATT 186 0 

GAAAAAGG 1868 
20 (2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

30 

ATTTATGAAA TCCATAGCnA TAAACATTAT TCTTGCATCG GCTATACAAA CAGTTACCGC 6 0 

AAGCAAATTT GTATATCAAC CTGGAATTGT GTTCACGTCA ATGGCaAATG CCGATGATGT 12 0 

GTTATCAGGC GATAGTTATT TTATGGCTGA ATTAAAATCT ATTAAGCGTA TTGTTGAAAT 180 

35 

TCCAGATAAT CAAAAAATAT ACTGCTTTAT AGATGAAATT TTTAAAGGTA CCAACACAAC 24 0 

TGAACGAATT GCCGCTTCAG AATCAGTACT ATCATTTTTA CATGAAAAAT CTAACTTTAG 3 00 

4Q AGTTATTGCA GCAACACATG ATATTGAGTT AGCTGAACTC TTAAAACAAC GTTATGAAAA 3 60 

TTACCATTTC AATGAGGTAA TAGAAAATAA TAACATACAT TTTGATTACA AAATTAAG C C 420 

TGGCAAAGCA AATACACGTA ATGCCATCGA ATTATTAAAA ATCACTTCAT TTCCAGCAAA 4 80 

4 $ AATATATGAA CG AG CAAAAG ATAATGTCCC GAAAATTTAG CATTTAACTT TAAACATAAA 54 0 

AACGTCAGCT ATCACATGAC AGAAGACTAT GAACAGTTTC AATAATGTTC ATAGTAATCA 600 

TGTTAATAAC TGACGTTTAT TTTATTCTGC AGAATACTCT TCTAAATCTA TATTGCTGTG 66 0 

50 

CCCATTTAAT GCTAAATCAG CAAATCGACC TTGCTGATAC AAATAGTGGC CGGCAACGCC 72 0 
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GGATGAAAAT GTATATTTAA TGGATAAAAT ATCCTAATTT AGCATAAAAA AATGTTTTAA 912 0 

TAAAAGTATT ATTTGATATA ATCGATTTAT GTTTTGTTAC TGCTAAAAAA CATGTGGCG 917 9 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 



20 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CCTTCAGCCA TTTGACTTCG ACATGAGTTG CCTGTACATA TAAAATAAAT TGTTTTTTTA 6 0 

GTCATAACAA TCTCCTAATT AATTAAAATA TGATAAGTGT TAGATACAAC CCTATGAGGG 120 

TTATAAATAG TACTGGAATT GTAATGATGA TACCAGTTTT AAAGTATGTG CCCCAAGAAA 180 

TCTTAACATC TTTTTGtGTT AAGACGTGTA ACCACAGTAA TGTAGCTAAA GAGCCTATCG 24 0 

GTGTAATTTT T GG AC CT AAA TCAGAACCGA TAACATTCGC ATAAATTAGG CCTTCTTTTA 300 

25 ACATGCCATG GACATTTGAT TGACCAATAG CAATCGCATC TATTAAAACT GTAGG CATAT 360 

TATTCATTAT TGATGATAAA AACGCTGAAA TGAAGCCCAT TCCCAAAATA GTGCTAAATA 420 

GACCGTAATT GGAAATATAT TCTAATATTT TAGCCAATAT TAAAGTAATG CCAGCATTTC 4 80 

TTAAGCCGAA TACGACGATA TACATAC CAA TTGAAAATAA TACTATATTC CAAGGTGCGC 54 0 

CCTTAATGAC TTG CTTAATA TTTACAGCAT TTGATTTACG AGC CAACATT AGAAAAATAA 6 00 

AAGCAATGAT TCCAGTGAAA ATTGATACCG GAATTTTAGT AAATTTACTG ATTAGATAGC 660 

CGAAAAGTAA TATAACTAGA ACAATCCaTG AAATTTTAAA TAGCTTTAAA TCATTAATGG 720 

CATCTTTAGG ATGCTTTATA TTATTATCAT CAAACGTTTT AGGTATCGCT TTTCTAAAAT 780 

ATAACCACAA TACTATAATA CTTGCTAAAA GCGAGAATAA ATTAGGTATA ATCATTCTAC 84 0 

TAAAATATCG AACGAATCCT ACATGAAAAT AATCAGCAGA TATAATATTC ACTAGATTGC 900 

TCACGATTAA AGGTAAAGAA GTTGTGTCAG CTATAAAACC ACTCGCAATA ATnAAAGGGA 96 0 

ATATGGCCCG CTTACTAAAA CCTATATTTT TAACCATCGC TAATACAATA GGCGTTAAGA 102 0 

TTAAc GTGCG CCATCATTTG CGAAAAATGC AGCAACAATG GCACCCAATA ATATGATATA 1080 

AACGAACATT TTTAAACCAT TGCCTTTTGA AGCATGAAGC ATGTGAATAG CTGACCATTC 114 0 

50 GAATAATCCA ACTTTATCTA ATATTAATGA AATAAGAATG ACTGAGACAA AAGTCAAAGT 1200 

AGCATTCCAA ACAATACCTG TTACTTCGAA AACATCGGAA AAACTTACAA CACCAGTAAT 1260 
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A A T A CC A TOT* 


r*f* A A P A TO A P 

GGAACATCAG 


GTATTTACTA 


TACGAATCAA 


CTTCATGTCT 


CTGTTGAAGA 


7320 






TGG1ACAGTAC 


CATGTTATTT 


ATGCGCAATT 


ATAGCAATTA 


TCTATGGTTT 


7380 


5 




PPTATTH AAA 

VjCj I A I I AAAA 


AGTCATCGAA 


TTCACGTTTA 


ACTTAATGTG 


AGCGTGGAAT 


7440 




AlrtiAlAAIA 


X GI^GAAACA 


CTTTAATCAT 


TTATAATTGT 


AGCGGTTATA 


ATTTGAAAAG 


7500 


10 


bill I AAV_ 1 I 


AGAATAAATA 


TCCTCTATGC 


ATATACTGAA 


TATGTTTTGT 


AGCGGAACAT 


7560 




PTA ATPTA A /*• 

G TAATGTAAG 


TTTTATGTCA 


TGATTTGTAA 


TGACTAAATT 


AATTGAGAAT 


7620 




1 1 IoAA(j<j(_AA 


GTATATTTGT 


AAGTACTTTA 


ACTAAAAATT 


TATCAATGTA 


TAGCCGATTT 


7680 


15 


UAIJA I GLCTA 


AATTTGGGTG 


TGTCAATGGC 


TGTATGTTGT 


TTATTCTTTA 


TTACAGAGTG 


7740 




AA 1 (-VjvjA 1 lXj 


Gl GAAAATCG 


AAATTTTGAG 


ATTTTTACCA 


ATTCGATTTT 


TTTCATAGAA 


7800 




a T ,r P n sin ji a/* 1 
nl IAAAAAALj 


L CAACAAG G C 


TCTTGAAACC 


TTGTTGGCGT 


AAACATAGCC 


ATCACTAATT 


7860 


20 


Avj 1 (j AA I G AA 


H'TT A T* A A OP A 

GTTATAACCA 


GCAGCTTGGC 


TAGCTGAGAT 


TGTACGTGAA 


GTTACAACAC 


7920 




(JTGGGCCATA 


ACCATAGTTC 


ATTTCTGAAA 


CTCTTACTGA 


ACCATTGCTG 


TTAACACTTT 


7980 




L-AACGTATGC 


AACGTGACCG 


TATGCACCTT 


GAGTTGTTTG 


CATAATTGCA 


ccagcititc 


8040 


25 


GTGTATTGTT 


CACTGTGTAA 


CCAGCTCTTG 


CAGCTGCGTT 


AGCCCAGTTA 


CTTGCATTGC 


8100 




CCCAAGTTGA 


ACCGATTTTA 


CGACCTACAC 


GATCAAATAC 


GTAGTATGTA 


CATTGACCAG 


8160 




AAGTGTATAA 


GTTACGTCCT 


GAAGTATAAC 


CACTTGAGAT 


TGAACGGCCA 


TTTGATGATG 


8220 


30 


GAGCCATAGT 


TGTAGTTACT 


TGAAGATTGT 


TGCTTGAAGT 


GCTGTAGCTT 


GCACCTAAAC 


8280 




(JALCAG 1 ACG 


GTAGCTGTTT 


GTGTTGTAAC 


TATTATAGTT 


ATTGTAGTTA 


TATGATTGAT 


8340 


35 


TATTATTTGA 


G TAGTTGTTG 


TAACGGCTGT 


AGTTATTGTA 


GCTATAACCG 


TTGTTGTAAT 


8400 




TG TTAT AGTT 


ATTGTAACCA 


TTGTAGTAGT 


AATAGCTGTA 


GTAGCCATTA 


TCTTGGTTTA 


8460 




A 1 TGACTTGG 


ATGCCAGTTA 


CCTTTCCATG 


TGTAATGGTA 


GTTACCTTGT 


GCATCAATAG 


8520 


40 


TPT A A pm^ T»A 

l\j 1 AAGTATA 


GCTATATGAT 


GTTGGGTCGT 


TTGGATTATA 


ACCGTAGTTA 


TCTTGCTCAG 


3580 




AAbLAI uACjL 


TTGATTTCCT 


GATG CAATTG 


CGATTGTAGC 


GAATCCTGCA 


GTTGCGATAG 


8640 




TAP pfPT A PP 

I AbL ibl Al?C 


GATTTT CTTC 


ATTTTAAAAA 


TATCCTCCTA 


AAAATTTTAA 


ATCTAAAATA 


8700 




TTTTPPT A A T 

III lcGIAAT 


GTCCGTGTGA 


CAAAATTAAT 


GTTATAAGTT 


ATCTCTCGTA 


ATTAAACGAC 


8760 




AAGAAAGACT 


ATAACAGAAA 


TTAGCGTCCT 


TGTGTGCTTT 


GTTAACGTTT 


TGTAATTTTT 


8320 




TGCTAATATC 


TTGACACAAT 


AGAATTTTAA 


AAGTATAGAA 


ATTTGCATTT 


TGCAAAACTT 


8880 


50 


ATAACTACGG 


CATTCTTTGT 


GAAAACTGAA 


TGTTTCGAAA 


ATAAGTCTGT 


TACAAATTTG 


8940 



CCAACATTAT TGAATGTCGG TCGCATTGAT 
GGAATTACTA AACATACAGT TAGATTACTT 
5 GGCCGTAGAT TAGGTTTTGA ATTATCAACA 

TTAGAACGTG ATAAAGAAGA TGAACCATTA 

TCACAAATTC CAGGACCAAA TCATGTAGAA 

10 

GGTTTAGTAC TATGGTCAAG CTTAGGTCGT 

GCAGTAATTG TAATTGCATC AACCATTTTA 

GTTGAAGAAA TTGGTTTAGA TAAGCTTGAT 

15 

TGAAGATAGA AAAGGATATA GCATTATGCA 
CATCAATAAG CGnAGGGGAC GATAGATGAT 

20 TTCGACAATG ATTGTAATGG TTATTGTTGT 

AATTCAAGTA TTATTAATTA TTGCCTCAGC 
CTTAACATGG CAAGATTTAG AAAAAGGCAT 

25 AATTTTAATT AT ACT AG CGG TAGGAATTAT 

GCCAGCCTTG ATTTATTATG GCTTAGATTT 
CTTTTTTATA AGTGCTGTTA CATCTGTAGC 

30 

TGCAGGGATT GCACTTATTT CTATTGGTAA 
GGGTGCTATT ATAGCAGGGG CTGTGTTTGG 
TAATTTAGCG GCGCTTGTTA CTAAAGTTAA 

35 

GACGACGATA CCTGCATCAA TCATAGGTTT 
TAAAGGGCAT TCAAATGATA AACAGATTCA 
TCAAATTAAC ATATGGGTCT GGGTTCCCTT 

40 

AATGGCTACA GTGCCAGCTA TGCTAATATC 
TAATCATCAT TTCAAAATGA CAGATGGTTT 

45 AATGATACAT CAGTCTCATA TTTCATCCAG 

GATGAGTATG ACCCAAATAT TAGTAACGAT 
AG AAAAAG C A GGATGTTTAG AAGTCTTATT 

50 AGGAAGTTTA ATATGTATTA CTGTTATTTG 

TGCTTCGATT GTAATTATTA TGGTCGGTGT 
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TATGCTGGCG AGTTCGCTTT ATATAAAGAA 552 0 

CATGCAATCG AATTAGAACG TTTGAATTTA 558 0 

GCTAAAGAAT CACGTATTGA ACGTGGTTAT 564 0 

AATCGTTTGT TTAATACAAG CCCAGTATTT 57 00 

AGCAGATATT TAACTGAAGA TATTGCATAT 576 0 

GTTATTGATG TACCGACACC AAATATAGAT 582 0 

GAGAGAGACT TCTTTGAGGA AGGCTTAACA 58 8 0 

TTAGAAAAAT ATTTAAAATA AATGATGGCT 594 0 

AAAGCAATAA ATTGAAGAAA AGAGGTTTCT 600 0 

GAAAAGAAAA CCCACCTTTT TAGAATCAAT 606 0 

TGTAACAGGC TTTGTGTTTT TTGATATTCC 6120 

ATATGCCACA TGGATTGCAA AACGTGTAGG 618 0 

TGCAGAACGT TTAAATACTG CAATGCCTGC 624 0 

AGTAGGCAGT TGGATGTTTT CTGGCACAGT 63 0 0 

ATTGAATCCA AGCTATTTTT TAATATCAGC 636 0 

AACTGGTACA GCATGGGGCT CTGCATCAAC 64 2 0 

TCAATTGGGG ATTCCTCCAG GGATGGCAGC 64 8 0 

CGATAAAATG TCACCATTAT CAGATACAAC 654 0 

TATATTTAAA CATATACATT CGATGATGTG 66 00 

ATTAGTATGG TTTATTGCTG GATTTCAATT 6660 

AACTTTGTTA TCAGAGCTTG CACAGATTTA 6720 

AATTGTGATC ATTGTTTGTT TGCTATTTAA 6780 

AAGCTTTTCT GCCATTATAG TGGGGACTTT 684 0 

CAAAGCAACA TTTAGTGGTT TTAACG AAT C 6900 

TGTGAAAAGC TTGTTAGAAC AGGGTGGTAT 6 960 

ATTTTGCGGA TATGCATTTG CAGGTATTGT 702 0 

AACTACTATT TCTAAAGGCA TCCATTCTGT 7 080 

TTGTATTGCG CTTGTATTCG CTGCAGGTGT 714 0 

GTTAATGAAA GATTTGTTCG AAAAATACCA 7200 



EP0 786 519 A2 

AGCAGCAGCC CAATTATTAG CATTTCCCCA AGTAGAACCG ATTTCTCCGC CAACTTTATC 3 720 

ATATACATAC CAAGTACATT GTCCTGCAGT GTATAAGTTA CCAGAATGTG AAATTGATGA 37 80 

5 

TGTAGTTGTC GTAGTTGTCG TAGTCGTTGT AGTTTGAGTC GTGTTGTAGT TATAGTTGTT 384 0 

GTAATTTGTA TAATTTTCAG CAGCATCTGC ATGATGTGCT TGACCTACTA ATGCTGTGCC 3 90 0 

GATTCCTGCT GTTAACGTAG TTGCTGTTAC TAATTTTTTC ATGAATAAAG TCCTCCAAAG 396 0 

10 

TTCTATATCT TTTTTTATAA ATAAAACGTA GCGACTGTTT TATTCTCACA TCTCGAATTG 4 02 0 

ATGACAATAG TTACTTTAAC AAAATtAATG cTTCTTGTGG GGAATGTTAT TGATTTGTAA 4 08 0 

15 AAGAATAAAA AAACTTTGAC TAATTTTGTA ATAAAAATTA GTCAAAGTTA CAATGAGATT 414 0 

AACAGATAAT TAATAGGAAA TATTTATTTG TAATATGTTT AAATAAATCG AATTGTTAAA 4200 

GGTATTATAT ATTCTTGGCC ATTATAATAT TTGACACACG CAATAATTGT GAATACAAAA 42 60 

20 GATAATATTG AGAAAGCGAA TATGGATAAA ATAC CGATAA ACGTAATGAT GAAACCTATA 4 32 0 

ATAATAATGA AATCAATATC TGTAGCAATT AGGAAAACGC CTATTAAAGT GATAACGACT 43 8 0 

AAAACGATAG ACCAAATAAT ATAAGAAATC GTATAGTTAA GATAATTTTT TCCAGCACGA 444 0 

25 TCAACTAGTT TCG ATT CATC TTTTTTCAAT AACCATATTA TCAGTGGACC AATAATAGAT 4 500 

GTGAATAAAC TTAATAAATA G ATAAGCAT C GCCATAATGT TCTCATCATT GGATTTGCGA 4 560 

TTCGGTTGAT GATTTGTTAC GTCGTTCATT TCAGTTGTCA TATTAGACAC TCCTTTGAAA 4 620 

30 

ATTGTAATAT TATCTTTAAC TATAACAAAA TATAATCAAA AATAAACATG TTTATTAAAC 4 680 

AATTATTAAA AATAAAAATA ATTGGTGGAC GTCGGCGTTT AAATAGGTTA ATTTAAGGTT 4 74 0 

ATATATACTT AACATTTATA ATGATGCGTA ATGAATTCGC ATCATTTTTA TATTGTCTTA 4 800 

35 

CGTATAATTT GTTTTTAATT TTAACCAAAG ATAGAAAGAG GGTTGTTTAT GAAAATAGCA 4 860 

ATTGTAGGAT CAGGAAATGG CGCAGTTACG GCAGCAGTAG ATATGGTGAG CAAAGGCCAC 4 92 0 

40 GATGTTAAAT TATATTGTCG TAATCAATCT ATAAGTAAGT TTCAAAACGC AATCGAAAAG 4 980 

GGCGGATTTG ATTTTAATAA TGAAGGTGAT GAACGTTTCG TAAAATTCAC TGATATTAGT b04 0 

GATGATATGG AATATGTTTT AAAAGATGCT GAAATTGTTC AAGTGATTAT TCCATCTTCA 5100 

45 TACATAGAGT ATTATGCTGA TGTAATGGCA GAGCATGTAA CTGATAATCA GTTGATATTC 516 0 

TTCAACATGG CTGCAGCAAT GGGGTCAATT CGTTTTATGA ATGTTTTAGA AGATAGACAT 52 2 0 

ATTGAAACAA AACCACAACT AGCGGAAgcT AATACGTTGA CGTATGGTAC GCGTGTCGAT 52 80 

50 

TTTGAAAATG CAGCAGTTGA TTTATCTCTA AATGTACGTC GTATCTTCTT TTCAACATAT 534 0 
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15 



25 



TAAAATACGT GTCAAACATA CTGCAAAAGT AATTAAGCGT TCTAGAAAAA TCGGTAAAAT 192 0 

TGCCCAATAT CGTAGTCGTT TATTTGTTGC AGTTAGAAAT CGTATTATGA AAATGATGCC 1980 

AAATG CATT A GCAGCTGGAC AAACTAAATT CTTATATAAA TCGAAAGAAA AATAATACAA 204 0 

CAATATGAAA ACCCCCGTAT GTTGAAACGA GAGCTCAACA TATGGGGGTT CTTGTTTTTA 2100 

TAATGTTATT ATAATAAATT CAATTATTAG TTAACGACAA ATTGTGGTTT CTCACCTTGA 2160 

ACGGCACTAA TTGCAGCATT AGCAACAATT TTAGACATCA TGTCACGTGC TTCAAATGTA 222 0 

GCATTACCAA TATGCGGTGT TAATACTACA TTATTAAGTG ATTTTAAGTC ATCGGTAATA 2280 

TCTGGTTCAA ATTCATATAC ATCAAGTGCA GCACCTTCAA TTTCATTATC TTTCAATGCT 234 0 

TGCACTAGTG CTTGTTCGTG CACGATTGGA CCACGAGAGG CATTGATTAA ATACGCCGTA 2400 

GATTTCATCA TTTTAAATTG TTCTGTATCA ATTAAATGAT GCATTTTAGG ATTATAAGCA 24 60 

20 GCGTTGATAG TGATAAAATC TGCATTCTTT AATAGTGTAT CTAAATCTAC ATATTTTGCA 2520 

CCGATTTCTC GTTCTTTTTC TTCTTTGCGA TTAGGTCCAG TGTATAGCAC ATCCATGTCA 2580 

AATGCTCTTG CACGACGAGC TACTGCACTA CCAATTTCAC CTAAACCGAT AATGCCGATT 264 0 

GTTTTCCCAG ATACTTCTCT ACCTCTGAAA AATAAAGGTG CCCATCCATC AAATCCAGTT 27 0 0 

GTACGTGATA ATTGGTCCCC TTCAACAATA CGACGCGCTA CTGCAAGTAC TAAT CCAATT 27 6 0 

GTTAAATCAG CAGTCGCGTT TGTTGATGCT TTAGGTGTGT TTGTAACATC TATACTTTTT 2 820 

TCTCGGGCAT ACTCGATATC AATATTATTA AAACCAGCGC CATAGTTGGC AATGATTTTT 2 880 

AAGTCTTTAC CAGCATCGAT AACATCTTTA TCAACGTTTG TAGATAATAA ACTAATTAAG 294 0 

GCAGTCGCGT TTTTAACACC TTTAATTAAA GTGTCTTTAT CGACTAATCC TTTACCTTCA 3 00 0 

TACATTTCAA CTTCAAAATG TTCTTGTAAA AGTTTTAAAC CTACTTCTGG TATtGCACCA 3 06 0 

gCAACATAAm CTTTTtCCAT AAAAGAtCAC TCCTTTTATC TTAGTATAGT AGAAGATTAG 3120 

ACAGTATACA ACTATGTCAT GATGTCTTGT GTATCAATGA TGTAAGCGCG TACTTTTGAT 3180 

GGAGGCGATA TAACTTAGGC ACTGTAGAAC TATGAATATT GTAATGTGGA AAAACTGGAT 324 0 

CAATTAAATT AGATAACGTA GTTTTAAAGT TAATAGTATT AGAAAAAATT AATATTTTGA 3 300 

ATATGGGAGG AAATATAAAT AAGTAGGTGG CAACGAAAAA TAGCAAAAAA AGAGCTTCTC 3360 

CTATAAAGGA AAGCTCAAAG TTTTTTGATG ACATATGTAC TAGAATTAAG TTTCAAGACA 342 0 

ATATGTATCA TCGTGTTTAT ATTAAATATG GATGTAGTTG TAGTTACCTG CTTCACTTGC 34 80 

AGAAATAGTT CTAGAACTTA CTGAGAAAGG TCCGCCACTA TAATTCATTT CTGAAATTGT 3 54 0 

AACTGAACCA TCACTGTTTA CACTTTCTAC ATATCCAACG TGACCAAATG GTCCTTCAGA 3600 
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ATATGAAGTA 


TATGGATATC 


ACAGAaAAAG 


TGCCAATGTC 


GG AATCTGAA 


120 




TGCTAAAAGG 


TAAGGGGATT 


TTAGAAAATC 


GAGGGAAAGT 


TTTTCTAGAA 


180 


VjL 1 LAAGAAA 


AATATGAGGT 


TAATGTCATT 


TATCTTGTTA 


GCCATGCATT 


AGTAGAAACA 


240 


Vjvj I AAL GVj LA 


A A A 7\ A TT 1 

AATCAGAATT 


AGCAAAAGGC 


ATTAAAGATG 


GGAAAAAACG 


CTATTACAAC 


300 


I I 1 I I IGGIA 


T AGG AG CATT 


CGATAGTAGT 


GCTGTTCGTA 


GTGGGAAAAG 


TTATGCTGAA 


360 


a a fir* a a pa rt 

AAGGAAv_AAI 


GGACATCAC c 


AGATAAGGCG 


ATT ATTGG TG 


GTGCAAAGTT 


CATTCGTAAT 


420 


G AA 1 A I I I I G 


A A A A /"»TV IV mnik 

AAAACAATCA 


ACTGAATTTA 


TATCAAATGC 


GATGGAATCC 


AGAAAATCCT 


480 


GL GLAALAT C 


AATATGCGAG 


TGACATTCGC 


TGGGCAGATA AAATTGCCAA 


ATTAATGGAT 


540 


A 7\ TV r T , / r "V "1 'TV T* A 

AAA rCC 1 ATA 


AG CAG TTTGG 


TATAAAGAAA 


HA THAT ATT A 


GACAAACATA 


TTATAAATAA 


600 


GAGA I GGG 1 G 


CTTAAAGGAG 


CTGGAACAAT 


TT A TTflTTT P 

1 X f\ ± 1U 1 X 1. W 


GAGCTCCTTT 


AG CG CATT CT 


660 


GAGTG TG XT A 


GTTAAATGGA 


TTTTAACCTA 




CTATATAGCA 


TCAAATATGC 


720 


TATATCCCAC 


ATCATTGtTA 


CAAATGTACA 


TfJATfiTAAA T 


GAATATTGCT 


GTCTAAATGT 


780 


GCATGTAATA 


TACAATGGTG 


CAGATAATAC 


Ac ™ i "l* a a t 1 ^^ 


TTAAAAATGA AACGTTAgTT 


840 


CCAAGAGTCA 


TTTTTAAACA 


ATAGTGCATG 


TGATAAAATA 


GAAAAGAATG 


AAAAATATAG 


900 


AGGTGACAAT 


ATGAAGATAG 


CAATTATAGG 


TGCAGGCATC 


GGTGGATTAA 


CAGCTGCTGC 


960 


ATTATTACAA 


GAACAAGGTC 


ATACTATTAA 


AGTCTTTGAA 


AAAAATGAGT 


CAGTTAAAGA 


1020 


AAI xGGLGLI 


GGG ATTGG TA 


TCGGAGATAA 


TGTGCTTAAA 


AAACTAGGTA 


ATCATGACTT 


1080 


a /*■» /"""p a t\ a r^r^ r v 

AGC i aaaggt 


A ' 1 " J ' A A A A A T^fi 

ATT AAAAA TG 


CTGGGCAAAT 


CTTATCTACA 


ATGACAGTGT 


TAGATGACAA 


1140 


AGA I GGCCTG 


TTAACTACTG 


TTAAATTAAA 


AAGTAATACA 


TTGAATGTGA 


CGTTACCACG 


1200 


^ O A A A f~* A T>T> A 

L LAAACATTA 


ATTGACATTA 


TTAAATCTTA 


TGTAAAAGAT 


GACGCAATAT 


TTACAAATCA 


1260 


TGAAGTCACG 


CATATAGATA 


ATGAGACAGA 


TAAAGTTACC 


ATACATTTCG 


CGGAACAAGA 


1320 


A A PTP A A I""* A 

AAG 1 GAAGCA 


TTTGATTTAT 


GTATTGGTGC 


TGATGGAATT 


CATTCTAAAG 


TGAGACAATC 


1380 


IG1 AAA1GL 1 


GAGAGTAAAG 


TATTATATCA 


AGGGTATACA 


TGCTTTAGAG 


GTTTAATTGA 


144C 


I G A 1 ATTG AT 


TTAAAGCATC 


CGGaTTGTGC 


AAAAGAATAC 


TGGGGaAGAA 


AAGGaAGAGT 


150C 


AGG I A TTGTT 


CCGTTATTAA 


ATAATCAAGC 


ATATTGGTTC 


ATTACAATTA 


ACTCGAAGGA 


1560 


AAACAATCAT 


AAATATAGTT 


CGTTTGGTAA 


ACCTCATTTG 


CAAGCATACT 


TTAATCACTA 


1620 


TCCAAATGAA 


GTTAGAGAGA 


TCTTAGACAA 


ACAAAGTGAA 


ACAGGTATCT 


T ATTG CAT AA 


1680 


TATTTATGAT 


TTGAAACCAC 


TCAAATCTTT 


TGTTTATGGT 


CGTACTATTT 


TACTAGGAGA 


1740 
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TTATTGCTAA TTACGTTAGG CGTCATGACC GCTTTTGGCC CACTAACTAT AGATATGTAC 432 0 

GTACCATCAT TACCTAAAGT GCAAGGTGAT TTTGGTTCTA CTACATCAGA AATTCAATTA 43 8 0 

ACATTATCAT TCACAATGAT TGGTCTTGCA CTAGGCCAAT TTATCTTTGG ACCTTTATCC 444 0 

GATGCTTTTG GTCGCAAACG GATTGCTGTA TCCATTTTGA TCATTTTCAT TTTGGTATCA 4 500 

GGTTTGTCTA TGTTTGTTGA TCAATTGCCA TTATTCTTAA CTTTACGATT TATTCAAGGT 456 0 

TTAACTGGTG GTGGCGTCAT CGTGATTGCA AAAGCCTCTG CTGGTGATAA ATTTAGTGGC 462 0 

AACGCACTCG CTAAATTTTT AGCATCTTTA ATGGTAGTTA ATGGCATCAT CACTATTCTT 4680 

GCACCATTAG CCGGTGGATT AGCTTTATCC GTAGCAACAT GGCGTTCTAT TTTCACAATT 474 0 

TTAACTATTG TGGCACTCAT CATTTTAATT GGCGTCGCTT CTCAATTACC TAAAACATCT 4 800 

AAAGATGAAT TAAAGCAGGT GAATTTTAGT AGCGTCATTA AAGATTTTGG AAGTCTTTTG 4 86 0 

AAAAAACCAG CATTTATTAT TCCAATGCTA TTACAAGGwT TAACTTATGT AATGCTATTT 4 92 0 

AGTTATTCAT CTGCATCGCC ATTTATTACT CAAAAATTGT ATAATATGAC ACCCCAACAA 4 980 

TTTAGTATCA TGTTTGCTGT TAACGGTGTA GGTTTAATCA TTGTCAGTCA AGTCGTTGCT 504 0 

25 TTATTAGTAG AAAAATTACA TCGCCACATA TTATTAATCA TTTTAACTAT TATACAAGTG 5100 

GTAGGTGTTG CTTTAATTAT CCTGACACTT ACATTC CATT TACCACTTTG GGTCTTACTC 516 0 

ATCGCATTCT TCTTAAATGT GTGTCCTGTG ACGTCAATTG GACCGCTTGG TTTCACAATG 522 0 

GCTATGGAAG AACGAACAGG TGGCAGTGGT AACGCATCAA GTTTACTTGG CTTATTCCAA 52 8 0 

TTTATCTTAG GTGGCGCTGT TGCACCATTA GTTGGCTTAA AAGGCGAATT TAATACATCA 534 0 

CCATATATGA TTATTATCTT CATTACAGCC ATTCTATTAG TCAGTCTACA AATCATTTAC 54 0 0 

TTTAAAATGA TTAAAAAGCA ACATGTCGCA TAACACTTCA ACATAATTAG AACCCTAGCA 54 60 

AAGATATCTA TCTTTGTCAG GGTTCTTCTT TATGAATTAT GAGATCGAAT CTTCAACTAA 552 0 

AATTACGCCT TCATAGCAAG GACATTTCTA TTCAATCACC CTTTAACAGG CATCCAAATT 558 0 

TcTGTAATAT ATTTTTCACT TGTAGTATCA CCAT 5 614 
(2) INFORMATION FOR SEQ ID NO: 100; 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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GCGGCAGCGT CAGGTTTGGA TTTATTAGAA 
TCATATGGTA CCGGTGAACT AATTAAAGAT 
5 TTAGGGATTG GTGGCAGTGC AACAAATGAT 

GTAAAGTTTA CTGATGTAAA CGGGGACTTA 
ATTGCACAAA TCGATATAAC CAATCTAGAT 

10 

GCCTGTGATG TTTCAAATCC TTTATTGGGT 
CAAAAAGGCG CTGATGCAAA GATGATACCA 
GATAAGATAA AAATGTGCAC AGGAAAGTCC 

15 

GGCGGTATGG GCGCAGCATT ATTAGCGTTT 
GTCGTCTTTG ACATTACAGA TTTTCATCAA 

20 GGAGAAGGAC GCATGGATTA TCAGACCATC 

GCTGCAAAAC AATATCATAT TCCTGTCATC 
CAACATGTTT ACGATTTCGG TATTGATAGT 

25 TTAGAAGATG TCCTACAAAA TAGCGAACAA 

CGTATTCTGA AATTACAATA ATGTCAAAGT 
ACTTGAATGA GGTGAAACCC ATGAAAAGAA 

30 

ACAATCAAAA CCAAAATCAT CGTCGTCAAT 
CTAAAGGCGA TCCTGAAGAA CACCCGGAAC 
AACAAATTCT TGAAGAAGAA AACGAGAAAT 

35 

TTATTGCCAT TCTCTTAATT ATTGTCGCTA 
ATAGCGATAA AGTTAGTAAT gaccctaaag 

40 atcaagacgg ccaaattaac cagcaagtag 

aaaaaactga tgacattatt aaaaatttac 

aacaaaacaa agctgattct aagctaactc 

45 cagaggcaaa taatgcactt aaaaacaatg 

atgatattaa tacaaaattc gacagtatta 

acaatggtgg cgctaattaa ttattacacc 

50 

ctttatctgt atcactacgt tattcgtgat 
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AAAGAGGAAC GTAATCCTTT ATACACATCA 2 5 20 

GCATTAAATC ATGGTGCTAA GACCATTATT 2 58 0 

GGTGGTACAG GTATGCTAAG TGCACTAGGC 2 64 0 

TTACAAATGA ATGGTGCTAA TCTTGCTCAC 2 7 00 

TCGCGATTAA AAGAGGTGAC CTTTAAAGTG 276 0 

GAAAATGGTG CTACCTATAT TTATGGTCCT 2 82 0 

AAGTTGGATT TCGCAATGTC GCATTATCAT 2 880 

GTTAATCAAA TACCAGGTTC TGGTGCAGCT 2 94 0 

TGTGAGACAA CTTTAACAAA AGGTATTGAT 3 000 

AGAATTAAAG ATGCAGACCT CGTTATTACT 3 06 0 

TTTGGTAAAA CACCCGTAGG CGTTGCGTTA 312 0 

GCGATTTGTG GCAGTCTAGG CGAAAATTAT 3180 

GCCTATTCTA TAATCTCTTC ACCTAGCACT 324 0 

AATTTATTAA ACACTGCAAC TGACATTGCT 33 0 0 

AAATCATCAG CTTTATTATT TGCAGTTAAA 336 0 

CTGATAAATA CCGTGATTCA TATCAATACG 3 420 

CTGAAGACGC ATCGTATAGA CAACAATATG 3 4 80 

GATACTATAA TGGTAGAGAT TATCGAAGAG 3 54 0 

CCCGCCGTTC AAAAAAATGG TTATATATCA 3 600 

TTTTTGTCAC ACGCGCCTTA CTTAACAATG 3 660 

TCTCTCAAAA TTATAAAAAA CAAGTTGAAA 3 72 0 

AT AATG CTAA AGAAAATATT AAAAACAACC 3 78 0 

AAAATCAAAT CGACAACTTG AAGCAGCAAG 3 34 0 

AATTTTATCA AGACCAAATC AACAAATTGA 3 90 0 

CAAGCCAAGG TAAAATTGAA AGCATGTTAA 3 96 0 

AATCTAAATT AGAAAGCTTA TTTAAAGATG 4 020 

TGCTTTGATG ATAAACATTA ATTCCCTATA 4 08 0 

GATGCATTAA GAGTATAGGG ATTTTTTATA 414 0 
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ATATGTAACT CCTkTCAATT AATAATCTAA 
GATGATATAC ATAATATAAA TTTGTTATTT 

5 

CATATATTAG TTGATAACGA TTATCAATGT 
AATTCACAAG GTTATGGGGC AGAAATGATA 
TTCTGGGAGT GGGACAGAAA TGATATTTTC 

10 

TTGCATTGTC TCTAGAAATT GGGAATCCAA 
TTGTAGAGTC TAGTACATTG ATTTGTATCC 

1S TCTAATGATC CTATGACTCA ACTATTAAAT 

TTAAATTCAT TTATTGTAAT ATTGCAAAAA 
AATTAATTAC ATAATAAATT GAACATCTAA 

20 TGAGGGGATT TATTTAGGTG TTGGTTATTT 

CCAATGTGCA AAAAACGCAA CAAGACAGCC 
TAAATTGAAC ATCCGTCATA CACCTCCTCT 

25 

CGACCATCAT ATTATATCAT TTATTTATTA 
GTTTTTTTTA GTGGTTTACG CTACTTTAAT 
TAAATGTGAT GGGTATCGTA ATAATTAAAC 

30 

TTAGCCAGGA TACAAATACA TATAATAAAA 
TAATTGGAAA ACTAATGAAT TTTCTCCAAG 

35 TCAAATAATA TGAAATCACA AAAGCGACTA 

CAATGTGTAA TAATTTTAAC AGCAATAAAT 
CGCCAACAAT GATAAATTTT AAAATTTCAG 

40 TCCTCAACAT CAAAATATAT GCATAACTAC 

ATCCGCTTCA CTTCAAATAT GCTTATTTCA 
CTCTTTCCCC TCATCCTTAT ACGCCATTAT 

45 

GCACTATAGA GATTACTTTA GTTCACTAGT 
AATGAGAGGA TGTCTACTAT GCAATTACAA 
GAAAGTATGA CCGCACAGCA AGTTGGCAAT 

50 

GGGAATACCC TTCATTATGA TATCATT CCG 
GCTTTAATGC ATGCAACAGG TGCCACTAAG 

55 



ATTAAGCCGC TTATATTATT TATTTCACTG 720 

GTTAAAAATT AATACTTATT ACAAGTACAT 780 

CGCGTGGATT TGTGACACAT TTCTTTTAAA 84 0 

AAGAGCCACT AATGATTTAT TATGTAGTGG 900 

ACAAAATTTA TTTCGTCGTC CCACCCCAAC 96 0 

TTTCTCTTTG TTGGGTCCCT GAATATAGCC 102 0 

CAATGTCCCT ATAATTGATT ATTCGCTTTA 10 80 

CATTTTTCGA AATACTTAAT TCTAATATAA 1140 

TACATTGCAC ACCTTGTTCA TCAATGCTAT 1200 

ATACACCAAA TCCCCTCACT ACTGCCATAG 12 60 

GTCACCTTTT TTATTGTTGC GCGTTCGTAA 1320 

GCTTATAGCT GAAGTCATGA TGTTAATTAA 13 80 

CTGCGTTAAA GTAACGCCCG AGATGTTAGG 144 0 

TATTTCACGC AATATTAAGG CTTAAGTAAA 1500 

TGCTATCTTT TAAAATCCAT TTAGATAATA 1560 

CAGCAAATGG TGCAATTTCT GCTGGCAAAT 1620 

CTGTTTGTAA GCTTACGTTG ACAATCTGCG 1680 

TAGGTTTTAC CCTGTAAACA AAATAACAAT 1740 

GAAATCCGGT AATATGACTA ATCATATATT 180 0 

AGACAACATA ATAATTTAAC GTATTAATGC 1860 

CATGCGTTTG TGTTAGTTTC ATATGTGTAc 1920 

GTTCTCGAAC ATACTCGAAT ATGCGAGCCA 1980 

ATCTTTATAC CCTTTCACAG CAAATTTAGT 204 0 

AATGTAACTG ATTTATCGCG TGACTCATTA 2100 

AATTTTATAT ACAATAAGAG CGACAACAGT 216 0 

AAAATTGTCA TCGCTCCTGA CTCATTTAAG 2220 

ATTATAAAAC AGGCTTTTAC TAATGTTTAT 22 80 

ATGGCTGATG GTGGTGAAGG TACCACAGAT 234 0 

TATACAGTCA TCGTTAATGA CCCTTTAATG 24 0 0 
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AAATCTCGCA TATCGTGCAG CGCAACTATT TATTGAGCAA TATCAACTAA AG CAAGGTGT 9660 

AACAATTTCT ATCGATAAAG AAATAC CTGT TTCTGCTGGC TTAGCTGGAG GTTCGGCTGA 972 0 

TGCAGCAGCA ACGTTAAGAG GATTGAAT CG ACTTTTTGAT ATAGGGGCGA GTTTGGAAGA 97 80 

ATTGGCTCTA CTAGGCAGTA AAATCGGGAC AGATATTCCG TTTTGTATTT ATAATAAAAC 9 34 0 

TGCACTATGT ACTGGAAGAG GAGAGAAAAT CGAGTTTTTA AATAAACCAC CTTCAGCTTG 9 900 

GGTGATTCTT GCTAAACCAA ACTTAGGCAT ATCATCACCA GATATATTTA AGTTGATTAA 9 96 0 

TTTAGATAAG CGTTACGACG TACATACGAA AATGTGTTAT GAGGCCTTAG AAAATCGAGA 10 020 

75 TTATCAACAA TTATGTCAAA GTTTGTCTAA TCGATTAGAG CCAATTTCTG TTTCAAAACA 10 0 80 

CCCACAAATC GATAAATTAA AAAATAATAT GTTGAAAAGT GGTGCAGATG GTGCGTTAAT 1014 0 

GAGTGGAAGC GGACCTACTG TGTATGGGCT AGCACGAAAA GAAAGCCAAG CAAAAAATAT 102 00 

TTATAATGCA GTTAACGGTT GTTGTAATGA AGTGTACTTA GTTAGACTAT TAGGATAGAA 1026 0 

GGGTTGAAAA GATGAGATAT AAACGAAGCG AGAGAATTGT TTTTATGACG CAATATTTGA 10320 

TG 10322 
(2) INFORMATION FOR SEQ ID NO; 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5614 base pairs 

(B) TYPE: nucleic acid 

30 (C) STRANDEDNESS : double 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 99: 

GATTGATTAA ATGTTTTAAT CCACTTCAAT GCCTTCGATA AACTCTACAA TCGCGCTATT 6 0 

CATATAATTA TTCGATTTCA TTTGTTCAGC ATATGTCTCA TTAAATCCAG ACATAACTTT 12 0 

TTTAAAwGCG AAAATTGAAA TTGGTATCGT TACTAATAAG GCACTAGCCA TACGCCAATC 180 

AATGAGCATT ATGTATAAAA AGATAGCAGC TGACAAAAGT AAGTTTCCTA TAACTTCAGG 24 0 

AATCATATGT GCTAAAGGTA ATTCTATTGT TTCAACCTTA TCGACAAATA TATTTTTTAA 3 00 

TTCACCTATT TTCTTAGATT CCaCTACGCC TAAAGGGAGA CGCATTAATT TTTGAGCTAA J60 

TTTTTTACGA ATTTCAGATA AAATTTCATA TGCCGTAATA TGTGATAGCA TCGTTGACGC 42 0 

TCCAAAACAA CACACTTGTG AAATATAAGC GATTAAAGCA ATAAAGATAT AAACCATAAT 4 80 

CGAATTAATC GTATATGTAT TGTTAATCAT CATTAAAATA ATTTTAAATA CTGCCCAATA 54 0 
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GCGGATGTAA GGCAAGCTTT AGAAGATGAA TGAGGAAGTG AAAATGTTGG ATAATAAAGA 7 86 0 

TATTGCAACA CCATCAAGAA CGCGAGCGTT GTTAGATAAA TATGGCTTTA ATTTTAAAAA 7 92 0 

5 

AAGTTTAGGA CAGAACTTTT TGATAGATGT GAATATCATT AATAATATCA TTGATGCAAG 7 9 80 

TGATATTGAT GCACAAACTG GGGTGATTGA AATTGGTCCA GGCATGGGGT CATTGACAGA 804 0 

ACAATTGGCC AGACATGCTA AAAGAGTATT GGCATTTGAA ATTGATCAAC GTTTAATACC 8100 

10 

TGTATTAAAT GATACACTAT CACCTTATGA TAATGTGACG GTGATTAATG AAGATATTTT 8160 

AAAAGCGAAT ATTAAAGAAG CTGTTGAAAA TCATTTACAA GATTGTGAAA AAATAATGGT 822 0 

75 TGTTGCAAAC CTGCCGTACT ATATTACGAC GCCAATTTTA TTAAATTTGA TGCAACAAGA 8280 

TATACCAATT GATGGCTACG TGGTGATGAT GCAAAAAGAA GTGGGCGAAC GCTTAAATGC 834 0 

TGAAGTAGGT TCAAAAGCAT ATGGTTCGTT ATCAATTGTC GTACAATACT ATACAGAGAC 84 00 

20 TAGTAAAGTA TTAACGGTAC CTAAATCTGT ATTTATGCCA CCACCTAATG TTGATTCAAT 8460 

AGTTGTAAAA CTGATGCAGA GAACTGAACC GTTAGTAACA GTAGATAACG AGGAAGCATT 8520 

CTTTAAGTTA GCAAAAGCAG CATTTGCACA AAGAAGAAAG ACAATTAACA ATAACTATCA 8 58 0 

25 

AAATTATTTT AAAGATGGTA AACAACACAA AGAAGTGATT TTACAATGGT TGGAACAAGC 864 0 

AGGTATTGAT CCAAGACGTC GCGGTGAAAC GCTATCTATT CAAGATTTTG CTAAATTGTA 8700 

TGAAGAAAAG AAAAAATTCC CTCAATTAGA AAATTAAATG ATTGACAAAG CAAAGCACTA 876 0 

30 

TTGTTAAAAT TTAAATTTTG TTTGACGAAA ACGTTGCAAA TATGGTATTA TGTAACTTGT 8 820 

AGCGAGGTGG AG GAAT ATG C CAAAATCAAT TTTGGACATC AAAAATTCTA TTGATTGTCA 88 8 0 

35 TGTAGGAAAT CGTATTGTAC TGAAaGCCAA TGGAGGCCGT AAGAaAACAA TAAAACGTTC 894 0 

TGGAATTTTA AAAGAAACAT ATCCGTCAGT TTTCATTGTT GAGTTAGATC AAGACAAACA 900 0 

CAAC.TTTGAG AGAGTATCTT ATACATACAC TGATGTGTTA ACTGaAAATG TTCAAGTTTC 906 0 

40 ATTTGAAGAG GATAATCATC ACGAATCAAT TGCACACTAA ATAAGACATA TAGAGATGTT 912 0 

AGACGTTTCT TAGTATAAGA AGTAAATATT ATGATAATTA TTTGAGTGTT GGGcATTATG 9180 

TTCAATACTC TTTTTATTTA CAAAATGTTT AACACTGATG TTTCGCTTAT AGATTTTTCA 924 0 

45 

GTAAATGGAT AATTGTATTT ATAAACACAA ATACAAGTAA ATACTAAGTA ATTAGATGGA 93 00 

GAAAATTACT TTTTTATTAA AAAAACACTA AAAAACAAAT TAAAATGTCA AATATTAATT 93 60 

CTCTTTATGT TAAAATCATC ATATTAAGAT AACGAAAAGA GGGCGGAAAA TGATATATGA 94 2 0 

50 

AACGGCACCA GCCAAAATTA ATTTTACGCT CGATACACTT TTTAAAAGAA ATGATGGCTA 94 80 

TCATGAGATT GAAATGATAA TGACAACAGT TGATTTAAAT GATCGTTTAA CTTTTCATAA 954 0 



55 
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AACCTCAAAT 


TGATATTAAA 


GACTTTGATA 


AAGTTGAAAT 


TAAGGCAGCA 


ACGATTATTG 


6060 


ATGCTGAACA 


TGTTAAGAAG 


TCAGATAAGC 


TTTTAAAAAT 


TCAAGTAGAC 


TTAGATTCTG 


6120 


AACAAAGACA 


AATTGTATCA 


GGAATTGCCA 


AATTCTATAC 


ACCAGATGAT 


ATTATTGGTA 


6180 


AAAAAGTAGC 


AGTTGTTACT 


AACCTGAAAC 


CAGCTAAATT 


AATGGGACAA 


AAATCTGAAG 


6240 


GTATGATATT 


ATCTGCTGAA 


AAAGATGGTG 


TATTAACCTT 


AGTAAGTTTA 


CCAAGTGCAA 


6300 


TTCCAAATGG 


TG CAGTGATT 


AAATAACTGT 


ATTTTTAAAA 


ATTAGGAGAG 


ATAATTATGT 


6360 


TAATCGATAC 


ACATGTCCAT 


TTAAATGATG 


AGCAATACGA 


TGATGATTTG 


AGTGAAGTGA 


6420 


TTACACGTGc 


TAGAGAAGCA 


GGTGTTGATC 


GTATGTTTGT 


AGTTGGTTTT 


AACAAATCGA 


64B0 


CAATTGAACG 


CGCGATGAAA 


TTAATCGATG 


AGTATGATTT 


TTTATATGGC 


ATTATCGGTT 


6540 


GGCATCCAGT 


TGACGCAATT 


GATTTTACAG 


AAGAACACTT 


GGAATGGATT 


GAATCTTTAG 


6600 


CTCAGCATCC 


AAAAGTGATT 


GGTATTGGTG 


AAATGGGATT 


AGATTATCAC 


TGOfJATAAAT 


fi^6H 


CTCCTGCAGA 


TGTTCAAAAG 


GAAGTTTTTA 


GAAAGCAAAT 


TGCTTTAGCT 


AAGCGTTTGA 


6720 


AGTTACCAAT 


TATCATTCAT 


AACCGTGAAG 


CAACTCAAGA 


CTGTATCGAT 


ATCTTATTGG 


6780 


AGGAGCATGC 


TGAAGAGGTA 


GGCGGGATTA 


TGCATAGCTT 


TAGTGGTTCT 


CCAGAAATTG 


6840 


CAGATATTGT 


AACTAATAAG 


CTGAATTTTT 


ATATTTCATT 


AGGTGGACCT 


GTGACATTTA 


6900 


AAAATGCTAA 


ACAGCCTAAA 


GAAGTTGCTA 


AGCATGTGTC 


AATGGAGCGT 


TTGCTAGTTG 


6960 


AAACCGATGC 


ACCGTATCTT 


TCGCCACATC 


CGTATAGAGG 


GAAGCGAAAT 


GAACCGGCGA 


-7 r* -i n 
f V 4. \J 


GAGTAACTTT 


AGTAGCTGAA 


CAAATTGCTG 


AATTAAAAGG 


CTTATCTTAT 


GAAGAAGTGT 


7080 


GCGAACAAAC 


AACTAAAAAT 


GCAGAGAAAT 


TGTTTAATTT 


AAATTCATAA 


AGTTAAAAGT 


7140 


GAGAAAGATC 


ACCGCCATAA 


ATGTAAACGA 


TGCTATATTC 


GTTTAATATG 


CTATGGTTCT 


7200 


TTCTCACTTT 


TTTAAATTAA 


AATATCGTGC 


ATGTGGAATA 


CGTGCGATAG 


AGATGGTTAG 


7260 


AGCTTTGAAA 


TTAAGAATTG 


TAGGAAGGCG 


TTTTAAATGA 


AAATCAATGA 


GTTTATAGTT 


7320 


GTAGAAGGAC 


GAGATGATAC 


TGAGCGTGTT 


AAACGAGCTG 


TTGAATGTGA 


TACGATTGAA 


7380 


ACGAATGGTA 


GTGCCATCAA 


CGAACAAACT 


TTAGAAGTAA 


TTAGAAATGC 


TCAACAAAGT 


7440 


CGAGGCGTTA 


TTGTATTAAC 


AGATCCAGAT 


TTCCCAGGAG 


ATAAAATTAG 


AAGTACAATT 


7 500 


ACTGAACATG 


TCAAAGGTGT 


TAAACATGCG 


TATATTGATA 


GAGAAAAAGC 


TAAAAATAAA 


7560 


AAAGGGAAAA 


TTGGTGTTGA 


ACATGCCGAC 


TTAATTGATA 


TTAAAGAAGC 


GTTAATGCAT 


7620 


GTTAGTTCAC 


CCTTTGATGA 


AGCTTATGAA 


TCAATTGATA 


AATCTGTGCT 


AATAGAGTTG 


7680 
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GTTTTTTAAT GTAAAATAAA TACATTGAAA GTAATAAATA CCTTAACATT GAATAAGATG 426 0 

AAAATGAGAT GACGAGATAA ATGTTCGCGT CCGTTGAAAT GCATAGAAAT CTTAGATATT 432 0 

5 

ATTTGAAGTG AGACATTACG AGGAGGAACA GTTATGGCTA AAGAAACATT TTATATAACA 43 8 0 

ACCCCAATAT ACTATCCTAG TGGGAATTTA CATATAGGAC ATGCATATTC TACAGTGGCT 444 0 

GGAGATGTTA TTGCAAGATA TAAGAGAATG CAAGGATATG ATGTTCGCTA TTTGACTGGA 4 500 

10 

ACGGATGAAC ACGGTCAAAA AATTCAAGAA AAAGCTCAAA AAGCTGGTAA GACAGAAATT 4 560 

GAATATTTGG ATGAGATGAT TGCTGGAATT AAACAATTGT GGGCTAAGCT TGAAATTTCA 462 0 

15 AATGATGATT TTATCAGAAC AACTGAAGAA CGTCATAAAC ATGTCGTTGA GCAAGTGTTT 46 80 

GAACGTTTAT TAAAGCAAGG TGATATCTAT TTAGGTGAAT ATGAAGGTTG GTATTCTGTT 474 0 

CCGGATGAAA CATACTATAC AGAGTCACAA TTAGTAGACC CACAATACGA AAACGGTAAA 4800 

20 ATTATTGGTG GCAAAAGTCC AGATTCTGGA CACGAAGTTG AACTAGTTAA AGAAGAAAGT 4 860 

TATTTCTTTA ATATTAGTAA ATATACAGAC CGTTTATTAG AGTTCTATGA CCAAAATCCA 4920 

GATTTTATAC AACCACCATC AAGAAAAAAT GAAATGATTA ACAACTTCAT TAAACCAGGA 4 98 0 

25 

CTTGCTGATT TAGCTGTTTC TCGTACATCA TTTAACTGGG GTGTCCATGT TCCGTCTAAT 5040 

CCAAAACATG TTGTTTATGT TTGGATTGAT GCGTTAGTTA ACTATATTTC AGCATTAGGC 5100 

TATTTATCAG ATGATGAGTC ACTATTTAAC AAAT A CTGGC CAGCAGATAT TCATTTAATG 5160 

30 

GCTAAGGAAA TTGTGCGATT CCACTCAATT ATTTGGCCTA TTTTATTGAT GGCATTAGAC 5220 

TTACCGTTAC CTAAAAAAGT CTTTGCACAT GGTTGGATTT TGATGAAAGA TGGAAAAATG 52 80 

35 AG TAAATCTA AAGGTAATGT CGTAGACCCT AATATTTTAA TTGATCGCTA TGGTTTAGAT 534 0 

GCTACACGTT ATTATCTAAT GCGTGAATTA CCATTTGGTT CAGATGGCGT ATTTACACCT 54 00 

GAAGCATTTG TTGAGCGTAC AAATTTCGAT CTAGCAAATG ACTTAGGTAA CTTAGTAAAC 5460 

40 CGTACGATTT CTATGGTTAA TAAGTACTTT GATGGCGAAT TACCAGCGTA TCAAGGTCCA 5520 

CTTCATGAAT TAGATGAAGA AATGGAAGCT ATGGCTTTAG AAACAGTGAA AAGCTACACT 5580 

GAAAGCATGG AAAGTTTGCA ATTTTCTGTG GCATTATCTA CGGTATGGAA GTTTATTAGT 564 0 

45 

AGAACGAATA AGTATATTGA CGAAACAACG CCTTGGG TAT TAGCTAAGGA CGATAGCCAA 570 0 

AAAGATATGT TAGGCAATGT AATGGCTCAC TTAGTTGAAA ATATTCGTTA TGCAGCTGTA 5760 

TTATTACGTC CATTCTTAAC ACATGCGCCG AAAGAGATTT TTGAACAATT GAACATTAAC 582 0 

50 

AATCCTCAAT TTATGGAATT TAGTAGTTTA GAGCAATATG GTGTGCTTAA TGAGTCAATT 5880 

ATGGTTACTG GGCAACCTAA ACCTATTTTC CCAAGATTGG ATAGCGAcGG AnAATTGCAT 594 0 

55 
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15 



TGGACTTATG TTCAGGCAAT GGGGTGATAC CCTTGTTATT GTTTGCGAAA CATCCACGAC 24 6 0 

ATATAGAAGG TGTTGAGATT CAAAAAACAC TTGTCGATAT GGCGCGACGC ACATTTCAAT 2 52 0 

TCAATGATGT TGATGAATAT TTAACAATGC ATCACATGGA TTTGAAAAAC GTTACTAAAG 2 580 

TATTTAAACC TTCACAATAT ACTTTAGTAA CGTGTAATCC GCCTTATTTT AAAGAGAATC 2 64 0 

AGCAACACCA ACATCAAAAA GAAGCACATA AGATAGCGAG ACATGAGATT ATGTGTACAC 2 700 

TTGAAGATTG CATGATTGCA GCCCGTCATT TATTAAAAGA AGGTGGCAGG CTAAACATGG 2 760 

TACATCGTGC AGAGAGACTA ATGGATGTCT TGTTTGAAAT GAGAAAAGTG AATATTGAAC 2 8 20 

CTAAGAAAGT CGTTTTTATA TATAGTAAAG TAGGGAAATC AG CACAAACG ATAGTAGTAG 2 8 80 

AAGGTCGAAA AGGTGGAAAT CAAGGTTTAG AAATCATGCC CCCATTTTAT ATTTATAATG 2 94 0 

AAGATGGTAA TTATAGCGAA GAAATGAAGG AAGTATATTA TGGATAGTCA TTTTGTATAT 3 0 00 

ATTGTAAAAT GTAGTGATGG AAGTTTATAT ACAGGATACG CTAAAGACGT TAATGCACGT 3 060 

GTTGAAAAAC ATAACCGAGG TCAAGGAGCC AAATATACGA AAGTAAGACG TCCGGTGCAT 3120 

TTAGTTTATC AAGAAATGTA TGAGACAAAG TCTGAAGCAT TGAAGCGTGA ATATGAAATT 3180 

AAAACTTATA CCAGACAAAA GAAATTGCGA TTAATTAAGG AGCGATAGTA TGGCTGT ATT 3 24 0 

ATATTTAGTG GGCACACCAA TTGGTAATTT AGCAGATATT ACTTATAGAG CAGTTGATGT 3 3 00 

ATTGAAACGT GTTGATATGA TTGCTTGTGA AGACACTAGA GTAACTAGTA AACTGTGTAA 3360 

TCATTATGAT ATTCCAACTC CATTAAAGTC ATATCACGAA CATAACAAGG ATAAGCAGAC 34 2 0 

TGCTTTTATC ATTGAACAGT TAGAATTAGG TCTTGACGTT GCGCTCGTAT CTGATGCTGG 3 4 80 

35 ATTGCCCTTA ATTAGTGATC CTGGATACGA ATTAGTAGTG GCAGCCaGAG AAGCTAATAT 3 54 0 

TAAAGTAGAG ACTGTGCCTG GACCTAATGC TGGGCTGACG GCTTTGATGG CTAGTGGATT 3 6 00 

ACCTTCATAT GTATATACAT TTTTAGGATT TTTGCCACGA AAAGAGAAAG AAAAAAGTGC 3 660 

40 TGTATTAGAG CAACGTATGC ATGAAAATAG CACATTAATT ATATACGAAT CACCGCATCG 3 72 0 

TGTGACAGAT ACATTAAAAA CAATTGCAAA GATAGATGCA ACACGACAAG TATCACTAGG 3 7 80 

GCGTGAATTA ACTAAGAAGT TCGAACAAAT TGTAACTGAT GATGTAACAC AATTACAAGC 3 84 0 

ATTGATTCAG CAAGGCGATG TACCATTGAA AGGCGAATTC GTTATCTTAA TTGAAGGTGC 3 500 

TAAAGCGAAC AATGAGATAT CGTGGTTTGA TGATTTATCT ATCAATGAGC ATGTTGATCA 3 96 0 

TTATATTCAA ACTTCACAGA TGAAACCAAA ACAAGCTATT AAAAAAGTTG CTGAAGAACG 4C2 0 

50 

ACAACTTAAA ACGAATGAAG TATATAATAT TTATCATCAA ATAAGTTAAT CACTTTATCG 4 080 
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GAATGGTTTC TTCGAAGATA TCATACATAC AAAGGTAAAT G7AGAGGATA AACAAATATA 6 60 

TAGTGATTTA AAAAATGATA TTGATCAATA TGCGCAAAAG TTGTCGTTTA ATCAATTAAT 72 0 

TTTGATGTTT GATCAACTGA CGGAAGCACA TAAGAAATTG AmTCAAAATG TAAATCCAAC 78 0 

GCTTGTATTT GAACAAATCG TAATTAAGGG TGTGAGTTAG ATGCCAAATG TAATAGGTGT 84 0 

TCAGTTTCAA AAAGCGGGAA AATTAGAATA TTATACACCT AATGATATAC AAGTAGATAT 90 0 

AGAAGACTGG GTAGTTGTCG AATCTAAAAG AGGCATAGAG ATAGGTATTG TTAAAAATCC 96 0 

ATTAATGGAT ATTGCTGAAG AGGATGTTGT GTTACCTCTT AAAAATATTA TTCGCATTGC 102 0 

TGATGACAAA GATATTGATA AATTTAATTG TAATGAACGA GATGCTGAAA ATGCATTAAT 1080 

ACTATGTAAA GACATTGTAA GAGAACAAGG TTTGGACATG CGTTTAGTCA ATTGCGAATA 114 0 

TACATTAGAT AAATCGAAAG TTATTTTTAA TTTTACGGCG GATGATCGTA TTGATTTTAG 12 00 

20 AAAATTAGTA AAAATATTAG CGCAACATTT AAAAACACGT ATCGAGTTGA GACAAATTGG 126 0 

TGTAAGGGAT GAAGCCAAAT TGCTTGGCGG TATCGGACCT TGTGGTAGGT CGTTATGTTG 132 0 

TTCTACATTT TTAGGGGATT TTGAACCAGT ATCGATTAAG ATGGCTAAGG ATCAAAATTT 13 80 

ATCATTAAAT CCAACTAAAA TTTCTGGTGC ATGTGGTCGT TTGATGTGTT GTTTAAAATA 14 4 0 

TGAAAATGAC TATTATGAGG AAGTACGTGC ACAATTACCT GATATTGGTG AAGCAATTGA 1500 

AACGCCTGAT GGTAACGGGA AAGTAGTTGC TTTAAATATA TTAGACATTT CTATGCAGGT 156 0 

GAAGCTTGAG GGACATGAAC AGCCACTTGA ATATAAATTA GAAGAAATAG AAACTATGCA 162 0 

TTAAGGAGGC ATT ATT A CAT TTGGATCGCA ATGAAATATT TGAAAAAATA ATGCGTTTAG 16 8 0 

AAATGAATGT CAATCAACTT TCAAAGGAAA CTTCAGAATT AAAGGCACTT GCAGTTGAAT 174 0 

TAGTAGAAGA AAATGTAGCG CTTCAACTTG AAAATGATAA TTTGAAAAAG GTGTTGGGCA 1800 

ATGATGAACC AACTACTATT GATACTGCGA ATTCAAAACC AGCAAAAGCT GTGAAAAAGC I86 0 

40 CATTACCAAG TAAAGATAAT TTGGCTATAT TGTATGGAGA AGGATTTCAT ATTTGTAAAG 192 0 

GCGAATTATT TGGAAAACAT CGACATGGTG AAGATTGTCT GTTCTGTTTA GAAGTTTTAA 1980 

GTGATTAATC AAGCACACTC AAATAGTGTT ATAATTATAA ATGAATATGG TTTGGATAAG 2 04 0 

TCTGAGACAA TGCATGTTTC AGGCTTTAAT TGTGTATAAA GTTTTGGTGA TTGCATAAGA 210 0 

GATGGCGGTA CTAAATGTTA TTATTAAGTG TGCACGCAgT ATCaTTAGTT ATAAAATGTA 216 0 

GCTGTTAAAA GTCAAAAATA CATCGAATGT AGTTAGGCAT ATAATATAAA AAGAGTTTTC 222 0 

AATTACTCAA TAGAAAAAGG TTGTCTTCAT AGGAGTTAAA AATGTTAAAA GAGAATGAAC 22 8 0 

GATTTGATCA ACTAATCAAA GAAGATTTTA GTATTATTCA AAATGATGAT GTTTTTTCAT 234 0 
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TCAAGCGGAA CAGCATTATG CACCAGTATT AACGCATTTT TTAGATCCAA GAGGGCAATA 24 0 

TATATTGGAA GTGATTTGTG GCAGTTATGA AGATTTAAAC GTATCTTTTT ATGGTGGACC 3 00 

TAATGCTGAA AGAAAAAGAG CAATCATTTC GCCG AACTAT TATGAACCTA AAGAAAGCGA 3 60 

CTTTGAATTA ACTTTAATGG AAATAGATTA TCCTGAAAAA TTCGTCACTT TAAAACATCA 4 20 

ACATATTTTA GGGACATTAA TGTCTTTAGG TATCGAACGC GAACAAGTTG GAGATATAAT 4 80 

TGTGaATGAA CGAATTCAAT TTGTTTTGAC AAGTAGATTG GAATCATTTA TTATGTTAGA 54 0 

ATTACAACGT ATTAAAGGCG CATCAGTTAA ACTTTATACT ATTCCAGTAA CAGATATGAT 6 00 

ACAATCTAAT GAGAATTGGA AAAATGAAAG TGCaCAGTTA GTTCTTTAAG GTTAGATGTT 66 0 

GTTATTAAAG AAATGATACG TAAATCACGT ACGATTGCGA AACAACTAAT CGAAAAAAAA 720 

CGTGTTAAAG TGAATCACAC TATTGTTGAT TCAGCAGATT TTCAATTACA AGCAAATGAT 78 0 

20 TTAATATCCA TCCAAGGTTT TGGTAGAGCA CACATTACTG ACTTAGGTGG TAAAACTAAA 84 0 

AAAGATAAAA CGCACATTAC CTATAGAACA TTATTCAAAT AGTAATGATT TAAGGAGGAT 90 0 

AACAAATGCC TTTTACACCA AATGAaATTA AGAATAAAGA GTTTTCACGT GTaAAGAATG 96 0 

25 GTTTTAGAAC CTACTGnAGT TGG 983 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 10322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

TTTTGCAAAG CTTATTTTAT GTCAAACAGA TAGTCAATGT GAAACAAAGG TTAGTACATA 6 0 

40 TAATCATCCA GACTTTATGT ATATATCAAC AACTGAGAAT GCAATTAAGA AAGAACAAGT 12 0 

TGAACAACTT GTGCGTCATA TGAATCAACT TCCTATAGAA AGCACAAATA AAGTGTACAT 18 3 

CATTGAAGAC TTTGAAAAGT TAACTGTTCA AGGGGAAAAC AGTATCTTGA AATTTCTTGA 24 0 

AGAACCACCG GACAATACGA TTGCTATTTT ATTGTCTACA AAACCTGAGC AAATTTTAGA 3 00 

CACAATCCAT TCAAGGTGTC AGCATGTATA TTTCAAGCCT ATTGATAAAG AAAAGTTTAT 36 0 

AAATAGATTA GTTGAACAAA ACATGTCTAA GCCAGTAGCT GAAATGATTA GTACTTATAC 42 0 

TACGCAAATA GATAATGCAA TGG CTTT AAA TGAAGAATTT GATTTATTAG CATTAAGGAA 4 80 
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TCAATTGGAA CAGACAATTT TAATGGTAAC TCATTCAAAT ATCGATGCGT CTTATGCAGA 990 0 

GCGAGTCATT TTTATTAAAG ATGGGCGTCT ATATCATGAA ATATATCGTG GTGAAGAAAG 996 0 

TCAATTAGCT TTTCAACAAC GAATAACAGA TAGCTTAGCA CTTGTGAATG GAGGAAGTGT 100 2 0 

CAATATATGA AGTTAAGATT GTTATGnACA TAGTGCGACG TCAATTTATT ACGCAGCGAC 1008 0 

TTGTAATCAT TCCATTCATT TTAGCGGTAA GTGTACTATT CATGATTGAA TATACGCTTG 1014 0 

TGTCAATTGG GTTAAATAGC TACATAAAAC AGAAGAATGA CTTCCTAGTA CCATTTATTA 10200 

TCATAG CTAA TTTTTTTATG GCGCTTTTAA CTTTTATTTT TATTTTCTAT GCAAATCACT 10260 

TTATGATGTC ACAAAGACGA AAAGAGTTTA GCATTTTTAT GACATTGGGC ATGACCAAGA 10320 

AAAGTATGCG TTTAATTGTA GTGATGGAAA CTATCTTACA ATTTGTGATA ATTTCAGTCG 10380 

TTAGTATTGC CGGCGGATAC TTACTTGGTG CGATATTTTT CTTGTTTATA CAGAAAATAA 1044 0 

TGGGCAGTGA AGTTGCGACG TTAAGGTATT ATCCATTTGA CTCTGTAGCG ATGTTTATTA 10500 

CTTTGATTAT CATTGCTGTA TTAATGGGCA TGCTACTTAT ATTCAACTTG TTTAGTATTA 10560 

ATTTTCAACG GCCGATAACT TATCAACATC GTTCCGATTC TAGTGTCATA TCACGATGGT 10620 

25 TGCGTTACGT TTTAATTGTT ATAGGAAGCG CAnACTATAT TTAGGTTACT TTATTGCATT 106 80 

ACAACAAGAT ACGACGTTTG GTGCCTTTTT TAAAATATGG ATTGTCATAG GATTAGTTAT 10740 

TAT CGGTACT TATGCATTTT TTGTAGGTAT AAGTGAAATA ATTATTAGTA TATTGCAGCA 10800 

GGTATCAAAA GTTTACTATC ATCCACGGTA TTTTTTTGTG GTAGTTGGGA TGCGTGTACG 10 860 

TCTTAAAATG AATGCAGTCA GTCTTGCAAC AATCACTTTG CTGTGTACAT TTTTGATTGT 10 920 

AACGCTCACA ATGACATTAA CAACCTATCG TGATATGAAT CATAC CATTA CGAAATTGAT 10 980 

TACGAATGAT TakGATTTGT CATTTAGCGA CAATTCTAAG TCACAAaTAG AACGTCAACA 11040 

AACAATTGAG 11050 
(2) INFORMATION FOR SEQ ID NO: 97: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CGACATAACG AGGCAAGGGT ACATGATACT TTAGCCTCGT TTTTGATATG TATTTTTCTG 6 0 

AATATAAGGG CAATAGATGG TATTTTATAw TTTTTTTAAG GTAGTGATTA ACATAGATAT 12 0 
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ACAAAACATA CAGCTATCTT TGACTGAATT ACAAATATTA AAGTTATTAT TTCAAAATGA 8100 

AGaTAAATAT GTAAGTAGrA CTGCTTTAAT TGaAAAATGT TGGGaATCAG AAAACtTCAT 816 0 

5 AGATGATAAC ACATTAGCTG TTAACATGAC GCGCCTGCTG AAAAAATTAA ATACTATTGG 82 2 0 

CGTTAATGAT TTTATCATTA CAAAGAAAAA TGTCGGATAT AAAGTATAGG GTGAATGCAA 82 8 0 

TGACCTTTCT TAAAAGTATT ACTCAGGAAA TAGCAATAGT CATAGTTATT TTTGCTTTGT 834 0 

10 

TTGGCTTAAT GTTTTACCTG TATCATTTGC CATTAGAAGC ATATTTACTA GCACTTGGCG 84 0 0 

TTATTTTATT ATTATTACTC ATATTCATAG GTATTAAATA TTTAAGTTTT GTAAAAACTA 84 6 0 

TAAGCCAACA ACAACAAATT GAAAACTTAG AAAATGCGTT GTATCAGCTT AAAAATGAAC 852 0 

15 

AAATTGAATA TAAAAATGAT GTAGAGAGCT ACTTTTTAAC ATGGGTACAT CAAATGAAAA 8580 

CACCCATTAC TGCAG CACAA CTGTTACTTG AAAGAGATGA GCCTAATGTT GTTAATCGTG 8 64 0 

20 TTCGTCAAGA GGTTATTCAA ATTGaTAACT ATACAAGTTT AGCACTTAGT TATTTAAAGT 8 70 0 

TATTAAATGA AACTTCTGaT ATTTCTGTCA CTAAAATTTC GATTAATAAT ATCATTCGCC 8760 

CAATTATTAT GAAATATTCA ATACAGTTTA TTGATCAAAA AACAAAAATC CATTATGAAC 882 0 

25 CTTGTCATCA CGAAGTATTA ACTGACGTTA GATGGACCTC TTTAATGATA GAACAATTAA 8880 

TAAATAATGC ACTTAAGTAT GCGAGAGGTA AAGATATATG GATTGAATTT GATGAGCAAT 8 94 0 

CCAATCAATT ACACGTAAAA GATAATGGTA TCGGTATTAG TGAAGCGrAC TTGCCTAAAA 90 0 0 

30 

TATTTGATAA GGGCTATTCA GGTTATAATG GCCAGCGCCA AAGTAACTCA AGTGGGaTTG 90 6 0 

GTTTATTTAT CGTAAAACAA ATTTCAACAC ACACAAACCA TCCTGTTTCA GTCGTATCTA 9120 

AACAAAATGA GGGTACAACA TTTACGATTC AATTTCCAGA TGAATAAAAA CTTTCAATAT 9180 

35 

TGTAAGTATA CTAGTAACAT TTTTTTACTA ATTTAAATGT TATTAGTATT TTTTTGTTTT 924 0 

AAT^TAGAAC TAACAAAGAA ATGAGGTGCA TGCCATGTTG CTAGAAGTGn AACATGTAAA 93 00 

40 AAAGGTTTAT GGTAAAGGTT TGAATG CTAC GACAGCACTT AATCAAATGA ATTTATCAGT 9360 

TGGAGCTGGT GaATTTGTTG CaATTATGGG TGAGTCTGGG tCAGGGAAGT CTACACTACT 94 2 0 

AAATTTAATT GCtTCTTTTG ATGGACTAAC TGAAGGTGAC ATTATTGTGG ATGGCGCACA 94 8 C 

45 TTTAAATAAT ATGAAAAATA AAAGTAAAGC ATTGTATCGT CaACAAATGG TAGGTTTTGT 9 54 0 

TTTTcAAGAT TTTAATCTTT TACCAACAAT GACGAATAAA GAAAATATAA TGATGCCATT 9600 

AATTTTAGCT GGTGCTAAAC GAAAAGATAT AGAACAAAGG GTACATCAGT TGGCAGTACA 9 66 0 

50 

ATTACATTTA GAGGGATTCT TAAACAAGTA TCCTTCTGAA ATCTCTGGGG GTCAGAAGCA 9 72 0 
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ATTGAATGGC GTCATCGAAT GCTTTTTCAA AACCTTCCAT TTCAGACATA ACGCCTGTAA 63 0 0 

TATCGTTGGA ATGCGCTGAT TTATCTATAG AAGCACCTTC GACCATTAAA AAGAATCCTT 63 6 0 

TTTTATTGCG CTCAAGCTTA CTAAGTGCAC TTTGTTGCAT ATCAGCTAAT GATGGTTCGT 64 2 0 

CTTTAGAAGC ATCTATTGCA AGTGGCATAT TTTTATCTGC AAACAAACCA AGAACTTTAT 64 80 

CTTTATCAGA TTTTGATAAC TCCTTACTGT TCGTGGCAAG GTCGTAACCA TCTTTTTTGA 654 0 

ATTTTTTATC TAAATTGCCA TTACTTTTAC CGAAATATTT AGCGCCGCCG CCTAATAAAA 6 60 0 

CATCAACTTT ATGCTTTCCG TTGATTTTAT CTTTATAAAA TTGTTTAGCG ATTTCGTTTT 666 0 

TATCATCTCT AGAAGTCACG TGTGCAGCAT ATGCTGCTGG TGTTGCATCT GTTAATTCAG 672 0 

CTGTTGAAAC AAGACCAGTC GACTTACCTT TTTCTTTTGC ACGTTCAAGC ACCGTCTTTA 6780 

CTTTCTGCTT GTTACTGTCA ACACCGATGG CACCATTATA TGTCTTATGA CCAGAACTAA 684 0 

AGGCTGTTCC GCCAGCTGCA GAATCAGTAA TATTCTGTTT TGGGTCATTT GAATATGTAC 6900 

GATTTGTGCC TTTTAAATAT GAATCAAAAG CAGTAGGGGT CATTTCTTTA GCATGCGGAT 696 0 

CATTTTTATA ATAACGATAA GCTGTGTTAA ATGATGGACC CATGCCATCG CCAACTAAAA 7 02 0 

25 AGATAACATT TTTTGGATTT TTAGTATTAC CAACCGCGAA ACTTTCATCT TTAGAACTTT 7080 

TATCGGATTG CGCAATTGCA GGTGTGACAG AACTAAAAAC CGTTGACACG ATAATAAGGT 714 0 

TAGCAACTGC AAATTTTGTG GCTTTTTTAA CTGATAACAT AAGACATCCT CCTGAGTATA 7200 

TGACTATGTC TTCAGTGTAA AAGAGGAATT TtGAGCAATT ATGTAGTTTT AGTTAnAAAT 726 0 

ATGTAAACAG AGTGATTTAG AATAACAAAA aATGAATATA TATGACAATT TGTTATAGAA 732 0 

AGCGTTAGAA TAGAAGCGTG TGAAAATATA GAATTAAATA TAATTTGAGG TGGAAAAATG 73 80 

ATACTAGTAA TGTTATCTCC ATTATTAATC ATATTCTTTA TAGTGTTGTC TATTTTAGAA 744 0 

GAGCGTAAAC GTACGAAGAA AAAGCAACTC GAGAAAGAAA AAGCAAATAC ACTAAATCAA 750 0 

AATACAAATG ACACGGAAAG TTCAAATCAA GAGCCGTCAT TGCAGCAGGA TAAAGAACAA 756 0 

AAAGATAACA AAGGATAATT CAATTGAAGG AAGAAGATTA TAGATGAAAA TATTAATTGT 762 0 

TGAAGATGAT TTTGTTATAG CAGAGAGTTT AGCATCTGAA CTTAAAAAAT GGAATTACGG 76 80 

TGTTATTGTC GTTGAACAAT TTGATGATAT ACTGTCTATC TTTAACCAAA ATCAACCTCA 774 0 

GCTTGTATTG CTAGATATTA ATTTGCCAAC GTTAAATGGT TTTCATTGGT GTCAAGAAAT 78 00 

CCGAAAAACA TCTAATGTGC CAATTATATT TATTAGTTCC CGTATTGATA ATATGGACCA 7 860 

AATTATGGCA ATACAAATGG GGGGAGATGA TTTTATCGAA AAGCCATTTA ACTTGTCATT 7 92 0 

AACGATTGCC AAAATTCAAG CATTATTGAG ACGAACTTAT GACTTGTCAG TAGCTAATGA 7 980 
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ACCAATTTAA AAATAGAATC TATTATGAGC ATCCAAATGT AGCTAGTATT AAATTTGGTG 2700 

ATATTACTAA AACATATGTA GTATTAGTAG AAGGGCATTA CGACAATACA GGTAAGAACT 2760 

TAAAAACTCA GGTTATTCAA GAAAATGTTG ATCCTGTAAC AAATAGAGAC TACAGTATTT 2820 

TCGGTTGGAA TAATGAGAAT GTTGTACGTT ATGGTGGTGG AAGTGCTGAT GGTGATTCAG 2880 

CAGTAAATCC GAAAGACCCA ACTCCAGGGC CGCCGGTTGA CCCAGAACCA AGTCCAGACC 294 0 

CAGAACCAGA ACCAACGCCA GATCCAGAAC CAAGTCCAGA CCCAGAACCG GAACCAAGCC 3000 

CAGACCCGGA TCCGGATTCG GATTCAGACA GTGACTCAGG CTCAGACAGC GACTCAGGTT 3060 

CAGATAGCGA CTCAGAATCA GATAGCGATT CGGATTCAGA CAGTGATTCA GATTCAGACA 3120 

GCGACTCAGA ATCAGATAGC GACTCAGAAT CAGATAGTGA GTCAGATTCA GACAGTGACT 3180 

CGGACTCAGA CAGTGATTCA GACTCAGATA GCGATTCAGA CTCAGATAGC GATTCAGACT 3 240 

CAGACAGCGA TTCAGATTCA GACAGCGACT CAGATTCAGA CAGCGACTCA GACTCAGATA 3300 

GCGACTCAGA CTCAGACAGC GACTCAGATT CAGATAGCGA TTCAGACTCA GACAGCGACT 3360 

CAGACTCAGA CAGCGACTCA GACTCAGATA GCGACTCAGA TTCAGATAGC GATTCAGACT 3420 

25 CAGACAGCGA CTCAGATTCA GATAGCGATT CGGACTCAGA CAGCGATTCA GATTCAGACA 34 8 0 

GCGACTCAGA CTCGGATAGC GATTCAGATT CAGATAGCGA TTCGGATTCA GACAGTGATT 3 54 0 

CAGATTCAGA CAGCGACTCA GACTCGGATA GCGACTCAGA CTCAGACAGC GATTCAGACT 3 600 

CAGATAGCGA CTCAGACTCG GATAGCGACT CGGATTCAGA TAGCGACTCA GACTCAGATA 3 660 

GTGACTCCGA TTCAAGAGTT ACACCACCAA ATAATGAACA GAAAGCACCA TCAAATCCTA 3 720 

AAGGTGAAGT AAACCATTCT AATAAGGTAT CAAAACAACA CAAAACTGAT GCTTTAC CAG 3 780 

AAACAGGAGA TAAGAGCGAA AACACAAATG CAACTTTATT TGGTGCAATG ATGGCATTAT 3 84 0 

TAGGATCATT ACTATTGTTT AGAAAACGCA AGCAAGATCA TAAAGAAAAA GCGTAAATAC 3 900 

TTTTTTAGGC CGAATACATT TGTATTCGGT TTTTTTGTTG AAAATGATTT TAAAGTGAAT 3 960 

TGATTAAGCG TAAAATGTTG ATAAAGTAGA ATTAGAAAGG GGTCATGACG TATGGCTTAT 4 020 

ATTTCATTAA ACTATCATTC ACCAACAATT GGTATGCATC AAAATTTGAC AGTCATTTTA 4 080 

CCGGAAGATC AAAGCTTCTT TAATAGCGAT ACAACTGTTA AACCATTAAA AACTTTAATG 414 0 

TTGTTACATG GATTATCAAG TGATGAAACG ACATATATGA GATATACAAG CATAGAAAGG 42 00 

TATGCGAATG AACACAAATT AGCTGTGATT ATGCCCAATG TGGATCATAG CGCATATGCT 4260 

AACATGGCAT ATGGTCATAG CTATTATGAT TATATTTTGG AAGTGTATGA TTATGTTCAT 4 320 

CAAATATTTC CACTTTCCAA AAAGCGTGAT GACAATTTTA TAGCAGGTCA CTCTATGGGA 4 3 BO 
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TGAAAAACTT GTTGTTAAAG ATCATAAAAA TTGGTTAGTA AGCAAACATT TATTCAATGA 90 0 

TGTATGTGTT TAATATACAA TGTAAAATGA ATAAGTTGAA CATGAGGTCT AACGTACATT 96 0 

5 TATACGTTAG GCCTTTTTTG CTAGCATGAT GAATAATTTA AAATGTTAGT TAAATTTGAT 102 0 

TGTTGAAATT ACAGTAAAAT TTAAGGTGAT GAAAAATTTA GAACTT CTAA GTTTTTGAAA 108 0 

AGTAAAAAAT TTGTAATAGT GTAAAAATAG TATATTGATT TTTGCTAGTT AACAGAaAAT 114 0 

10 

TTTAAGTTAT ATAAATAGGA AGAAAACAAA TTTTACGTAA TTTTTTTCGA AAAGCAATTG 120 0 

ATATAATTCT TATTTCATTA TACAATTTAG ACTAATCTAG AAATTGAAAT GGAGTAATAT 126 0 

TTTTGAAAAA AAGAATTGAT TATTTGTCGA ATAAGCAGAA TAAGTATTCG ATTAGACGTT 1320 

15 

TTACAGTAGG TACCACATCA GTAATAGTAG GGGCAACTAT ACTATTTGGG ATAGGCAATC 1380 

ATCAAGCACA AGCTTCAGAA CAATCGAACG ATACAACGCA ATCTTCGAAA AATAATGCAA 144 0 

20 GTGCAGATTC CGAAAAAAAC AATATGATAG AAACACCTCA ATTAAATACA ACGGCTAATG 150 0 

ATACATCTGA TATTAGTGCA AACACAAACA GTGCGAATGT AGATAGCACA ACAAAACCAA 156 0 

TGTCTACACA AACGAGCAAT ACCACTACAA CAGAGCCAGC TTCAACAAAT GAAACACCTC 1620 

25 AACCGACGGC AATTAAAAAT CAAGCAACTG CTGCAAAAAT GCAAGATCAA ACTGTTCCTC 16 80 

AAGAAGCAAA TTCTCAAGTA GATAATAAAA CAACGAATGA TGCTAATAGC ATAGCAACAA 174 0 

ACAGTGAGCT TAAAAATTCT CAAACATTAG ATTTACCACA ATCATCACCA CAAACGATTT 18 00 

30 

CCAATGCGCA AGGAACTAGT AAACCAAGTG TTAGAACGAG AGCTGTACGT AGTTTAGCTG 1860 

TTGCTGAACC GGTAGTAAAT GCTGCTGATG CTAAAGGTAC AAATGTAAAT GATAAAGTTA 1920 

CGGCAAGTAA TTTCAAGTTA GAAAAGACTA CATTTGACCC TAAT CAAAGT GGTAACACAT 19 80 

35 

TTATGG CGGC AAATTTTACA GTGACAGATA AAGTGAAATC AGGGGATTAT TTTACAGCGA 2 04 0 

aGTTACCAGA TAGTTTAACT GGTAATGGAG ACGTGGATTA TTCTAATTCA AATAATACGA 2100 

40 TGCCAATTGC AGACATTAAA AGTACGAATG GCGATGTTGT AGCTAAAGCA ACATATGATA 2160 

TCTTGACTAA GACGTATACA TTTGTCTTTA CAGATTATGT AAATAATAAA GAAAATATTA 2 22 0 

ACGGACAATT TTCATTACCT TTATTTACAG ACCGAGCAAA GGCACCTAAA TCAGGAACAT 2 2 80 

45 ATGATGCGAA TATTAATATT GCGGATGAAA TGTTTAATAA TAAAATTACT TATAACTATA 2 34 0 

GTTCGCCAAT TGCAGGAATT GATAAACCAA ATGGCGCGAA CATTTCTTCT CAAATTATTG 24 00 

GTGTAGATAC AGCTTCAGGT CAAAACACAT ACAAGCAAAC AGTATTTGTT AACCCTAAGC 24 6 0 

50 

AACGAGTTTT AGGTAATACG TGGGTGTATA TTAAAGGCTA CCAAGATAAA AT CG AAG AAA 2 52 0 
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GATATACATT GAATGTGTTA TATGATCGTT ATCAGTTACC ACTTTTTATT GTGGAAAATG 1860 

GTTTTGGTGC AGTTGATGAA GTGGTAGATG GACATATTCa TGATGATTAT CGCATTGAAT 1920 

ATTTAAAAGC ACATATTACA GCAGCGATAG AAGCAGTTGA TCAAGATGGT GTAGATTTAA 1980 

TCGGTTATAC ACCGTGGGGA ATCATTGATA TTGTTTCATT TACAACCGGT GAAATGAAGA 2 04 0 

AACGCTATGG TTTAATATAT GTTGATCGAG ATAATGATGG TCATGGCACG ATGGAACGCT 2100 

TGAAAAAAGA TTCGTTCTAT TGGTATCAAC AAGTGATAGC ATCAAATGGA GATAAATTAT 2160 

AAAGG TAT AT TATAAGTATT TTAGGGTTAG AGCCCGAGAC ATAAATTAAT ATAGTAGGAC 2220 

CTACAGTGTT ATAATGGCGG gCCCCCAACA CAAAGAATTT CGAAAAGAAA TTCtAcAGGT 22 80 

aATGCaAGtT GCCGGGGcCC AACACAGAGA AATTCGAAAA GAAATTCTAc AGGTAATGCA 2 34 0 

AGTTGGGGAA GGACAGAAAT AAATT 2365 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 11050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

30 

CTGCGATACG ATTTGTTGAA AGTGGGGAAA ACAAAAAAGT TATCATTACC AATTTAGAGC 6 0 

AGGCATACGA AGCTTTGATT GGTAATAAAG GTACACACAT TCACATGTAG CACTTTATCA 12 0 

35 CGCGACAAAA CATTAAATAT GTTTCTCCGT TGATTCAAAT GAAAAAGTTG TCTGCTGACA 180 

CTTTGCAAGG TTTGAAGGAG TTTAACTTAT GACAGAAAAC TTTATTTTGG GTAGAAATAA 24 0 

TAAATTAGAA CATGAACTAA AGGCATTAGC AGATTACATT AATATAC CAT ATAGTATATT 3 00 

40 ACAACCATAT CAAAGTGAAT GTTTTGTCAG ACATTATACG AAAGGCCAAG TTATTTATTT 3 60 

TTCGCCACAA GAAAGTAGCA ATATTTACTT TTTAATTGAA GGTAACATTA TTAGAGAACA 420 

TTACAATCAA AATGGAGATG TATATCGTTA TTTTAATAAA GAGCAAGTAT TATTTCCAAT 48 0 

45 

CAGTAACTTA TTTCATCCGA AAGAGGTTAA CGAATTGTGT ACAGCATTAA CCGATTGTAC 54 0 

AGTTCTTGGA TTGCCTAGAG AATTGATGGC CTTTTTGTGC AAAGCTAATG ATGATATATT 6 00 

TTTGACACTT TTTGCATTAA TAAATGATAA TGAGCAGCAA CACATGAACT ATAACATGGC 660 

50 

ATTAACAAGT AAATTTGCTA AAGATCGAAT TATCAAATTG ATATGCCATC TATGTCAGAC 72 0 

AGTAGGATAC GATCAAGATG AATTTTATGA AATCAAACAG TTTTTAACTA TTCAACtCAT 78 0 

ss 



TGATACGAAt GCATTACAAT TCATATGCAA 
AAACAAATAG TTGATGTGAC ATCTGTAGCA 
5 ATGGAAATGT CGATGAAGTG TATCAGCAAT 

AAGAGAGTGA GAATACTGAC AATATTAATA 
AAGTAACAAC ACCAATTTTA GTGAAAGCAC 

10 

AAGTAAGAGA TTCAATTTTT AGAGAGAAAA 
ATGAAGAATC GAAAGTAATC GCACCGTTCA 
AGCATGCAGT TGGTATTCAA TCAGAAGACG 

15 

ATACAGTTGA CTTGGAAGGT AAAGGGTTCA 
AAGCAGGGCA AACGTTGTTG CAATTCGACC 

20 CTGACGTTAT TGTCGTTATT AGCAACTCTG 

ATGAAATCAT TACGACTGAA GATGTTATTT 
TAATAATGAC AAAATTACCG CAAAATTTCA 

25 TTGAAGGTGG ATATGATAAA GGTGGTAAAG 

GTGCACATGG CAAAGCACGT CAGATTACAG 
ATCATGAAGG TATTGATTTT TATCATCGTT 

30 

TGGGATTGAA ATGTTTACGT ACGTCGATTG 
AAGATGTGCC AAATGAAGAA GGACTCGCCT 
CACAAGGTAT TGAACCTGTT GTGACGTTAT 

35 

AACATTATGG TGGATTTAGA AATAGAGAAG 
TTGTATTTGA AAGATATAAA GATAAAGTTA 

40 ATCAGATGGA CACATCAAAT CCTATCTTTT 

AAAATGATAA TCCTGAAGAA GTCyTGTATC 
CyTTAGCAGT TCGTCTTGGT AAAGaGATtA 

4 * CAmaTGTACC CmTTTATCCa TAwTCGTGTC 

CGAATCGCTT ACGTTTCTTT TTCCCGGATG 
CTAAAAAAAT GTTGGCACGA AAAGGATATG 

50 

TTTTACAGCA GGGCACGGTT GATTATATTG 
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CATACAATTC CTTCTACAGC AAATGAAGTG 6 0 

GAAAATGATA CGCATTAGTC ATAAAATTAA 12 0 

TACAGCGATT AATTAAGAAT GCTAATGTCG 180 

GTCAAGATAC AAGTTATACA CCTCAAGTAA 24 0 

CAATCGCTGG TCGTCGTATT TTACTTAAAG 300 

TGGTAGGTGA AGGCTTAGCA ATCAAAGCTC 36 0 

ATGGTTTAAT ATCTATGATT GTACCAACTA 42 0 

GTGTGGACAT AGTCATTCAT ATTGGCGTGA 4 80 

AGTGCTTTGT AAAGCAAAAT GATCATGTTG 54 0 

AGCAATATAT ACAACAACAA GGCTACAATG 6 00 

CCGATTTAGG AAAAGTAGAA CTGACAATGA 66 0 

TTAAAATATT TAAAAACTAG GAGTGTGTTG 72 0 

TGTGGGGTGG CGCTCTTGCC GCAAATCAAT 78 0 

GGTTAAGTGT AATTGATGTT ATGACGAGTG 84 0 

AATCTATAGA TCCCAATCAC TATTATCCAA 90 0 

ATAAGGAAGA TATTGCCTTG TTTAAAGAAA 96 0 

CGTGGACACG TATCTTTCCG AATGGGGATG 102 0 

TTTATGATCG TATCTTTGAT GAATTAATTG 108 0 

CACATTTTGA GATGCCACTT CATTTAGCGA 114 0 

TTGTCGATTA TTTTGTGCAT TTTGCGCGTG 120 0 

CATATTGGAT GACGTTTAAT GAAATTAATA 12 60 

TATGGACGAA TTCTGGGGTA GCATTGACAG 132 0 

AAGTAGCACA TCATGAACTT TTAGCCAGTG 13 8 0 

ATCCgAaGTT TAAGATTGGr ACmATGATTt 144 0 

ATCCGAAAGA TATGATGGAA GCACAAATTG 150 0 

TCCAAGTGAG AGGTTATTAT CCAAGCTATG 156 0 

ATGTTGGATG GCAAGAAGGG G A CGACAGT A 162 0 

GCTTTAGTTA TTACATGTCT ACGGCTGTAA 16 SO 
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CCTTTACCTA GCTATTTTCC GCCTTTTGCA AATTTACTAC CATTTTCTAT AAACACATTA 22 8 0 

CCTGATGTAC GTTTGACTTC CACAAATGAA TTTGGACCTG CTGGGCCTTT CACTCCACCT 234 0 

GCTGTATTGa TAAATACACC GAATTTACTT GcATTTATAC CGTCTTGCTC TAAAAGTGTT 24 0 0 

GACGTAATAT CTAATCCTAT ATCTCTTTTA ATACTGTCTT TATTGTCATT TATATATTTC 24 6 0 

AATATACTTT TCGGGATATC GTCTTCTGGA TGTTCTTTGG CATATGCCTT TATAACAGCA 2520 

AAGTCTGCTT TATTTAAAGT TTCTTTCTCT GCTTTATGTT CAATTTTCCC CATAGCAACT 258 0 

TTCAAATATT TTTCATGACT TGCTTTGGCC CAATCAAGTT CTTTACCTGA AGGAATATTA 264 0 

AATTGATTTG TTGAAAAGTT CCAAAAATTC TGCGCTTGGG TAAGTCCTTG TTGGACAATT 270 0 

TTTTGAAATT CTTCAACTTC TTTAAATATT TCTGGTGATT TTTGATTAAA CTCACGCAAT 276 0 

TTGCGTAGCT TCTCTTCTAA TTCATGTTTT TGTTGACCTA ATGTTCGTAT TATTTGTTGG 2820 

20 TTCGATGAAA TGGCTTGCTG ATTATCGGAA GCATGCTTTT TCAAATTGTT ATTCAAATTT 2 880 

TCATATCGCG TAATTTGTTG ACTTAATGAT CTGATATCTT CTTCAAGCTC TGATTCTTTT 2 940 

AAAGATATGC TATCAACCTC ACTCGTATAA CGTGACACAA AATTaTCGCA AGCTTGCTTC 3000 

GTTAAATCAC TCAATGTTTT CATACTTGTT GATAATGGAA TTAACACCGT ACTAAAAAAT 306 0 

TGCTTAGCTG ACGTATACGC TTTCCCTTTA AGCGCATCAT CATTAATAAA TTGAGTAATT 3120 

GCTTTTTCCA ACGCATCATA ATTTGAATTC ATTGTTTGAC TCAAATTCCC CACACTTGAA 318 0 

GCTTGGTTTC GAGATCTGTC TAAATACATG TCAATACTCA TCGGCATGCT CCTTTTTCAA 324 0 

AAATATATGA TTTTCAAACT ATTTAAAATC AAATGCTTTT TACATCTACA AAGTTGTAAA 3 3 00 

ATTTTAAAAC TCGGCGATGA TTATTTCTTA TGTAAAGGAG TCTAGATGCA GGTAAATTGA 3360 

GATAACATGT CGCCTTTTTT CTTATTTTAG CATATGGATA TAATGGTGTC TTTGTATATT 34 20 

CGCAATTAAT CAATAAAAAT TATCTTTCAA TATTTTAATT TTATTGCGAC AACATCCTTA 34 80 

40 ACATTAAATA TATTAATATC TCAAAATATA TTCACTATTA AAATATGTCA TCAGTTGTTA 3 54 0 

AAAGTATTTC CTCATCATGC GAAATATCAA AACGTATCTA AAATACGAAT AAGTTTATAC 3 6 00 

AATCACACAA CATCATCATT CAAAATTTTA TTG 3 633 

45 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2365 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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TTATATGACC TTAAATATAT AACATGAATC TTTTTGTCTA TTATTGAAGA CATATTTATA 4 80 

AAGAAAAATA GCATTGTCAT AATAACCCAA GCAATAAATA CTATAATATT TTGGATAGAT 54 0 

AAACTAATCA rrACATCTAA GAACATGATT gATAATCCAC CACAGAAAAA ATAAGAAAAT 6 00 

AGTACAAAGC AAAGATTCTT GAATGATGGA AAAATCATAA TTTTTCCATT GCTACTCCGA 6 60 

TCATTATAGA TAGATAACTT TACTTTCTGA TTTAAATATA TATAAAACAC TAGAATACTT 72 0 

AATAATAAAA CCGAACAAAT GATAATAACG CAATTTTTTT CTAAATGAGA ATCAGGTATA 78 0 

TATATTTTAT CTCTAAACAT AGTGCCAAAT AAAAGTATGC TACCTATAGC TGGCCATAAA 84 0 

15 GCTTTaTTTT TAACTGGTTT GACAATATTT AAATTATCAA AATCTTCTCT GCTGATTTGG 90 0 

ACATATTTTT TTGGTATTAA CCAATTAATA AACGGAAAGA ACAAAACTAA CCAGGTGCTT 96 0 

ACTAAATCAA TCATCAGATA GTCGTTTTTA TATTTAATAA TTCTATATCT GGGATTTTTG 102 0 



w 
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ATACACTTTC AGATACTTTA TAAGTGTTTT GTATTTTAGT AACATACTAT TTTCCTGTTT 114 0 

ATTACTTAAC TTACGAACTA CAATCTAAGT TTAGTAATTT CTATTGCTTT TTAAGTTTGG 12 00 

CATAAACCTT TTTATTACTA ATTGAGCCCA TGCTTATTAG AAAGAAAAAA ATTGTAATAA 126 0 

TAATCCACAT AATAAATACC AGTAGATTTT GAGGTTTTAT AGTCATTAGC CATATTAAAA 1320 

ATAATATAGA ACAACCTCCT AATAATAGAT ATGTGAAAAC TATAAAACTT CCATCTTTAA 13 8 0 

AAGTAGGCAC TAATATAACC CTATTTTCAT TATCTAGATT ATCATCATAT ATCTTTAGTT 144 0 

TAAGCTTTTT ATTTAAGTAA ATGTAAAATG CTGCAATACC TATAAATCCT ATAAAACATA 150 0 

35 AAGATATTAA AATCTTATTA TCTAATTGAA CTTCAAACGT ATGTACATAT TTCCGTAAAA 15 6 0 

TAACTACAAA TAAAAACGAA CTACCAGTAA CTGGCCAGAA AATATTATTT TTATTTTGTT 16 2 0 

TATCAACATT TAAATTTTCA AGTTCCTTCT CACTAAGTTT TGCATACCTT TTGGGAATGA 16 8 0 

40 

ACCAATTAAT AAAAGGAAAA AAGTATACAA GCCAAGTGCT TACTAAATCA ATTAACAAAT 174 0 

ACTCATCATT ATATTGAACG ACTTTATATC TCGGATTTTT ATTAATAACC T7AATATTAA 18 00 

AAAGCAAAAC TCACCACGCC CATTTCATTG GATTTATATG ATTGCTAATA ATATTTTTAG 18 6 3 

45 

CTTCACTAAC AGCATTCCCA ACACTATCCA TGGATTTTTC TGTAGTTTTT TTAACAACAT 192 0 

CTATACTATT ATCGATTTTA TGCCCTACCC AGTCTACTTT ATCTTTTAAT CCAAAAATAT 1980 

so TATTTTGATA AATTAAATCT GTTCCTAATG CAAATACTGT ACTCATAGCC AAACCTGCTA 2 04 0 

AAATrArrrA TTTACTOOA TTACTTCCTA AAACAAAAGT CGCTAATCCA OCTCCAAOTG 21 CC 
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GAAAATATTT TAATTGCCTT AGATCGCTTA AGTAATTATG CAGAACTACG TTTAAGTGTA 594 0 

GATACTAGTA ATATCGAGGC ACAAGTATTG AGCGCTAAAT TATCTACTAC ATACGGTAAA 6000 

ATTGTTAAGC CAATTATCCT TT 6022 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CCATCAATAA TGTATACATG ATTGGCATCA TATTCCCCTT TAATTAGAGA GCTACGTACA 6 0 

GTTTGTyTTA TTAAAGTAGA ACTAATAAAT AACCATCTCT TATGTGCACA AACACTTCCC 12 0 

GCAACAATTG ATTCAGTTTT ACCAACCCGT GGCATACCTC TAATGCCAAT CAACTTATGA 180 

CCTTCTTCTT TGAACAATTC AGCTAAAAAG TCTACTAACA AGCCTAAATC TTCACGCTCA 24 0 

AATCGAAAGG TTTTCTTATC TTTTGCATCT TGCTCAATAT ATCTTCCATG TCTTACTGCA 300 

AGACGGTCTC TTAATTCTGG TTTTTTAAGC TTTGTTATTT CAATTTCATT TATACCACGA 360 

GCTATTTGCT CAAAACGTTC AACTTTTTCA AGATTGTCTG TTTTAATTAA AAGGCCTCGT 420 

TTACCTTGAT CAACACCATT AATTGTAACA ATACTTATAC CTAACATACC TAATAA 476 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 633 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
- (D) TOPOLOGY: linear 



20 



25 



30 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

AGAAATACAA CGAAGCATAT AAATATAACC GATCTTTTTT CTAATTGAAT ATTAAGTAAG 60 

TGTATGTACT TTCTGGAAGT AGCACCTAGT rGGATTGTtC CTCCTACAAC AGGCCAAAAA 120 

TTTTTATTTT TAACTGGCTT AACAGTGTTC AGTTTTTCAT ACTCTTCTCT ACTAATTTTG 180 

GCGCACCTTT TTGGAATGAA CCAATTAATA AATGGAAAAA AGTATACAAG CCAAGTTCTT 240 

ATTACATCGA CCATTAAATA CTCATCATCA TACTTAATAA CTCTGTATTT CGGATTTTTA 300 

TTGATAATTT CGGTTTCACA AAGCAATAAT TATCACTTCC TATTAATAAC AAATTCACAC 360 
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ATTTTCAATC ACTACAGTAC CACCTATTAG ACTGCCAAGT GAAATCCCTA GTAATGGGAT 414 0 

AATCGGCAAA ATTGTTGGTT TTAGTAAATC ATGAATTAAA ATATAACGTT CATTCATACC 4 2 00 

5 GCGTAATCTT GATGCTTGTA CGATATTACT TTGCAATAAC ATCAATAAAT TAGAACGCAC 4 260 

TAAACGAATG ATGTATGCAC ACATACCTAA AGATAGCGTG ATTACAGGTA ATATAAACTG 4 320 

ACTTAGTATA ACGCTATCTA TATTCATTAA ATTTGTGACA ATAAATAATA AAATAATACC 4 3 80 

10 

GATAAAGAAC GCTGGTAAAC TAATCGATAG TGTTGAGATC ACTCTAATCA CTTTATCCGT 4 44 0 

CCACTTATGA AATCGTTTGG CTGCTATAAT GCCGAGCGGT ATAGATATGC ATAACGACAC 4 5 00 

TACTAATGTT GAAAATGATA TGAGTAATGT TATGGGTGCA TAGTTGAATA ATATCTGTGT 4 560 

15 

TACCGGTTCT TTTGATTCAA AACTTTTTCC TAAATTAAAA TGTAATAAAT GATTCATCCA 4 6 20 

ATGCCACCAC TGTACCAATA AAGAATCATT TAATCCCAAT TTATCTTTGG TTGCATTTAT 4 6 80 

TTGTTCCGTC GACACTTGTG CTACATCAAG ATGTAATATT TTATCAACAG GATTGCCTGG 4 74 0 

20 

TGATAATTTC ATTAAAATGA ATGTAAGTGT AGAAATAACA AATAAAACAA CTATCATTTG 4 8 00 

CATCAGTCTA TACAACATAG ACTTTATTAT GAACATAATA GTCCCCCTCC TTGTGTAAGT 4 86 0 

25 TACTAACACT TTCTTTTTAC ATGAGAATGG CGCATGTATA TGCAACTTAC ATATTAAGAA 4 920 

CTAACGTTCA TTATAGTATT ATCCATAAAG AAATTGAAGT ATATTTAATT TTTTAACAAA 4 980 

ATCATTATAA AATATAATAT TTTGAATCAA GTCAACCATG TAAAATATAA AAAAGTCAAA 5040 

30 ACAAAAACAA CTATAGCACT GTATTCCATC TCTTTCGAAA TAATTGTTAC TGCAGTGTAA 5100 

CTTAAAAGTC GATGATTTTG TGCATATAGT TGTCGAATAT TATTTTTTAT CTTTACGGCG 516 0 

AAGTTCAGCG CCCTCATAGC CGTATTTTTC AATTTGCTTT TCTAATTTAC GCGCTTTTCT 522 0 

35 TTCTTTACGC CAATTTCTAG TAAAATACCA TAATAGAAAA CTAATTAATA AACTCATAAT 52 80 

CGCTAAAAAT GCAGCGTATC CTAATAATGG TTGATATTTT ATATCTTGAA AATTTGGAAT 534 0 

AAAAAATGCA AGCACACCTA ATATAACAAA TGTAATTACT GCAGATACAA ACCATTTATT 54 00 

40 

TAAAACTAAG CAACAGAATA TTGTTAATAA AATCATTATT AATGTTGTGA TCCATAAATA 54 60 

ATTAGGCATA TCGAATAATG T CAT ATT CAT TCTCCTTTTA TTTCATTACT TTCCTTGTAT 5 5 20 

ACATTTTATT ATAAATTTTT AAAAACTTAA ACAATAGCAG TCAGTTTCAA GCAATATTCT 5 5 80 

45 

ATCTACTAAT AGAAAAATCA TTGTTCCTTG CGACATGGAA ATCGTAACAT TATCGTTTAG 5 64 0 

GAGACAAAAT TATGTATAAT GAATGTATTA TACCAAAGGA GTGATTATAT GTCTCAAGGT 570 0 

TTACCTTTAA GAGAAGATGT TCCTGTTTCA GAAACATGGG ATTTAGTAGA CTTATTTAAA 576 0 

50 

GATGATCAAC AATATTATGA AAGTATTGAC GCTCTAGTAC AnCAAGCAAA T CAATTT CAT 5 82 0 
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CCCTCTTTAA TTGTGTTCTA TATTTAATTA GACGTTCAGT ATACGGATGC AAATGCTCAT 234 0 

ACTTGAAATG ATTAATATTA CCTCGTTCAA TGATTTGACC TTCTTTTAAA ACATAAATGT 24 0 0 

5 ACTGACAATA TTTCAATACA TGACTTAAGT TATGTGTGAT AATAAATAAT GTTTGACCAT 24 60 

GTTCTAATAC AATATGCTGT AATAAATCCA TCACTTGATT ACCGTTCAAA GCATCCAATG 2 520 

ATGCAACTGG TTCGTCTGCA ATGATTAATT TAGGCTCCAA CATGAGAACG CTTGCTATGT 2580 

10 ATACGCGTTC AAGTTGGCCC CCAGAAAGTT GGAAACTATA TTTATTTAAT ATATCTTTGC 2640 

TTTGTAAATT AACCCACGAC AAAGCCTTAT CAACTTTGGA CAAAGCCTCT TCTTTACTAC 2700 

CTTTATAATG CTTACGATAA ATCGCAGTTA ACTGTTTACC TAATTTAGTA TGGTCGTTAA 2760 

AACTTTCTGC ATAATTTTGA GAAATATAGC CAATTGTATG ACCATAATAT TGACTCAATC 2 82 0 

TACTAACATT TTCCCCATCA AATTGGTACG AATCATACGT GCAGCTTAAA TCAAATGGTA 28 80 

AATATTCAAG TAAAGCTTTA GCAATCAAAC TTTTTCCAGC GCCGCTCTCT CCAATCAAGG 294 0 

CATTAATCTG TTGACTAAAA ATTTTCAAAT CAATCCCTTT AATAAGAGAT TTCTCACTAG 3 0 00 

TATTCTTTAT TGTTAAATTT TGTATATCAA TGAGACTCAT CATATTCACC CCGTTGTTTC 3 060 

AGCAATCTAT CTCTTAGTGC ATCACCGGTT AAATTAAAAA TTAAAATAGT TATAGCAATG 312 0 

ACTGAAGCAG GTGCAATCAA CATAATTGGA TGAGACGAAA TAAAATCACG ACCTTGTTGC 318 0 

AACATAGCGC CCCaCTCTGG TGTTGGCGGT TGTGCACCTA ACCCAATAAA TGATAGTGAA 324 0 

CTTATATATA GAATGATTTT ACCGAAATCA ACGACCATCA AAACGATAAT AGCCGGTATA 33 00 

ATTTTAGGTG TTAAATGACG TATTAATATT GTTCTTGTTG GTACATGAAA TAATTGTGCC 3 36 0 

ATTTTTATAT AAGGCTTATT CATTTCGCTA TTAACTATAC TTCTAGTCAA CCTTGTGTAA 342 0 

TTCATCCATT TTATTAATGT AATTGAGATA ACTAAATTCC ATAAAGATGG TTGAAAAAAA 34 8 0 

CTTGcTAAAG CAATCATGAT GATAAATTCT GGAATACTTA GACCAACATC AATAAACCTT 3 54 0 

AACACTAATC GTTCAATCCA CCCTTTTTTG TATCCGGCAA ATAGACCTAG TGTAACACCT 3 600 

40 ATGACAACGA TAGCTATTAA TGTTAAAACA GTAACAAACA ATGTTGAACG TGCACCGATA 366 0 

ATAATTCGGG TAAATAAATC TCTCCCATAA TCATCAGTTC CTAATAAATG CAACCAACTA 3 72 0 

ATAGGTTCAA AAGTTTGTGA TAAATTGACT TTGGTTGCAT TTTCACTACT GACAAAGAAT 3 780 

45 TGCAGTACAA TTACCAGAAA AATAAATGCA ACGAATACAA AAAATATCAG GTTATTCTTT 3 84 0 

GAAAATATTT TATGCATGAC GGTCACTACT TTCTGATATC AATGGTGTAT TGGTTTTGAT 3 900 

TTTTGGATTT CCTAATTGTA AACGCTGCTT CGGATCAAGT AATAACGTTA ATAAATCAGC 3 96 0 

AATCGTATTG ATAATAACAA CGAAGAAGCC AATAAATAAC ACGCATCCTT GAATAACAGG 4 02 0 
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GCTGCACAAC TTGACGCTGn AGAACGCGAT TTTTGAGCGT TTAGATCCAC ATAAGCTGGC 54 0 

CAGTTGTATT GATGTTGCAA ATATCGACAC GCCAATCAAG AkTATTTTAT TAAATATAGA 600 

5 CCCGGCACAA ATTACAACTA TATTAGACGA GCTAGATAAA TACCATCAAG AATTGGAAAT 66 0 

GATTCACCAT TCAAATGAGT ATAACATTGA TATAACAGCG CAAAATATTA ACAAATATAC 720 

TGCATTACAA TATATATTTG ATGCAGATGT TAAATATATA GCATTTGGTA ATGACCACAA 7 80 

10 

TGATATTGTC ATGTTACAAC ATGCTAGTAG TGGCTATATT ATAGGACCAT CAGAAGCATA 84 0 

CACACACGCA ATATTGAAAC TTGATAAAAT CAAACACATC AATAATAATG CACAAGCTAT 900 

TTGCAAAGTC TTAAAATCAT ATAAATAAAA ACACCCCTAT CAAATGATAA TCATTATCAA 960 

15 

TCGATAGGGG CTATTTTAAT AAAATTCGTC CTCGAACATT TCTTCCTCTT CATCTAATCC 1020 

AAATAATTCT GCCATTTCTC CATGTTCAAT TAACATGTTT AAATATGCAT CGCGGAGTTC 1080 

TTCTTCACTC ATATCATTAA TCATTTCTTT AAGACTATCA ATCCACATAT TTCTGCGTAA 114 0 

20 

TTGATAGTCT TCTTCAACTT CGTTTAACAT CATTATATGT TTATTTGCTG CTTCTGGACT 12 00 

AGCTGTAAAG AGTAATGCAA TCATATGTTT ACATATCACT CGTCTTCCAT CAGCATGAGG 12 60 

25 ACAATTACAT ATGGATTTTC TAGGATGTTC CATATCAATA TAACAACGAT ATACTTTGTT 132 0 

GCCACTGCCC TTTACTTCAG CCTCATGCTG CGTTTCTGAA AATGATTTTA AGTTAATGAC 13 8 0 

GCATTCACTT TGATAATAAT TAAAGCCTCT TTCTATAGAA CGAATACTTG CAATATCAAG 144 0 

30 TAATCCCATT AATGaTACTC CTTTTTATTA TTATTTTTAA ATAAAGAaAA TAAAATAGAT 150 0 

AAGTGTCTAG ATTAAAATAC TTGATTTATC TATATTTTAT AACAAGTCTA GAATTATCGC 156 0 

ATTCTTAAAT AACTAATATG AAAATGcTTG CACTAATTCt TTTGTATAAG GGTGTCTATC 1620 

35 AACATTAAAT AATTCCtCTA TTGCAAAATC ATCGACTATC ATGCCATCCT TAAGAACGAT 1680 

AATTCTATTA ACTAAGCGTT GTAACACGGA TAAATCATGA GAAATAACGA TAAAATGATT 174 0 

TAAGTTCGTA ATCGTTTGCG CTTTTAATAT ATTGATTACA TTTTGTTCAG CTATAACATC 1800 

40 

TAAATTTGAA GTTATCTCAT CACATATTAA AACGCGAGGC TGTGCTAATA ACGAACGCAT 186 0 

GACATTAAAT CTTTGTAATT GTCCGCCACT CACTTCGCTT GGTAATTTAG TCAATAATTG 192 0 

CGCGTTTAAC TCAAAAGTAG ATAAATGTTG TAATAATAAT TGATCCTGAG CAGTATTATC 199 0 

45 

AGTTAGACCT CTGTAATAAT ATAACGCTTC TTTTAATGAG GTCTCAATCG TCCAATCAGG 2 04 0 

GTTAAAGCTA GTTAAAGGGT GTTGGAAAAT CGGTAACACA GCATTGTCAC TTAAGTAAAT 210 0 

CTCTCCTTTA ACAGGTTTAA ACAAGCCAAG AACCAATGAA GCGAGCGTAC TTTTACCACA 216 0 

50 

GCCACTTTCG CCTAAAATAC CAACATTTTC TCCATCAGGT ATAGTAATAT TGAT AT CTTG 2 22 0 
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TCAAATAAAA AGTGATGTGA GTGAATTGTC AAAAAGTGAA GATCAACGTA TTACTAAAAC 1014 0 

AAAAGATGAA CAAATTAAGC AAATAGATAT ATCGGATATC AAACCGAATC CGTATCAGCC 10200 

CCGAAAAACT TTCGATGAAA ATCATTTAAA TGA1TTGGCA GATTCAATTA AGCAATATGG 10260 

AATTTTGCAA CCAATTGTGC TTAGAAAAAC AGTTCAAGGT TATTACATTG TAGTTGGTGA 10320 

AAGAAGGTTT AGAGCTTCGA AAATTGCTGG TCTAAAATAC GTATCAGCGA TTATCAAAGA 103 80 

TTTAACAGAT GAAGATATGA TGGAACTGGC GGTCATCGAA AATTTACAAC GAGAAGACTT 1044 0 

AAATGCGATT GAAGAAGCTG AAAGTTATCA ACGTTTGATG ACAGATTTGA AAATTACACA 10500 

ACAAGAAGTA GCGAAACGAT TGAGTAAGTC GCGCCCGTAT ATAGCGAATA TGTTGAGGTT 10 560 

ATTACATTTG CCGAAAAAGA TTGCTGACAT GGTAAAAGAT GGGCGACTGA CAAGTGCACA 10620 

TGGACGAACG TTATTGGCAA TTAAAGATGA ACAACAAATG CTTAGGTTAG CGAAACGGGT 10 680 

TGTTAAAGAA AAGTGGAGTG TCAGATATTT AGAAAACCAT GTTAATGAAT TAAAAAATGT 10740 

TTCGTCAAAG TCGGAAACAG ACAAAGTAGA TATAACTAAG CCTAAATTTA TAAAGCAGCA 10800 

AGAACGACAG TTGCGAGAAC AGTATGGTAC CAAAGTAGAT ATATCAATAA AAAAATCGGT 10 860 

25 TGGTAAAATC TCATTTGAGT TTGATTCACA AGAAGATTTT GTGAGAATAA TTGAACAATT 10 920 

AAATCGTAGG TATGGTAAAT AGTTACACAA TTTTATATAA TAACTCTTTG TGCAAGTGTA 10980 

AATAAATTGT AATCAGTGAC ATTTGATTCT AGAT 11014 
30 (2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6022 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
35 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

TCCCCTTATG GAATTTCACA TTCTAGTTTA CATAATATAT ATTATAGGAA GTTATATGTG 6 0 

TGTAACGCAA AAgGTACCCT ACATCATAAT CATTATCTAA TATCGTCACA TAACTTACTT 12 0 

ATGCTATAAT CATGGTATTA TATTGTTTGG AGTGATTTGA TGAGATTTGT CTTTGATATT 180 

GATGGTACGC TTTGTTTTGA CGGCCGATTA ATTGACCAGA CTATTATTGA TACATTGTTA 24 0 

CAATTACAAC ATGATGGTCA TGAACTTATA TTTGCATCAG CACGTCCGAT TCGTGATTTG 300 

TTGCCAGTTT TACCATCAGT ATTTCATCAG CACACATTAA TTGGCGCAAA TGGTGCTATG 360 

ATTTCACAGC AATCAAAGAT TTCTGTTATC AAACCAATTC ATACTGATAC ATATCATCAT 42 0 
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TCCACACCTC TACGCCAATC GAATATCACT TCTGTCTCTT TTGAAAGTGT CATACAATCT 8 34 0 

CTCCAATCTG AGCTTTATCT AATGCTTGGA TGATATCGCG TTCGATGTCT TCATAATTTT 84 00 

CAACACCTAG TGATAAGCGG ATTAAATACT CATCAATGCC ACGTTTATCT TTTTCAGCAT 84 6 0 

CTGGCATATC AACATGTGTT TGGGTGTAAG GGAAGGTCAC TAATGTTTCA GTACCTCCTA 8 52 0 

AACTTTCTGC AAAAATGCAA ATGTCTAAAT TTTCTAATAA TTTAGCGACG CTATAGGCCT 8 580 

TGTTAAGTCT TAAACTAAGC ATGCCAGTTT GCCCGCTATA TAGTACTTCG TCAATTGCTT 864 Q 

GAAGTGACTG ACATTTTTTA GCAAGTTTTC TAGCGTTTGA TTGCGCACGC TCAATGCGTA 8700 

AATGCAAAGT TTTAAGTCCA CGTAACAACA AATAACTATC TATTGGTGAA AGTGTTGCGC 8760 

CAGTCATGTT GTGAAAATCA AACAACTGTT GCGCGAGTGA TTCATCTTTG ACGGTTACGA 8820 

CACCTGCTAG TACATCGTTA TGTCCGCCAA TATATTTCGT GGCTGAATGT AAGACTATAT 8 8 80 

20 CAGCACCTTC TGCTAGTGGT GTTGAAAGAT AAGGTGTTAA AAAAGTATTG TCGATAATTG 8 94 0 

ACAATAAGCC TTTAGCTTTA CAAAGTTGAT AGTATGGCTT TACATCAATA GCAATCATTT 9000 

GTGGGTTAGA TATTGGTTCA ATGAATAATG CAACTGTTTT ATCAGTGATT TCTTTTTCAA 906 0 

CTTGTTCATA ATCTGTAAAA TCAACGTACT TAAATTTGAT ATCGTATTGT TGCTCGTAAA 9120 

ATTCAAATAA TCTAAATGTG CCACCATATA AATCGAATGA AACTAAAATT TCATCATGAG 9180 

GTTTAAATAG ATTACATATT AATTGAATGG CTGACATTCC ACTTGATGTA GCGAATGATG 924 0 

CAATACCATG CTCAAGTTTG GCAAAACAGG TTTCAAATGT TGAGCGTGTA GGATTTTTAG 93 00 

TACGTGTATA ATCAAAACCT GTCGATTGTC CTAGTTTTGG ATGCTTGTAG GCAGTAGATA 9360 

AATGGATTGG ATTCGCTATA GCACCGGTTG AATCATCGGT TAATGTGATT TGGGCTAACT 94 2 0 

GTGTATCCTT CATATTAAGA CCCTCCTATA AGAAAAAATA AAAAAAGCTT CCGTCCTTCG 94 80 

TACCCGAATG AATCGGATAA AAAGGACGAA AGCTTATGTT TCGCGGTACC ACCTTTATTT 954 0 

GTTATTCCAT CGCTGAAATA ACCTTATTCA GTACGCATTA AAAGTAAATA TGCTTACTGA 9 6 00 

ACAATTATCA CAATTAAAGT CAGTAAGTAA GGATATAGTA ATGTGCTATC CCATACTTAT 9660 

TAACAAAAAA TCGTGCGTAA AGAATCCAGT ACGCCATTTA ACATCAATGT TAATACTGTA 972 0 

45 TCGCTATAAC GGGCGAACCC GTAGACACCT CATATTGGCA TCAACACTCC AAGGCCATTT 97 9 0 

TCAAACACGC TTTCAAAATC TTCTCTCAGC TACTAAAGAC TCTCTGTATA AGCAGGGTGT 9 84 0 

GTTTTACTTy CCTCTTTATT GTGTTTACGT TTCATTAAAC TGTTATAAGA TATTAATTAG 990 0 

50 CTTACAGAGT AAAAAAAGAT TTGTCAACAA TTATTCAGAA AATTTTGATT TAAAAGTTAA 9 96 0 
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TACTTAAATT CACTATTTTC AATATCTAAT AAGCTGGCAT TTGGATAACA AGATAAGAAT 6 54 0 

GCGTGCTCTG GTAATTCAAT ATGTGTGAAA GACTCTTGCA TATGGTGCGG GCCATGATGA 6 6 00 

CAATTGAGTC CCACGATGTT TGCACCACAT TGAACGAGTT GTTTTAATCC TTCATTGATT 6 6 60 

GCCTGACCAT TAACTAAGTA ATTTGTGTTT GAAGCGGTTA ATTGAGCAAT GATTGGAATG 672 0 

TCGTATTTCT TTCTCGTTCG TGAAATGACA TTTGTTAACT CTTCTAGGTC GTAATACGTT 6780 

TCGAAAAGTA GCGCGTCAAC GCCTTCTTCA ATTAAGGTGT CTATTTGAAT TTCAGTATGA 6 84 0 

TAAAGAATAG TTTGTAAGCT GATATCCTCT TGTTTGATAC CTCTAAACCC ACCAACTGTG 6 900 

CCTAATATAT ACGTATCTTT ATTTGCTGCT TTTTTTGCGA TGCGAACGGC GGCTTGATGT 6960 

ATTGCTTTAA CTTTATCTTC AAGACCGAAT CGTTTTAACT TTTCAAAATT TGCACCATAA 7020 

GTATTGGTTT GAATGACATC AGCACCGGCT TCAATATATG AACGATGGAT GCGTTCAACT 7 0 80 

TTATCTGGAT GGCTAAGATT ATATGCTTCT GGACAGGTGT CTAATCCTTC AGAGTATAAA 7140 

ATGGTTCCTA TAGCGCCATC AG CT ACT AAA ACATTATCTT TCAATTGTGT GAGGAATTGA 72 00 

CTCATTGAAT GCCTCCTTTA ATGCGTATTT GATGTCTGCA ATGAGTTCAT CAGGATCTTC 72 6 0 

25 GAGACCAACA CTTAATCGGA ATAGACCGAA AGTGATACCA CGTTCTTGTC TCACTTCTTC 7320 

AGGTAGTGCA GCGTGAGACA TTGTTGCTGG ATGTGAAAGG ATCGTTTCAA CACCGCCCAG 73 80 

ACTCACTGAA ACGAGTGGTA ATGTCAGTGC ATCGACAAAT TGTTGTGCTT TAGACTCATC 744 0 

30 AGCTAAACGA AAGCCAATAA CGGCACCGCC ATTTTTAGCT TGTTCTAAAT GAGCAGTAGT 75 00 

GAGTCCCGGA TAATAAACTT CTGAAATTTC ATCTTGCTTT ATTAAAAATG ACACGATTTT 7560 

TTGAGCGTTT TCGACAGATT GTTTAAATCT GATTGGAAAA GTTTTTAAAT GTTTAGCAAG 762 0 

35 

TGTCCAGCTA TCCTGAGCAG ATAACATATT GCCTGTACCA TTTTGTATTA AATAAAGAGC 76 80 

GTCACTAATT GCCTCATTAT TAGTTATGAC AGCACCAGCA ATTAAATCGC TATGTCCACT 774 0 

TAAAAATTTT GTAGCACTAT GAATGACAAT ATCAGCGCCA AGTAATAAAG GTGATTGACc 78 00 

40 

TAACGGTGTC ATAAATGTAT TGTCCACAGC TACCAGTAGT TCATGCTTTT CGGCTATTTT 78 60 

AGAAACAGCT TTGATATCAG TAATTTTAAA ACAGGGATTC GATGGTGTTT CGATATAAAT 7920 

TAATTTTGTG TTTGATTGAA TGGCACCCTC GATTTGTTCG AGCTTTGTAG TATCTACGGT 7 9 80 

45 

TGTAAATTCA ATATTAAATC GATTCAAAAT TTGCTCAGTG AGGCGAAAAG TACCGCCATA 804 0 

TACATCATCG GGTAAGATGA CATGATCACC AGATTTGAAA GTCAAAAGTA CTGCTGAAAT 8100 

50 AGCAGCAATA CCTGATGCAA AAGCAAAAGC GAATTTTCCC TGTTCTAATC GTGCTAACTT 8160 

CTCTTCTAAA AGTTCACGGT TAGGGTTGCC CTTCGTGCAT AATCATATTT AACATCGCCA 82 20 
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AACACCAGCT TTTTCGAAAT AGTCATAAGC TTCACGTGTA ATATTTTCAT AGCTTTCGCT 4 74 0 

GTCGTCTGTA ACTAAGATTG GCTCATCAAC TTGAATGTAC TCAGCACCTG CATCAATTAA 4 800 

5 

TGATTCAAAC ACTTCTTTAT AAAGTGGTAA TAACGTTTTA ACTTTTTCTT CAAAAGTTTG 4 86 0 

GTGACCGCCT TTTGATAATT TAACAAAAGT AATCGGACCA ACAATGACAG GGTGAGCGTT 4 92 0 

AACGTTTAAA GATTGGGCAT ATTTAAAGCG ATCTAATAAT ACATTGCGAC TCACTTTAGG 4 98 0 

W 

CTCAACATTG TCCCATTCAG GTACGATGTA ATGATAGTTA GTGTTAAACC ATTTTATAAG 504 0 

TGCACTTGCA ACATGGTCTT TATTACCGCG AG CAATATCA AATAATAAAT CATCATCAAT 5100 

15 AGTTCTTCCT TGGAAACGTT CAGGGATGAT GTTGAATAAT AATGACGTAT CTAATATATG 516 0 

GTCATATAAA GAGAAATCAC CAACTGGGAT GCTATCTAAG TGATAGTACT TTTGtAATAA 522 0 

TAAATTTyCT TTATGTAGAT CAGTTAATGT TTGATCTAAT TCTTCTTTAG AAATCTTCTT 5280 

20 TGCCCAATAA CTTTCGATGG CTTTTTTCCA TTCTCTTTTT CTACCTAATC TTGGGAATCC 534 0 

TAAGTTTGAT GTTTTAATTG TTGTCATAAT ATTGCCTCCT TGTGAGCAGT AATAGATTTT 54 00 

GAGTATGCTG CAAGTTCTAA TGAATCTTCG ACATTTTGAA ACGGTGTGAT AATGTATAAA 54 6 0 

25 CCATTAAAAT ATTCATGAAC AGTATCGATT AAATCCTTTG AAAGCTTAAG ACTTAGTTCT 552 0 

CGTGTTTTGG CTTTATCATC TTTAACTGCT TCAAATTGTT GTAAAATTTC ATCTGACATC 55 80 

TTGATTCCTG GCACTTCATT ATGCAAAAAG AGTGCGTTTT TGTAACTTGC G ATAGG CAT A 564 0 

30 

ATGCCTATGA AAAATGGTTT GTTCAAGTGC TTAGTGGCAT GGTAAATTTC AATGATTTTC 57 0 0 

TCTTTGCTGT ACACGGGTTG TGTTATAAAA TAAGACATTC CGCTTTCTAT CTTTTTCTCT 5 76 0 

AATCTTTTGA CGGCACCATA TAATTTACGA ACATTAGGGT TAAAGGCGCC AgcGATGTTG 5 82 0 

35 

AAGTGTGTAC GTTTCTTCAG CGCATCACCG TCAGTGTTAA TACCTTGATT AAATCTTAGA 5 880 

GCGAGTTCAG TTAATCCTTT AGAATTAACA TCATAGACAT TGGTTGCACC TGGTAAGTGA 594 0 

CCAACTTTTG AAGGATCACC AGTTATGGCT AATATTTCGT TAACGCCAAT GAGCGATAAT 6 000 

40 

CCAAGTAAAT GGGACTGCAA GCCGATTAAG TTTCGGTCTC GACATGTAAT ATGTACGAGT 6 06 0 

GGTTCAATAT TGTAATATTG CTTAATTAAG CTAGCAGCAG CAATATTGCT AATTCTGACA 6120 

45 GTTGCCAATG AATTATCTGC GAGTGTTACC GCATCTACAT TAGCTTTATC AAGTTTAGCG 6130 

ATATTTTCAA AAAATCTATC CGTGTCTAAA TGTTTCGGTG TATCCAATTC GATAATAACG 6 24 0 

GTTGGACGTT CTTGAACCTT AGATGTTAAT GATTGTCTAA CTTTATTTTG AGATGGATTG 6 3 00 

50 AAAAGTGCTT TCGTTGGTAT CGGAATCACT TTTTTGTCAT TAACAGGTTT AAGTGTCTGA 6 3 60 

ATAGATTPTT TAATAAATTT GATGTGCTCT GGCGTTGTAC CACAGCAACC ACCAATTAAA *4?n 
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AAAGTGAATT GGTGCATCAA TGTGAGTACC ATATTGCGTT ACAATATTCC AACGTTGCAC 2 94 0 

A TAG AAA CCA TGATCTTTAA CCGTGAATAA AGTTGAAACT TCGCCTTTTT CAAACTCACT 3 00 0 

AAAACGTGGT ATTTCCGGAT CAAATGTATG CGTTAAATCA ACCCAAGTTG CTTGTTTTAA 3 06 0 

AGTATTTAAT TGTTGCCATA AAGGATATTG TGTCATAAAA TCACCCGTTT TTAGTTTATT 3120 

ATATGATAAA TGCTGCGATT ATTCTTGGCG TTTAGCTTTA ACAGCATTCA CAAGCACAGT 318 0 

CAATGCATCT TTAACTTCTT CTTCTTTTCG CGTTTTTAAA CCACAGTCAG GGTTTACCCA 3 24 0 

GAATAATGAG CGGTCGATTT GTTGTAGTGA ACGATTGATT GCTGTAGTAA TTTCTTCTTT 3 300 

TGTTGGAATA CGTGGACTAT GAATATCATA TACACCTAGA CCAATACCTA AATCATAATT 33 60 

15 

AATATCTTCA AAGTCTTTAA TTAAATCACC ATGGCTACGA GATGTTTCAA TTGAAATAAC 34 2 0 

ATCAGCATCT AAGTCATGAA TAGCATGAAT GATTTGACCG AATTGAGAAT AACACATATG 34 80 

20 TGTATGGATT TGAGTTTCAT CACGAACTGA AGACGTTGCA AGTTTAAATG ATAAAACAGC 354 0 

ATCTTTAAGA TATTGTTCGT GATATTCAGA GCGTAATGGT AAGCCTTCAC GTAATGCAGG 3600 

TTCGTCAACT TGGATAACTT TGATTCCTGC AGCTTCAAGT GCTAATACTT CTTCGTTGAT 3 6 60 

25 TGCTAAAGCA ATTTGATCTT GAACGACTTT ACGTGGTAAA TCAACACGTT CAAATGACCA 3 720 

GTTTAGAATT GTTACAGGTC CAGTTAACAT ACCTTTAACT GGTTTATCTG TTAAGCTTTG 3 780 

TGCATAAACT GTTTCATCAA CAGTTAAAGG CGCTGTCCAT TTTACATCAC CATAAATGAT 3 84 0 

30 TGGTGGTTTT ACGGCACGTG AACCATATGA TTGCACCCAA CCGAATTTAG TTACTAAGAA 3900 

ACCTTGTAAT TTTTCTCCGA AGAATTCAAC CATGTCATTA CGTTCAAATT CACCGTGAAC 3 960 

TAATACATCT AAGCCAATGT CTTCTTGAAT TTTAATCCAT CGAGCAATTT CATTTTTTAA 4020 

GAATGTTTCA TATGCTTCGT CTGTAATGCG TTTGTTCTTC CAATCTGCAC GGTATTTTCG 4080 

AACTTCTCGG CTtTGTGGGA ATGATCCAAT AGTTGTTGTT GGTAAATCCG GTAAGTTCAA 4140 

ACGTTTTTGT TGTTGTTCAA TACGTTGCGC GAATGGTGAT TGTCTTGAAG TACGCACGCT 4200 

TTCGAAATCA TAATCTAAGT TTTTGAATGA TTGATTTTGG AAACGCTCAT AACGTGCTTT 426 0 

TAATTTATCA TATTTAACAC TATCGTTTTG ATTAAATAGG CGACGCAATG CATCTAATTC 43 2 0 

GTCTAATTTT TCAGTTGCAA AGCTTAAGCC TTCGCCAACA CTTGTATCTA ATGTTTCATC 43 80 

ATCTAAAGAT ACTGGAACAT GTAATAATGA AGATGATGGT TGAATGACAA GTTCATTAGT 444 0 

GTGTGCTAAC AATTTATCGA TTAAGACTTT TTTAGCTTCA ATGTCACTTG CCCATACATT 4 500 

SO ACGACCATCA ATAATTCCAG CGTATAATGT TTTTGATTTA TCAAAATCTC CAGCTTCAAT 4560 

TTGTTTAAGG TTATAGCCAT TATCATGGAC AAAGTCTAAA CCTATACCAC CAACAGGTAA 4 620 
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TGACGAGAGT CGTATTAGCA GCAGCATACA GGACACCTAT TGGCGTTTTT GGAGGTGCGT 114 0 

TTAAAGACGT GCCAGCCTAT GATTTAGGTG CGACTTTAAT AGAACATATT ATTAAAGAGA 120 0 

5 

CGGGTTTGAA TCCAAGTGAG ATTGATGAAG TTATCATCGG TAACGTACTA CAAGCAGGAC 12 6 0 

AAGGACAAAA TCCAGCACGA ATTGCTGCTA TGAAAGGTGG CTTGCCAGAm ACAGTACCTG 13 20 

CATTTACGGT GaATAAAGTA TGTGGTTCTG GGTTAAAGTC GATTCAATTA GCATATCAAT 13 8 0 

10 

CTATTGTGAC TGGTGAAAAT GACATCGTGC TAGCTGGCGG TATGGAGAAT ATGTCTCAAT 144 0 

CACCAATGCT TGTCAACAAC AGTCGCTTTG GTTTTAAAAT GGGACATCAA TCAATGGTTG 1500 

ATAGCATGGT ATATGATGGT TTAACAGATG TATTTAATCA ATATCATATG GGTATTACTG 1560 

CTGAAAATTT AGTAGAGCAA TATGGTATTT CAAGAGAAGA ACAAGATACA TTTGCTGTAA 162 0 

ACTCACAACA AAAAGCAGTA CGTGCACAGC AAAATGGTGA ATTTGATAGT GAAATAGTTC 1680 

20 CAGTATCGAT TCCTCAACGT AAAGGTGAAC CAATCGTAGT CACTAAGGAT GAAGGTGTAC 174 0 

GTGAAAATGT ATCAGTCGAA AAATTAAGTC GATTAAGACC AGCTTTCAAA AAAGACGGTA 180 0 

CAGTTACAGC AGGTAATGCA TCAGGAATCA ATGATGGTGC TGCGATGATG TTAGTCATGT 1860 

25 CAGAAGACAA AGCTAAAGAA TTAAATATCG AACCATTGGC AGTGCTTGAT GGCTTTGGAA 192 0 

GTCATGGTGT AGATCCTTCT ATTATGGGTA TTGCACCAGT TGGCGCTGTA GAAAAGGCTT 1980 

TGAAACGTAG TAAAAAAGAA TTAAGCGATA TTGATGTATT TGAATTAAAT GAAGCATTTG 2 04 0 

30 CAGCACAATC ATTAGCTGTT GATCgTGAAT TAAAATTACC TCCTGAAAAG GTGAATGTTA 210 0 

AAGGTGGCGC TATTGCATTA GGACATCCTA TTGGTGCATC TGGTGCTAGA GTATTAGTGA 216 0 

CATTATTGCA TCAACTGAAT GATGAAGTTG AAACTGGTTT AACATCATTG TGTATTGGTG 222 0 

35 

GCGGTCnAAC TATCGCTGCA GTTGTATCAA AGTATAAATA ATAAGAAAAC AGGTTATCAC 2280 

AACAffTATTA ATtACATGTT GGCATAACCT GTTTTTATTT GTTTATGGAT TTATTGGGTA 2 34 0 

ATATTAGTCA TTTGATGGTT TAATTGCAAA TGCTCTAACA GGGAACCCAG GTGCATCTTT 24 00 

40 

TGGTTTAGGG CTGATAGCGT AAATGATGGC GCCACGAGTT GGTAATTGAT CTAAATTAGT 24 6 0 

TAATAACTCG ACTTGGTATT TATCCTGACC AAGAATATAA CGTTCGCCAA CTAAATCACC 2 52 0 

4 5 ATTTTTTACA ACGTCCACAG ATGCATCGGT ATCGAATGTT TCATGACCAA CAGCTTCAAC 25 80 

ACGACGTTCT TCAATTAAGT ACTTCAAAGC ATCTAATCCC CAACCCGGTG CATGTTGTTG 264 0 

TCCGTTCGCA TCTTTGTTTT CAAACTTTTC AATATTAGGC CAACGTTTTG ACCAATCGGT 2 70 0 

50 ACGAAGTGCA ACAAAAGTGC CAGGTTCAAT AGTACCATGC TCTTTTTCCC ATGCTTCTAT 2 76 0 

ST^^^r^^T ^T^m^A^^ AA^!"' ATTGTT flTT^GCTAOT Tr'TGTT^AAA AGT^TAATA^ 7370 
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TTTACCTGTT GATAAAATGA ACTTAGATAT CCCAAGATtA GTGCTTGATG GTATTGAAGT 12 6 0 

AGTAGGTTCA CTTGTTGGTA CAAGACAAGA CTTACGTGAA GCGTTTGAAT TTGCTGCTGA 13 2 0 

AAATAAAGTA ACACCTAAAG TTCAATTAAG AAAATTAGAA GAAATCAATG ATATTTTTGA 13 8 0 

AGAAATGGAA AATGGTACTA TAACTGGTAG AATGGTTATT AAATTTTAAA AATATCAACT 144 0 

GACTATATAG ATAAAGAAGG TAGTGCTCTG AACACTATCA TTATTAATCA AACCCCGAGG 150 0 

TTTTCCTGAA AAGATAGTGG nAAATCCCCG TGTTTTTTGG GTTTGAGGnG GTTGTnTGTA 156 0 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11014 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

20 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GTCCTGTnGC TGCAATGAAT ACGCCTAAAA ATCCAGGGAT GTAATGGATA CTTTGTGGTA 6 0 

25 GTACTAATGA TAGAAATGAT AAAAATGAAA TCACAAAGGC TACGCTCGCA AAAGCTTGAC 12 0 

ATGTACGCTT AT CG C CAT AA TCTAACCCTG TACGTATATG TAATAAATAC TGTAATCCGA 180 

TACTTAAATA CATAATTGCC ACGCATAAGA AGAATGGGAA GAATGTCTTT TCAAAGTCCG 24 0 

GATATAGGCT GTTAGATAGG AAGACCATGA TGAACATATT AAACATCATA AACGAGACGT 3 00 

CTTTGAATGT AACTTGACCA AATCGATTTG TAAAAAATGT TTGATGAGAC CACATTAACC 3 60 

ATAAGAACAA ACTCATGACG ATGTATTTGA AAAATAAATC AGCTGAAATG GAACCGTTTT 420 

GTGTTGTTAA AATCACATGT GCAATTTTTT GAATGGCATA GACGAAAATT AAATCAAAGA 4 80 

ACAACTCATG GAATCCTGCA CGCTTTTCAG CTAAATGTTT TGGTGTTAAT GCATTAACCA 54 0 

TAAAATTTTA ACTCCTTTAA GATGTGTAAT TAATTTACTA AGTATACTAT TTATTTTTTC 600 

TAGTGAATAG GGGCAGATTT GGCGATGAAG TGGAAGGAGA GGTGACTGCA AGGTAATTGC 6 60 

GGAATTAACA ATCATCAGCG ATTTAATATT TGACTGGAGA CGTCATGGTA ATAAAAAATT 72 0 

GATGAGAAAT TGATGGTGAA ACCAGCTGTG AATAsCGaTG cAATGATrsA TAGaATTTAA 780 

TTAGAGTCAT TACGCGaAAT GATTAATGAT AATTTGTGGT AAATCAAAGC aTAATTTTGT 84 0 

ACTATAGATG AGGATGATAG AGCATATTTA AGAGGGTGAA ATGTTAAAGT GAAACCGTTT 90 0 

50 ACGTTTCCGA TTGCCCAAAC AAATTACATC ATTGTATAAT ATGATTTGTT AAATGCATAA 96 0 

CAAGAATGAA AATGTAACAT ACGTAGCAAT TGGTTTCATA AATTGGATGT TAGTGGCGTA 102 0 
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GAAAGATGTG TATATTTTTT AGTTCTAGTT ATATTATTTT TTAAAAGACT CATCACGTGG 4 74 0 

TTCTTTAAGA ATTGCTTGTC TTAAAAGGAA AAATAGCAAC AATAAACCTG CAAGCATACC 4 800 

TGTGTGCCCA ATACCTGCAA AGCCTGCnAA TGCTTCTGGA GAGTATGATT TACCAGTGAC 4 86 0 

TTGGAAGAAT CCTTTTGTC 4 879 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: i 
20 ATAATGTCTT AGaTTGATTG GGAGTTTTTT 

yTAATAAAAA ATTTGAATAA CTGACACAyT 
TAATTCACAT TATAATAAGA GTGAAGATAA 
25 ATGTGAAGTA AAAATTACAC GTTAGCATAT 

CTgGGGaACG TTAAAAGTTA ACGGkTGATA 
TTTTAATCTG GCGTCTATTA CAAATAAAAA 
30 GAAAGTAATA GCCAATTAAT ATCATACATA 

GAGAGCAGCA GTTGTAACGA AAGATCACAA 
TTTAAAACCT GGTGAAGCGT TGGTACAAAC 

35 

ACATGTTAAG AATGCTGATT TTGGTGATGT 
TGGTAAAGTC ATCGAAGTTG CGGAAGATGT 
TATCGCTTGG ATGTTCGAAA GCTGTGGAAG 

40 

ACTTTGCCGT AGTGTGAAAA ATGCTGGTTA 
TATTGTTACT GCAGACTATG CTGTGAAAGT 

45 TTCTATTACA TGCGCAGGTG TGACAACTTA 

TGGACAATGG TTAGGTGTTT TTGGTATAGG 
TAAAAACGTT ATGGGGGCTA AAATTGTTGC 

SO CGCGAAAGAA TTAGGTGCTG ATGCTATTAT 

AGTTATGAAA TTAACTGATA ACAAAGGATT 



;EQ ID NO: 90: 

TAATTTTTTT GAAATTAAAT TAATCTGTAs 60 

TTTTTGATCA TAGCTAyATA CTTTGTGAAT 120 

GAGTATTATA AATnATCTTT AAATAAATAT 180 

CGATTATGgT CATTTCkTTT AACATATTAA 24 0 

TCyAACtAAA AACAAGGTCA CAGTAGTATG 3 00 

TTACATCTAT AATTATTCGT TTTCTTTTTT 3 60 

CTGGAGTGAC TATAAGGAGG ACATTATTAT 420 

AGTAAGTATT GAGGACAAAA AGTTAAGAGC 4 80 

GGAATATTGT GGCGTTTGTC ATACCGATTT 54 0 

TACAGGCGTT ACTTTAGGTC ATGAAGGTAT 6 00 

AGAATCATTA AAAATTGGAG ACCGTGTGTC 66 0 

ATGTGAATAT TGTACAACAG GTCGTGAAAC 72 0 

TACAGTAGAT GGTGCAATGG CTGAACAAGT 7 80 

ACCTGAAAAA TTAGATCCAG CAGCAGCGTC 84 0 

TAAAGCTGTA AAAGTAAGTA ATGTAAAACC 9 00 

TGGTTTAGGT AACCTAGCTT TACAATATGC 96 0 

CTTCGACATC AATGATGATA AATTAGCATT 102 0 

TAATTCTAAA GATGTTGATC CAGTTGCAGA 10 80 

AGATGCAACA GTGGTAACTT CAGTTGCTAA 1 1 4 n 
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TTGATGAACT 


TCCAGAACAA 


CTTAACAAAG 


TAATTAAACA 


TGAAAATAAA 


GGGCGCATTG 


2940 




TTATCGATTT 


CGGTGTAGAT 


AAATAGTATT 


CATGAAAAAG 


ACATCCCGTT 


ATGCGAGATG 


3000 


5 


TCTTTTTTAA 


TTTAGTATTT 


GATATACATA 


CCGCCTGAAT 


CTGGTTCGGT 


AGGTATAAAT 


3060 




CCAAATTTTG 


TATATAATTT 


ATCCGCTGGG 


TAGTCTGCAA 


TCAGAcTAAC 


GTATGTACTC 


3120 




TCAACAGCCA 


CACCTTTAAT 


ATATTGCATA 


ATATGCTCCA 


TAATTAGACT 


GCCGTAACCT 


3180 


10 


TGACCTTGGT 


AACTTTTCAA 


AACTGCAATA 


TCAACAATTT 


GAAAAACAGT 


TCCGCCATCG 


3240 




CCAATCACTC 


TACCCATACC 


AATTAACCGA 


TCTTTATCAT 


ACAAGGTTAC 


TGTAAATAAG 


3300 


15 


GCATTAGGTA 


ATCCTTTTTC 


aGCTGTTCGC 


GCGTCTTTGG 


ACTCATACCT 


GCGTTAATCC 


3360 


TTAATGCGCA 


ATAATCCTCG 


CAAGTCGGAA 


TATCATATGT 


CACTTTAACC 


ATTATTTACC 


3420 




CCACTTTTCA 


TCACACAATA 


TATCAACCTA 


GTATAAATGT 


TTATTTACAA 


TAGTCTTATT 


3480 


20 


CGCTTCTTTA 


AACACTTCAT 


GATGACTTGA 


AACATAACCC 


TCTGCATTCG 


CATCTGGTTG 


3540 




GATATATGTT 


TTAGCAAGGT 


TCGCTGCATT 


TGCACCATCA 


CTAAATGCAC 


TTGCAATTAG 


3600 




ATGTGATTTT 


GCATCATGAT 


AAACAATATC 


TCCACACGCA 


TAGATACCAG 


GTATACTAGT 


3660 


25 


TGTCGTATTA 


CCAAATCCTT 


TAACACGACA 


ATCATCATGC 


ATATCTAGCT 


TTGAAGATGT 


3720 




TtCACTCAAT 


AATGTATTAC 


AACGATCAAA 


CCCATGACTA 


ATAATGACAT 


CGTCAAATTT 


3780 




AACTGTATGC 


CTATCGCCAC 


TTTCAACATG 


TTCCAAAACA 


ACTTCACTTA 


TATGCGTTTC 


3840 


30 


ATCATCATTG 


CCGACCAAGT 


ATTTAATACG 


TGTTTTTGGG 


CATAGTTTCA 


CATTTAAATC 


3900 




TGTCACCAAC 


GTTTTCATCG 


CTTCATGACC 


ACTTACATCT 


TCTTTTCGAT 


AAACAACTGT 


3960 




CACG CTTTT A 


GCAATCTTGG 


CAATATCATG 


CGCCCAATCT 


AATGCTGTAT 


TTCCTCCACC 


4020 


35 


TGATATTAAT 


ACATCTTTAT 


CTTTGAAACG 


TCTGTAACTT 


TGTACAACAT 


AATGTAAATT 


4080 




AGTT5ATTGA 


TATCTCTCTA 


CACCTTTAAC 


ATCTAATTGT 


TTTGGATTAA 


TAATACCCGC 


4140 


40 


ACCAATTGCA 


ATGATAACTG 


CTTTCGATGT 


ATATATTTCT 


CCCGCTTCTG 


TTTCAACTTC 


4200 


GAAATGACGT 


TCTGCCTTTT 


TCCTAATATC 


TACCACACGT 


TCATTCAAAT 


GAACTTCCGG 


4260 




TTTAAAATAT 


AATCCTTGCT 


TAATTGTATC 


TTTTAAAATT 


TCATGACAAG 


GTTTTGGCGC 


4320 


45 


AATGCCGCCA 


ATATCCCAAA 


TAATTTTTTC 


AGGGTAAATT 


CTCATCTTAC 


CCCCTAATTC 


4380 


AGATTGAACA 


TCTATCAATC 


TTACAGACAT 


ATCTCGCAAT 


CCAGCATAAA 


AGCTTGCATA 


4440 




CAAACCAGAC 


GGACCGCCAC 


CAATGATTGT 


AACATCTTTC 


ATTATGTGCC 


TCCTATGACT 


4500 


50 


CTCTATATTC 


ATTTCTTTCA 


TTAACGTGCT 


CAAATTGATA 


ATTATTATCA 


TTTAAAGCCA 


4560 




TTATACTATT 


AATATTTATA 


TTGTTAAAAT 


AAATCGCATA 


GTTAGCCATG 


AATTATCAAT 


4620 
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TAAGGTTCTT ITT ATT AT AC CCTAATTTTT GTTCATTATT ATTTAATTTT TGTGAATTTT 114 0 

ATGtTTkCTA TAAATTTAAT TATTTTACTT TAACAATTCA TTACGCATTT AGCATTTCAA 12 0C 

GGTATACACA ATATTTA7TA CTATGATTTC ATTTTATCTG CTGCAAAAAC AATCATTATA 12 6 0 

ACTCTTTTTC CATAATTAAA TCTGTATCCG TTACATCACC TGTTTGAAAA TGATGTTCAC 13 2 0 

CAACCACTTT AAATCCATGA CGTTTATAAA ATGCTTGAGC ACGAGGATTA TGCTCCCAAA 13 80 

CTCCTAGCCA AATTTTATGT TTATTATGTT CTTGAGCAAT TTTTTCGGCC AATTCTATCA 144 0 

ATTGTGAACC TCTTCCGCCA CCTTGAAAGT CTTTCAAAAA ATATATGCGC TGCACTTCTA 150 0 

AATAGGTCTC CCCCATTTCT TCAGTTTGAG CACTATTAAT ATTCATCTTT ATATAACCAA 15 6 0 

CATTCGCACC ATCTTCTTGa TAAAAATAAT GAAATGAATC TACATGGTTA ATCTCTTGTG 162 0 

TAAATTTCTC TACAGTATAA TTGTCTTTAA AAAATTGATC AAAATCTTTG TCATCATAGT 16 8 0 

AAGAACCAAA CGTGTCATAA AATGTTCTAG TTGCTAATTC AACTAATTCA CTAGCATTTT 174 0 

GTTCTGAAAT TTCTTTGATT ATCCCAGCCA TATAAATCCT CCAATAAACA GTGATCGAAT 18 00 

CAAAATATTA CTTATGTTAT TTTTCAGCCA AAACTATTTA AAAATACATT AACACAAATC 18 60 

AATTACAAAT TGTATTGATT GTGTGTAACA TCAATAAATG ATACATTTAT TCCAGTAAAA 192 0 

TGGCCGTATT TTCAAAAGAG AAAAAGAGAG GATGTATCGT TGTGATAGAA ACATTTAAAG 19 80 

CGTTTGTAAT TGATAAAGAT GAGAGTGGTA AAGTGACACC AACTTTCAAA CAATTATCGC 2 04 0 

CTACTGATTT ACCTAAAGGA GATGTGCTGA TTAAAGTACA TTACTCTGGT ATAAATTATA 2100 

AAGATGCTTT AGCGACTCAA GATCATAATG CAGTCGTAAA ATCGTATCCT ATGATTCCAG 2160 

GAATAGATTT AGCTGGAACA ATTGTTGAAT CCGAAGCACC AGGCTTTGAa AAAGGAGAAC 2220 

AAGTAATTGT AACGAGTTAT GACCTAGGTG TCAGCCATTA TGGCGGTTTT AGTGAATATG 22 8 0 

CGCGTGTAAA ATCAGAATGG ATTATCAAGC TTCCTGATAC TTTAACATTA GAAGAATCAA 2 34 0 

TGATATATGG CACAGCTGGT TATACTGCCG GTTTAGCAAT TGAAAGACTT GAAAAAGTTG 24 0 0 

GAATGAATAT TGAAGATGGT CCTGTACTCG TTCGCGGTGC TTCAGGTGGT GTCGGTACTT 24 6 0 

TAGCAGTACT CATGCTTAAT GAACTTGGTT ATAAAGTTAT CGCAAGTACA GGTAAACAAG 2 52 0 

45 ATGTTAGCGA TCAATTACTT GAACTTGGTG CCAAAGAAGT TATCGATCGA CTTCCTGTTG 25 80 

AAGATGATCA TAAAAAGCCA CTCGCATCAT CAACTTGGCA AGCTTGTGTA GACCCTGTTG 2 64 0 

GTGGCGAAGG TATTAATTAT GTTACAAAGC GTTTAAATCA TAGTGGGTCA ATTACAGTTA 27 0 0 

50 TTGGTATGAC TGCCGGTAAT ACTTATACTA ATTCTGTATT CCCTCACATT TTAAGAGGTG 27 6 0 
TAAACATTTT AGGAATTGAC TCGGTATTTA CTGCTATGAA ATTAAGACAG COCGTTTGOr 
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CCTAGACTCA AAATAAAGTC TGGTAATTTT TTAGTAGAAA CTTTTTGAGC TATTTCAGGT 36 00 

CTCTTTTCTT TAATTAATTT TGCAATTTCC AACAAATTAA TTTGTCCATC AGCCGTCGCA 3 66 0 

ATAAATCGCT TGCCATTAGC TTGTTCATTT GTCATTGCCA AAATGTGCAG TTCAGCTACG 3 720 

TCTCTCACAT CAACAACATT TAACGGAATT TGCGGTACAC GTTTCATTGA ACCATTCAAT 3780 

AAATTTTCTA ATAAATGAAA GCTTCCTGAA ACGTCTGCAT CTAATGATGG CCCAAAAATT 384 0 

GCAACTGGAT TGATTGTGGC AAATTCTACT GTTGTATTTT CATTCT 3 886 
(2) INFORMATION FOR SEQ ID NO: 89: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4879 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GTCATCTATC AAAAATTTGG TATACAGACC GACAATTATT AATTAATAAT TTAATTTCCC 6 0 

AGGCAATACC AG TG ATT AAA TATCCACAAA TACAACATAA AGAACAACCA TTAGAATCTA 12 0 

TTTCACAACT TATATTGTCT AAGATGACAT CTAATCAATA GTGTTTAAAT TTCTCAGTGG 180 

CTGTGAATGA GGTTTAAAAG TACTATAAAA CGTAAACTTT GATACTTTAA AATACGCAAA 24 0 

AAACGGTAAA CCCTAATTCA TATTATAGAG TTTACCGTTT TATTTTTTAA CTTGCATCAT 3 00 

AGTTATATTA ACATTATTGT TGGTAGTTTG GATCAGTAAC CATTGCTTGT CCAGTATAAT 360 

CAACCGTTAC AATTGAATAT TTTCCaTTTG CATTTGGGTC TTTAAAACTA AACACATACT 42 0 

TATAGTTGCC ATTATGTTCT TCAATAGAAT AATCATTATA CACTTTATTA TTACTACCAA 4 80 

ATTTATTTGC TTCATTATTA GCCGCATTTA AAGCTGTTTG GAAATTTGGC AATTGCTGTA 54 0 

AAGCTTGATT TTTATTTCCA TTAAACGGAT AAATTTGACG TGCAACCGGC GCGGCATTTT 600 

GnCCATAATA TGGTGCAACG TAACTTGATT TTTGATTATT ATTCGCTTGG TTATTACTTG 660 

ATTGGTTATT ATTTGTTTGG TTTTGGTCAT TGTTTGTTGC ATTTGAATTA GATTGTTGCT 720 

GGTTATCGTT TGCACTATTA TCTTTATTAT CTTTGTTTAC GTCTTTACTA TCATCTTTAT 78 0 

TATCTTTCTT ATCTTTAGAT GAATCATTTG TTTTTTTATC TTGTTGTTCA GTTTTCGCTT 84 0 

TATCATCTTT TTCTTTATTA CCGTCTTTTT GTTGGTCACT ATCTTGACCA CATGCAGCTA 900 

AAAATAATGA TAATGCTAGT AACCCTGTAA CTAATCTTTT CATACATATC TCCTCCTATA 96 0 

ATTCGATATT CATTGAATAA TCTTGAAATA CATATCTACC ATGTGTATCT TTTCATGGCT 1020 
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AAGTTGAACA TAGTGACATT ATGACAGCAA GTCCAGAGAT GGCTGACTTG TTTATTTGTG 18 00 

GTAGAGATTT AGCTGAAAAT GCCGAACGTC TAGGGGATGT CTTAGTTCTT GATAATATTT 1B60 

5 

TAGATAAAGC TGAATTACAA CAAAAGCTCT CAGAAAAATT ACAACAACTT AACATGATTT 192 0 

AAAGGAGGTA CGACCTATGC AAGCAATCCT TAATTTTATA GTCGATATTT TAAGTCAACC 198 0 

AGCCATTCTT GTTGCACTGA TTGCCTTTAT AGGTTTAATC GTTCAGAAAA AACCTGCCGC 2 04 0 

10 

AACGATCACT TCAGGAACCA TTAAAACGAT ATTAGGCTTC TTAATTTTAA GTGCAGGTGC 2100 

TGATGTCGTC GTTCGATCTC TTGAACCATT CGGCAAAATA TTCCAACACG CATTTGGTGT 216 0 

15 GCAAGGTATC GTACCTAACA ACGAAGCTAT CGTCTCACTA GCCTTAAAAG ATTTTGGAAC 222 0 

AACAGCTGCA CTCATCATGG TCTGTGGCAT GATTGTTAAT ATTTTAATTG CCCGCTTCAC 22 80 

TAATTTAAAA TATATCTTTT TAACAGGTCA TCATACATTT TACATGGCTG CGTTTTTAGC 234 0 

20 AATCATTTTA ACAGTCAGTC ATATTAAAGG CTGGCTAACG ATTGTTATCG GCGCACTCGT 24 0 0 

ATTAGGATTA ATCATGGCAG TATTACCTGC ATTACTCCAA CCTACGATGC GAAAAATTAC 24 6 0 

AGGGAATGAC CAAGTAGCTT TAGGTCATTT TGGCTCAATC AGTTACTTTG CCGCAGTGCT 252 0 

25 GTAGGTCAAT TATTCAAAGG TAAGTCTAAA TCAACGGAAG AGATTAAATT TCCAAAAGGC 25 BO 

TTAAGTTTCT TACGAGAAAG TACAATTAGT ATCTCGATTA CGATGGCATT ACTTTACTTC 2 64 0 

ATCGCATGCT TATTTGCGGG CGTTAGTTAT GTACACGAAT CTATTAGTGA TGGTCAAAAC 2 70 0 

30 

TTTATTGTCT TTTCATTAAT TCAAGGTGTG ACATTTGCTG CTGGTGTATT TATTATTTTA 2 76 0 

ACGGGCGTTC GTTTAATCTT AGCTGAAATC GTCCCAGCAT TTAAAGGAAT TTCTGAAAAG 282 0 

CTTGTACCAA ATTCTAAACC TGCATTAGAC TGCCCTATTG TGTTCCCTTA TGCACAAAAT 28 8 0 

35 

GCAGTATTAA TTGGATTCTT TGTCAGCTTT ATTACAGGTG TCATCGGTAT GTTTATCTTA 2 94 0 

TTCTTATTTG GTGGCGTCGT CATTTTACCT GGCGTAGTTG CACACTTCTT CTTAGGTGCA 3 00 0 

ACGGCTGCTG TATTCGGTAA TGCAAGAGGC GGTATTAAAG GTGCTATTGc TGGCGCCGCT 306 0 

40 

CTAAATGGTA TCCTAATCAC GTTTTTACCA IT ATT ATT CT TGCCATTTTT AGGCGAATTA 312 0 

GGTGGTGCTG CAACAACATT CTCAGATACA GACTTTTTAG CTGTCGGTAT CGTGTTCGGT 3180 

45 AACGCAGTAA AATATATGGG ATTATTTGG? GCGATTCTAT TT ATT ATT AT CGTAGGTGCG 3 24 0 

ACAACAATTT TATTAAAAGG CCGTCAAAAA GAACAGCAAT AGTGTTAACG TAGAAATATA 3 30 0 

AAACACCGTC ACATATTGAG TGAATGCCCC TTTtATCAAG AGGAAAGCCA CTTACTTATG 3 36 0 

50 GACGGTGTTT TGTATTATAT TAAATGATAC TTAGCCATAC TATCGACAGC TGCTAAAATT 34 2 0 

GCTTCTTCTT GTGTCGCAAT CGGTTPCCAA CCAAGTAAT^ ''T'rT^A^ ^^-r^ . , n . 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 88: 

AGTTGTAATG TCACATTTCC AGAGTCTGAA ATTATCTTTA TCACGTTACA TTTACTAGGC 60 

5 TCTAAAATGA CTGAACATAC AGCATCTTCA ATTACCTTTG AATACCATGA TTTATCGCAA 120 

AATATACATG AATTGAT CAC TTGTGTTAGC CAAGAATTAG GCATTGATAT GTCAAAAGAC 180 

AACAAGTTAC ATACCAGTCT GATCACACAT ATCAAACCAG CTATACATCG TATTAAATAC 24 0 

W GATATGCTAC AACCTAATCC TTTGAGGCAA GAAGTTATGC GTCGCTATCC TCAAATCATT 300 

GAAGCCGTTA GCAAGCATAT TAGTCCAATT GAACAAGATG CTGCTATTCG CTTCAACGAA 360 

GATGAATTAA CATACATTAC AATTCACTTC GCATCAAGTA TAGAGCGTGT TGCAACACAT 42 0 

15 

AAACAATCAA TGATTAAGGT TGTCTTACTA TGTGGTTCTG GTATAGGCAC GTCACAACTT 4 80 

TTAAAATCAA AACTAAATCA CCTGTATCCT GaGTTnCACA TTTGGGAtGc CTATTcCATT 54 0 

TaTcAATTGG aAGaAAGTCG ATTATTGCAA GATAACATTG ATTATGTCAT TTCAACAGTA 600 

20 

CCTTGTGAAA TATCAGCTGT ACCAGTTATT CATGTCGATC CATTTATCAA TCAACAATCT 660 

CGTCAAAAAT TGAATCAAAT TATCAATGAC TCAAGAGAAC AACGAGTCAT GAAAATGGCA 720 

25 ACTGATGGCA AGTCACTCGC AGATTTATTG CCTGAACATC GCATCATTAT AAATAAACAA 7 80 

CCATTATCAA TTGAATCCGC AATTGCAGTG GCTGTGCAAC CTTTAATCAA TGATGGCATT 84 0 

GTCTATTCAA ATTATACAGC TGCAATTTTA AAACAATTTG AACAATTCGG GTCATATATG 900 

30 GTCATTAGTC CACATATTGC ACTTATTCAC GCTGGTACTG ATTATGTACA GAATGGTGTA 960 

GGTTTCGCAC TAACATATTT CACTGAAGGG ATTATCTTTG GTAGTAAAGC TAACGATCCC 1020 

GTTCACCTTG TAATTACATT AGCAACGGAC CACCCCAATG CACATTTAAA GGCATTGGGA 1080 

35 CAGTTAAGCG AATGCTTAAG CAACGACTTA TATCGACAAG ATTTCTTAGA TGGGAATATT 114 0 

TTT^AAATTA AACAACACAT TGCTTTAACT ATGACAAAGG AGGCTTAATA ACGTGTCATT 1200 

AGACATTTTG TCAACAACAC GCATCATTGT AAAAGAACAA GTAAATGATT GGACTGAAGC 1260 

40 

TATAACTATA GCTTCTCAGC CATTACTACA AGAACAAATT ATTGAACAAG GCTATGTTCA 1320 

AGCAATGATT GATAGCGTTA ATGAACTTGG ACCTTATATC GTTATCGCAC CTGAAATTGC 13 80 

AATTGCACAT GCAAGACCGA ACAATGACGT ACATCAAGTT GGTTTAAGTC TATTAAAGTT 144 0 

45 

GAATCAACAT GTGGCATTTT GTGATGAAGA TCACTACGCA TCTCTCATTT TTGTATTGAG 150 0 

TGCCATCGAC AATCATTCAC ACTTATCTGT ATTACAAAAT TTAGCAACCG TACTGGGCGA 15 60 

50 TAACCAAACA GTCCAGCAAC TATTAACTGC AACAAATGCA CAAGACATTA AAAACATTTT 162 0 

AAAGGAGCAT GATTAATATG AAAATTTTAG TAGTATGTGG CCACGGTTTA GGAAGTAGTT 1680 

55 



EP0 786 519 A2 



TGAAGGCATT AAAGGCGTTG AATTTAGAGA TCCATTACAA AGTGTGACAG CGGAAATGAT 8 04 0 

GCGTATTTAT AAAGACGTAG ATACATTTGT TGTTATATCA CATTTAGGAA TTGATCCTTC 8100 

5 

AACACAAGAA ACATGGCGTG GTGATTACTT AGTGAAACAA TTAAGTCAAA ATCCACAATT 816 0 

GAAGAAACGT ATTACAGTTA TTGATGGTCA TTCACATACA GTACTTCAAA ATGGTCAAAT 822 0 

TTATAACAAT GATGCATTGG CACAAACAGG TACAGCACTT GCGAATATCG GTAAGATTAC 82 BO 

10 

ATTTAATTAT CGCAATGGAG AGGTATCGAA TATTAAACCG TCATTGATTA ATGTTAAAGA 83 4 0 

CGTTGAAAAT GTAACACCGA ACAAAGCATT AGCTGAACAA ATTAATCAAG CTGATCAAAC 84 00 

ATTTAGAGCA CAAACTGCAG AGGTAATTAT TCCAAACAAT ACCATTGATT TCAAAGGAGA 84 60 

15 

AAGAGATGAC GTTAGAACGC GTGAAACAAA TTTAGGAAAC GCGATTGCAG ATGCTATGGA 852 0 

AGCGTATGGC GTTAAGAATT TCTCTAAAAA GACTGACTTT GCCGTGACAA ATGGTGGAGG 8 5 80 

20 TATTCGTGCC TCTATCGCAA AAGGTAAGGT GACACGCTAT GATTTAATCT CAGTATTACC 864 0 

ATTTGGAAAT ACGATTGCGC AAATTGATGT AAAAGGTTCA GACGTCTGGA CGGCTTTCGA 8700 

ACATAGTTTA GGCGCACCAA CAACACAAAA GGACGGTAAG ACAGTGTTAA CAGCGAATGG 8760 

25 CGGTTTACTA CATATCTCTG ATTCAATCCG TGTTTACTAT GATATAAATA AACCGTCTGG 8820 

CAAACGAATT AATGCTATTC AAATTTTAAA TAAAGAGACA GGTAAGTTTG AAAATATTGA 8880 

TTTAAAACGT GTATATCACG TAACGATGAA TGACTTCACA GCATCAGGTG GCGACGGATA 8 94 0 

30 TAGTATGTTC GGTGGTCCTA GAGAAGAAGG TATTTCATTA GATCAAGTAC TAGCAAGTTA 90 00 

TTTAAAAACA GCTAACTTAG CTAAGTATGA TACGACAGAA CCACAACGTA TGTTATTAGG 906 0 

TAAACCAGCA GTAAGTGAAC AACCAGCTAA AGGACAACAA GGTAGCAAAG GTAGTAAGTC 912 0 

35 

TGGTAAAGAT ACACAACCAA TTGGTGACGA CAAAGTGATG GATCCAGCGA AAAAACCAGC 9180 

TCCAGGTAAA GTTGTATTGT TgtAGCGCAT AGAGGAACTG TTAGTAGCGG TACAGAAGGT 924 0 

TCTGGTCGCA CAATAGAAGG AGCTACTGTA TCAAGCAAGA GTGGG AAACA ATTGGCTAGA 9 3 00 

40 

ATGTCAGTGC CTAAAGGTAG CGCGCATGAG AAACAGTTAT TTCATAATCA ACAGTCATTG 93 6 0 

ACGTAGCTAA GTAATGATAA ATAATCATAA ATAAAATTAC AGATATTGAC AAAAAATAGT 94 2 0 

45 AAATA 94 2 5 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3886 base pairs 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TAGTTGATGT 


TGGTTTGACT 


GGAAAGAAAA 


TGGAAGAATT 


GTTTAGTCAA 


ATTGACCGTA 


6240 


ATATTCAAGA 


TTTAAATGGT 


ATTTTAGTAA 


CCCATGAACA 


TATTGATCAT 


ATTAAAGGAT 


6300 


TAGGTGTTTT 


GGCGCGTAAA 


TATCAATTGC 


CAATTTATGC 


GAATGAAAAA 


ACTTGGCAGG 


6360 


CAATTGAAAA 


GAAAGATAGT 


CGCATCCCTA 


TGGATCAGAA 


ATTCATTTTT 


AATCCTTATG 


6420 


AAACAAAATC 


TATTGCAGGT 


TTCGATGTTG 


AATCGTTTAA 


CGTGTCACAT 


GATGCAATAG 


6480 


ATCCGCAATT 


TTATATTTTC 


CATAATAACT 


ATAAGAAGTT 


TACGATTTTA 


ACGGATACGG 


6540 


GTTACGTGTC 


TGATCGTATG 


AAAGGTATGA 


TACGTGGCAG 


CGATGCGTTT 


ATTTTTGAGA 


6600 


GTAATCATGA 


CGTCGATATG 


TTGAGAATGT 


GTCGTTATCC 


ATGGAAGACG 


AAACAACGTA 


6660 


TTTTAGGCGA 


TATGGGTCAT 


GTATCTAATG 


AGGATGCGGC 


TCATGCAATG 


ACAGACGTGA 


6720 


TTACAGGTAA 


CACGAAACGT 


ATTTACCTAT 


CGCATTTATC 


ACAAGACAAT 


AACATGAAAG 


6780 


ATTTGGCGCG 


TATGAGTGTT 


GGCCAAGTAT 


TGAACGAACA 


CGATATTGAT 


ACGGAAAAAG 


6840 


AAGTATTGCT 


ATGTGATACG 


GATAAAGCTA 


TTCCAACGCC 


AATATATACA 


ATATAAATGA 


6900 


GAGTCATCCG 


ATAAAGTTCC 


GCATTGCTGT 


GAGACGACTT 


TATCGGGTGC 


l"i"jL'i"rrATGT 


6960 


TGTTGGTGGG 


AAATGGCTGT 


TGTTGAGTTG 


AATCGGCTTG 


ATTGAAATGT 


GTAAAATAAT 


7020 


TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA 


ATATAAGGAT 


7080 


TATGATAATA 


TATTGGTGTA 


TGACAGTTAA 


TGGAGGGAAC 


GAAATGAAAG 


CTTTATTACT 


7140 


TAAAACAAGT 


GTATGGCTCG 


TTTTGCTTTT 


TAGTGTAATG 


GGATTATGGC 


AAGTCTCGAA 


7200 


CGCGGCTGAG 


CAGCATACAC 


CAATGAAAGC 


ACATGCAGTA 


ACAACGATAG 


ACAAAGCAAC 


7260 


AACAGATAAG 


CAACAAGTAC 


CGCCAACAAA 


GGAAGCGGCT 


CATCATTCTG 


GCAAAGAAGC 


732 0 


GGCAACCAAC 


GTATCAGCAT 


CAGCGCAGGG 


AACAGCTGAT 


GATACAAACA 


G CAAAGT AAC 


73 80 


ATCCAACGCA 


CCATCTAACA 


AACCATCTAC 


AGTAGTTTCA 


ACAAAAGTAA 


ACGAAACACG 


744 0 


CGACGTAGAT 


ACACAACAAG 


CCTCAACACA 


AAAACCAACT 


CACACAGCAA 


CGTTCAAATT 


*7 e A A 

7500 


ATCAAATGCT 


AAAACAGCAT 


CACTTTCACC 


ACGAATGTTT 


GCTGCTAATG 


CACCACAAAC 


7560 


AACAACACAT 


AAAATATTAC 


ATACAAATGA 


TATCCATGGC 


CGACTAGCCG 


AAGAAAAAGG 


"j c o a 


GCGTGTCATC 


GGTATGGCTA 


AATTAAAAAC 


AGTAAAAGAA 


CAAGAAAAGC 


CTGATTTAAT 


1 £ O A 

76 80 


GTTAGACGCA 


GGAGACGCCT 


TCCAAGGTTT 


ACCACTTTCA 


AACCAGTCTA 


AAGGTGAAGA 


774 0 


AATGGCTAAA 


GCAATGAATG 


CAGTAGGTTA 


TGATGCTATG 


GCAGTCGGTA 


ACCATGAATT 


7800 


TGACTTTGGA 


TACGATCAGT 


TGAAAAAGTT 


AGAGGGTATG 


TTAGACTTCC 


CGATGCTAAG 


7860 


TAcTAACGTT 


TATAAAGATG 


GAAAACGCGC 


GTfTAAGCCT 


TCAACGATTG 


TAACAAAAAA 


7920 
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GCAAACTATA ACGATAAAAA TGAAAAATAT CATTATAAAA ACCTGTCCGA AGATGAAGCG 44 4 0 

AGTTCCAGCA AAATGGAAGA AACGATTCCA GG AACCTTTG ATTTTATTAA TGGTCATGGT 45 0 0 

GGTTTCTTAA ACGAAGACTT TAGATTGTTT AGTACGAATA ATCAGTCAGG CGAGTTAACA 4 56 0 

TATCaACGTT TCCtTAATGG TTATCCAACG TTTAATAAAG AAGGTTCTAA TCAAATTCAA 462 0 

GTCACTTGGG GTGAAAAAGG CGTCTTTGAC TATCGTCGTT CGTTATTACG CACCGACGTT 46 8 0 

GTTTTAAATA GTGAGGATAA TAAATCGTTG CCGAAATTAG AGTCTGTACG TTCAAGCTTA 4 74 0 

GCGAACAATA GTGATATTAA TTTTGAAAAA GTAACAAACA TCGCTATCGG TTACGAAATG 4 8 00 

CAGGATAATT CAGATCATAA TCACATTGAA GTGCAGATTA ACAGTGAACT CGTACCGCGT 4 86 0 

TGGTATGTAG AATATGATGG CGAATGGTAT GTTTATAACG ATGGGaGGCT TGaATAAATG 4 92 0 

AACTGGaAAC TGACAAAGAC ACTTTTCATT TTCGTGTTTA TTCTTGTCAA CATCGTGTTA 4 9 80 

GTATCGATTT ATGTTAATAA AGTCAATCGC TCACACATTA ATGAAGT CG A GAGTAACAAT 5 04 0 

GAAGTTAATT TTCAGCAAGA AGAAATTAAA GTACCGACTA GTATATTGAA TAAATCAGTT 510 0 

AAAGGTATAA AATTAGAGCA AATTACAGGG CGATCAAAAG ACTTTAGTTC TAAAGCTAAA 5160 

25 GGCGATTCGG ATTTGACCAC ATCAGATGGT GG AAAATTAT TGAATGCGAA CATTAGTCAA 522 0 

TCGGTAAAGG TCAGTGACAA TAACTTAAAA GATTTGAAAG ATTATGTTAA CAAGCGCGTA 52 8 0 

TTTAAAGGTG CTGAATATCA ATTAAGCGAG ATTAGTTCAG ATTCTGTAAA ATATGAACAA 5 34 0 

30 ACGTATGATG ATTTTCCGAT TTTAAATAAC AGTAAAGCGA TGTTAAACTT TAATATAGAA 54 0 0 

GATAACAAAG CGACTAGTTA TAAACAATCA ATGATGGATG ACATTAAGCC CACAGATGGT 54 6 0 

GCAGATAAGA AGCATCAAGT GATTGGTGTG AGAAAAGCAA TCGAGGCATT ATATTATAAT 552 0 

CGTTACTTGA AAAAAGGTGA TGAAGTCATT AATGCTAGAC TCGGTTACTA CTCAGTCGTG 55 80 

AATGAAACGA ATGTTCAATT GTTACAACCA AACTGGGAAA TTAAAGTGAA GCATGACGGT 564 0 

AAGGATAAAA CGAATACTTA CTATGTCGAA GCGACAAATA ATAAC CCTAA AATTATTAAT 5 70 0 

CATTAATATG AATCGTAATA AGCTAGCATT GCAAGCTCAT CATATGTGAG AAGCGGTGCT 57 6 0 

AGCTTTTTTG CTGGTACGGT TTATTATGGC TGATGTTTTT GCGTCTCCAA CGTGCGCATT 5 82 0 

TATTCATATT TTAAGTAGAA CCGCATTGTA AAATTAGTGT AACTGTTATT TTAAAAACTT 58 80 

TAGTATTTGT CTAATCATTG TTATAATAAT TAAGAAATTC ATTGCACGTG ATTATCAAAA 5 94 0 

TTTAAATATA AG AAA CCGGT CGATGAACTA AAGTTACATA ATAGGAAAGG TATACAAAAC 60 0 0 

AGCTAATATA CTGATAGTTT CTGTAGGGAA AATCGTATAT TTGCACTGAT GTATATTGCA 60 6 0 

GTCATATAGA GAGATTGACT GTTTAAAGAG AAAGGATGAG CCGCTTGATA CGCATGAGTG 6120 
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GTGATGGTAT TATTGCAACA GACCGCCGTG GACGTATTCG TATCGTCAAT GATATGGCAC 2 64 0 

TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700 

GTCTTGAAGA TGAATTTAAA CTGGAAGAAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2 76 0 

ATTTAAATGA AGAAGAAGGT CTAATCGCAC GTGTTAACTT TAGTACGATT GTGCAGGAAA 2 82 0 

CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2 88 0 

AACGTGAGCG TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTTAACTT 2 94 0 

CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3 00 0 

CACAATTTTT ATCTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 306 0 

TGCTACAGTT ATCTAAAATG GATAATGAGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120 

TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180 

TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAATGACGC 3 24 0 

AAGTATTTGA TAATGTCATT ACAAATGCGA TGAAATATTC TAGAGGCGAT AAACGTGTCG 330 0 

AGTTCCACGT GAAACAAAAT CCACTTTATA ATCGAATGAC GATTCGTATT AAAGATAATG 336 0 

25 GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420 

AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 34 8 0 

TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 354 0 

TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 360 0 

ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 366 0 

TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3 720 

rAACCTTTAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATTACGCC ATTTCAGATT 3 780 

ATT CATTCGA AAAATGATCA TCCAGAAGGA ACGATTGCGA CGGTATCTAA TGTGAATAAA 3 84 0 

CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3 900 

AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT TCGATTTTAC GTATGATTTA 3 96 0 

CCGTTATCAA CATATCTTGG TCAAGTACTG AACATGAATG CGAAAGT AC C AAATCATTTC 4 020 

AATTTCAATC GTTTGGTCAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4 080 

AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 414 0 

GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4 2 00 

50 GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4 2 60 

TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4 320 
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GACCTCATCA TTGTGTTAAA TATCATTGTC ACAATCCGCC GTGAGAAACT AATAAAAAAT 84 0 

AGTAATATAT AAGTTTATAT TGGAAAATAG AATTAATAGC TTATAAATGG TAAATTATAT 900 

AATAGGTTAC TATACGTTAT AAGACGGAAA ATGCGCACAA TAACAAAAAT AGTAAGCGAC 960 

ATCCTGTGAT TTTTTACACA AACATAAACG ATAAAGAACA AAAAATGATA AAATAATATT 102 0 

AATGATTTAA GAAAAGAGGT TTATGCAAAT GGCTAGAAAA GTTGTTGTAG TTGATGATGA 10 BO 

AAAACCGATT GCTGATATTT TAGAATTTAA CTTAAAAAAA GAAGGATACG ATGTGTACTG 114 0 

TGCATACGAT GGTAATGATG CAGTCGACTT AATTTATGAA GAAGAACCAG ACATCGTATT 120 0 

ACTAGATATC ATGTTACCTG GTCGTGATGG TATGGAAGTA TGTCGTGAAG TGCGCAAAAA 12 6 0 

ATACGAAATG CCAATAATAA TGCTTACTGC TAAAGATTCA GAAATTGATA AAGTGCTTGG 1320 

TTTAGAACTA GGTGCAGATG ACTATGTAAC GAAACCGTTT AGTACGCGTG AATTAATCGC 13 80 

ACGTGTGAAA GCGAACTTAC GTCGTCATTA CTCACAACCA GCACAAGACA CTGGAAATGT 144 0 

AACGAATGAA ATCACAATTA AAGATATTGT GATTTATCCA GACGCATATT CTATTAAAAA 150 0 

ACGTGGCGAA GATATTGAAT TAACACATCG TGAATTTGAA TTGTTCCATT ATTTATCAAA 1560 

25 ACATATGGGA CAAGTAATGA CACGTGAACA TTTATTACAA ACAGTATGGG GCTATGATTA 162 0 

CTTTGGCGAT GTACGTACGG TCGATGTAAC GATTCGTCGT TTACGTGAAA AGATTGAAGA 16 80 

TGATCCGTCA CATCCTGAAT ATATTGTGAC GCGTAGAGGC GTTGGATATT TCCTCCAACA 174 0 

30 ACATGAGTAG AGGTCGAAAC GAATGAAGTG GCTAAAACAA CTACAATCCC TT CAT ACT AA 18 00 

ATTTGTAATT GTTTATGTAT TACTGATTAT CATTGGTATG CAAATT AT CG GGTTATATTT 1860 

TACAAATAAC CTTGAAAAAG AGCTGCTTGA TAATTTTAAG AAGAATATTA CGCAGTACGC 1920 

GAAACAATTA GAAATTAGTA TTGAAAAAGT ATATGACGAA AAGGGCTCCG TAAATGCACA 1980 

AAAAGATATT CAAAATTTAT TAAGTGAGTA TGCCAACCGT CAAGAAATTG GAGAAATTCG 204 0 

TTTTATAGAT AAAGACCAAA TTATTATTGC GACGACGAAG CAGTCTAACC GTAGTCTAAT 2100 

CAATCAAAAA GCGAATGATA GTTCTGTCCA AAAAGCACTA TCACTAGGAC AATCAAACGA 2160 

TCATTTAATT TTAAAAGATT ATGGCGGTGG TAAGGACCGT GTCTGGGTAT ATAATATCCC 222 0 

AGTTAAAGTC GATAAAAAGG TAATTGGTAA TATTTATATC GAATCAAAAA TTAATGACGT 2 2 80 

TTATAACCAA TTAAATAATA TAAATCAAAT ATTCATTGTT GGTACAGCTA TTTCATTATT 2 34 0 

AATgCACAGT CATCCTAGGA TTCTTTATAG CGCGAACGAT TACCAAACCA ATCACCGATA 24 0 0 

TGCGTAACCA GACGGTCGAA ATGTCCaGAG GTAACTATAC GCAACGTGTG AAGATTTATG 24 6 0 

GTAATGATGA AATTGGCGAA TTAGCTTTAG CATTTAATAA CTTGTCTAAA CGTGTACAAG 2 52 0 
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GCTAATGGGA AATATGGAAT AAATGTGATT TGGTGATCAA CACAATATTG TAATACTGCC 216 0 

TCATTTTCGC GATGCAATAA ATT AT ATT CT AACTGTACAA CATCAACGTA ACCATCTTTA 222 0 

5 TTTGCTTCTT TAAGTTGATC TAATGTGAAA TTTGATACAC CAATTGCTTT AATCTTCCCT 228 0 

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 234 0 

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 24 0 0 

w 

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 24 6 0 

TGAGACCCTT TCGTAGCAAT TTTAATTTGC TCTCGCGGAT ATTCTTTAAC AACTTCTCCA 2520 

ACCAATTCTT CTGATCGTTC TGGCCCATAA ATATATGCCG TATCTAATAA ATTAATACCA 258 0 

75 

TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 264 0 

TTATGCCCAa CCTAtGCGTT CGTCCCAAGT GCGATTGGAA ACACTTCAAC ATCAGATTTA 270 0 

20 CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 273 8 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9425 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



30 (xi) SEQUENCE DESCRIPTION: i 

GATTAGATGA TATTTAACGA AAATTAaGrT 
GAAAATAGCT ATTAAAATAG AGTAGACGTA 

35 

TTTCAAAAAT AGCGTAATTA TTATAATAAA 
TATAACATTC TTCAAAAAAT AATAGCAAAA 
TTGGCTGTGC TATAATCTAT CTTGTGCTTA 

40 

ACCGCCCTTT CACGGCGGTA ACACGGGTTC 
CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT 
GTCGAGACCG TACAAATGCC TATCCAAGAG 

45 

TTAATAAAAG ATATATGGAC GAATGATAAT 
TTAAAATGTA TGAACCTCAA GTAACTTAGT 
50 AAAATCTTTA TTAGACGTAC AAACATATGC 
GCCACCAGTC CTCCATAACA TCAATTGTTA 



!EQ ID NO: 87: 

GmAATACTtG AATGTArGAa GTCTGATGTC 60 

ATGtAAATGA AAGCACCTAA AATAGAAAAA 120 

TAGACTGCCA ATAAAATGCA ATTTTTCACT 180 

TTATGTAAAA AATATCTTGT CATGGCAAGA 240 

AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 3 00 

GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 360 

GTCACGCAGG AGATCGCGGG TTCGATTCCC 420 

GATAGGCATT TTTTTGCGTT TAATATTATA 4 80 

CATATTGATT TATCTGTTCG TCCATTTTCT 540 

GGTTGGATAT GAAAGATAAA CGTAGACAAT 600 

TACTGTCAAC ATATTTCTTC GTTGTGATAT 660 

AAGTAACGAA TAACGAATAA TGATATTTAT 720 
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TCTTTAAAAA CTTTTTCTTC TACTAATTTT AAATCTACAT ATGCGTTAGT CATTATTCCC 3 60 

CTCCTTTTCG TTTAATATAA TATTTAATTT ACTTAAAATG CTTTGTACAT AAGTGCTAAG 4 20 

5 TCTAACTTTT CGCCATACAT TTCTGGCTCA TAAGAGCGTA AGATTGTAAA ACCTTGCTCT 4 80 

TTATAGTAAG CTACTGCTTC TTCATTTTTA TTATCTACTT CTAAGTAAAC ACCTTCAAAT 54 0 

TTATCTTCAA AACGTGATAA TCCTTCATTT AAGAATGCTG TACCATAACC TGTATGTTGC 6 00 

10 

GATTCTGGTT TAACATAATG AGCTGATAAA TATAATTCTT CACCGTAAAT AAAGTTAGCA 66 0 

AAGCCAACGA TGTCATTACC TTCTTCAACG ACTAAGAATA ATTGTTCTTG AAGTCTTTTC 72 0 

TTTAAATGAT GTTCATTATA TGAAGCTtCT AACAAGTGAT TAACTGTTGT CGCAGCGTAT 78 0 

15 

ATATTTAAGT ATGTATTAAA CCAAGCTTTA GTTGCGACAT CTCTAATTTG AACAACATCT 84 0 

TTTTCAGTTG CTTGTCTTAC CTTGAACATG ACTTTCTCCC CTTATTAACA AGTTTTAATA 9 00 

ACGGCATTAT ACCACAACTT GCTCAATACT TAATAAACAA TGATTGTCTA TT CAATTT AT 96 0 

20 

ATATt TATAT TTTCCGTTAA AATTAAAAAT AAAAAATAAC GAAGCAAAAA AtCACTTCGT 102 0 

TTAGTATGAG GTATGTCTTA TTGCAATATA CTATTCCACT CAGTTGCACG TGCTAAGGCA 10 80 

25 TAGTTGTCTT TCATGATGTC ACCAGGCTTT TCAGCAGTTC CAATAATATA ACCATTTAAA 114 0 

GTGGCACCTA rAAAGTCTAA ACTATATTTC ATTTGCGTAA TTGCTGGTTC GCTTTTATTT 1200 

TTGGACAATC TCCACCAACT AAAATAACTC TAAAATCCTT TTCGGCCATT TGTGCCTTAA 12 6 0 

30 AATTAGGATA TCGTTTATCT TGTAATGTTT CTGACCAATG TTCGATAAAT GCTTTCAATG 132 0 

GTGCTGAAAT GCTATACCAA TACACTGGTG ATGCAAAAAT AATTGTATCA CTAGCCAATA 13 80 

TTTTATCTAG AATCGGCAAA TAGTCATCGT CATATGAAGT AATAGTCTCT GCTGTATGTC 144 0 

35 TCACGTCACG TATCGGTTTA AACTGATGTT GTGTCACGTC AATCCATTGA TACTCTAAAT 1500 

CTTGCAAAGC GAATTTTGTT AATTGTGCAG TATTACCGTT TGGTCTACTC CCACCAAACA 156 0 

AAACAGTAAT CATTTTAGCC TAAC CTCACT TTTGATTAAT AAATATCTGT GTTTTTCGTT 162 0 

40 

ACCTAATTAT ACTATCATAA GCTTTGCCTA CCGAATAGTA AAACGCTTAC AACTTTTATA 16 80 

TAAATTTGAC GAAATTTCGT CATGCCTTAT ATAACGTCGT TTGTGATACG GGGCTAATTC 174 0 

ATGATGAAAT TAGATACATA TATCACCATT AAATACAATT CATTTAGTCT TCAATCGGAA 19 00 

45 

ACAGTTCATC GATATATTGA AT CT CATCAT CTGATAAAAC GATATCTGCA GCTTTAATAT 186 0 

TTTCAACGAC TTGTTCTGCA CGTTTTGCAC CAGGAATAAT CACATCGATA GCTGGTCTCG 192 0 

SO TTAAATAAAA TGCTAATACA ATGTTCGCAA TTGAAGTTTG ATGTGCTGCA GCTATGCTTT 198 0 

CCAAAGCTTT TACGCGACGC ACATTTTCTT CAAATACACC TGGTTTAAAA TCACGACGTG 2 04 0 
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GTTGGGGATG 


GGCCCCAACA 


CAGAAGCTGT 


GACTATGATA 


AAGTACTACT 


ACATAGTTAA 


8040 


TCATTAGTGG 


TTCTTTATCA 


TTTTCGCCTC 


CCTTTTCTTA 


TTGTTTTGAT 


ACACAAAAAT 


8100 


TTAAGTTCAA 


ACTGTCGAAT 


AAAGTTATAT 


TTGATTTCAA 


ATTATCCCTA 


AATTATTAAT 


8160 


TkTACAATTG 


TGGCAGATTT 


TCAAAATAAT 


AATTATTTCC 


TCATTATTTA 


TAAATTTATA 


8220 


TTTAAATTTC 


ATTCTTTATA 


GGGTAAGATT 


AGGACTATAG 


TATGATGTGT 


ArATAATATA 


8280 


AATTAAGGTA 


TAGTAAAGCT 


AACTCAGAAA 


TGACTTATCA 


TTCGGAGGTT 


ACATTATGAA 


8340 


TAAACTATTA 


CAGTCATTAT 


CAGCCCTCGG 


TGTTTCTGCT 


ACACTAGTAA 


CACCAAATTT 


8400 


AAATGCAGAT 


GCAACGACGA 


ATACTACACC 


ACAAATTAAA 


GGCGCTAATG 


ATATCGTTAT 


8460 


TAAGAAAGGT 


CAAGATTATA 


ACCTTCTAAA 


CGG CAT AAGT 


GCATTTGATA 


AAGAAGATGG 


8520 


AGATTTAACC 


GATAAAATTA 


AAGT CGATGG 


CCAAATTGAT 


ACATCTAAAT 


CTGGTAAATA 


8580 


TCAAATTAAA 


TATCATGTCA 


CTGATTCAGA 


TGGTGCAATT 


AAAATTTCCA 


CTAGGTATAT 


8640 


TGAGGTTAAA 


TAGCCCTCAT 


CACTATACTG 


CAAATAAAAT 


GGTAGCAAAC 


GAACATGTTT 


8700 


TGCTACCATT 


TTATTTGTTA 


TTCTAACTTC 


ATCTGCAACT 


TTAACCCAAA 


TATTGTATTT 


8760 


TTTCTGTATA 


CCAAAGGACT 


ACCTATCAAA 


TTATTAAAAC 


TTAACTGCTC 


TTTTTAAAAA 


8820 


A. A TOTTTTG A 


TTTTGAACAA 


ACAAATTTCC 


ACTTTTCATT 


GTTTAACGAT 


AAATTACTTT 


8880 


TGGCAAATTC 


CTTATTAAAA 


TGTTTGCGCT 


TCCTTTCAAT 


CAACTAGCCA 


TCATTTTCAA 


8940 


TTTATTAGAC 


AATTTCAAAC 


TTTTTTTATT 


TTCATTCAAT 


TAACCTTTAA 


TTGAAAGCTA 


9000 


TTCTCAACTT 


TCCTTTTAAA 


TATGAAGCAA 


TTTTTTCAAA 


AACGCTATTA 


GTCACAAAAT 


9060 



GT 9062 
(2) INFORMATION FOR SEQ ID NO: 86: 

*"(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

AAATATTTTT TCAAAACTAT GTGAAAATGG aCCATGTCtA aATCATGTAA TAATGCAGyA 60 

CATAATGCCA ACGGTCTmTC TTTATTGTCC CATGCATCAT GACCAATAAA TGACTCATCA 120 

ATTAATCGTC TAACTATTTC ATACACACCT AAAGAATGTC CAAAGCGACT ATGTTCTGCT 180 

GTGTGAAAAG ATAGGTACAG TGTTCCTAGT TGTCTAATTC GACGTAACCT TTGGAATTCC 24 0 
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AAGAATCTAT 


ACCCTTTAAA 


CTTTTTGTTT 


CAATCGGCAC 


TCCAATAACT 


GGTAGCGTCG 


6240 




TTAATGATGC 


AACCATACCT 


GGTAAATGTG 


CCGCACCGCC 


AGCGCCTGCA 


ATGATAATGT 


6300 


5 


TTATACCTCT 


TTCTCTCGCT 


TCAGAAGCAA 


ATTGAACCAT 


CATTTTTGGC 


GTACGATGTG 


6360 




CGGATACTAC 


TTGTTTTTCG 


TACGGAATTT 


CAAAATAATC 


CAACATGTTA 


CAACTCTCTT 


6420 


10 


GCATAATTTT 


CCAATCGGAA 


GAACTGCCCA 


TAATGACTGC 


TACTTTCACT 


TTGTACACCC 


6480 


TTTCAAAAGT 


TTGAATTGTG 


AATTACTTTA 


GTTGTATATT 


ATAGATATAG 


CATAACAAGC 


6540 




AATTTCTGCT 


TTTTCAATCA 


AAAATCGAAC 


TTTATTTTGA 


TTTTTTATTT 


GAATTTACGT 


6600 


15 


CTTTTGCTAT 


GTAAATTAGT 


TTTATAAACT 


AACAAAGTTA 


GGATATTGAC 


AATAGGAGGA 


6660 




GAAGTTTTTA 


TGGTTGCTAA 


AATTTTAGAT 


GGTAAACAAA 


TTGCCAAAGA 


CTACAGACAG 


6720 




GGGTTACAAG 


ATCAAGTTGA 


AGCGCTAAAA 


GAAAAGGGTT 


TTACACCTAA 


ATTATCCGTT 


6780 


20 


ATATTAGTTG 


GTAATGATGG 


CGCTAGTCAA 


AGTTATGTTA 


GATCAAAAAA 


GAAAGCAGCT 


6840 




GAAAAAATTG 


GTATGATTTc 


AGAAATCGTA 


CATTTGGAAG 


AAACAGCTAC 


TGAAGAAGAA 


6900 




GTATTAAACG 


AACTAAATAG 


ACTAAATAAT 


GATGATTCTG 


TAAGTGGTAT 


TTTGGTACAA 


6960 


25 


GTACCATTAC 


CAAAACAAGT 


TAGCGAACAG 


AAAATATTAG 


AAGCAATCAA 


TCCTGAAAAA 


7020 




GATGTGGACG 


GTTTTCATCC 


AATAAATATA 


GGGAAATTAT 


ATATCGATGA 


ACAAACTTTT 


7080 




GTACCTTGCA 


CACCGCTCGG 


CATCATGGAA 


ATATTAAAAC 


ATGCTGATAT 


TGATTTAGAA 


7140 




GGTAAAAATG 


CAGTTGTAAT 


TGGACGAAGT 


CATATTGTCG 


GACAACCAGT 


TTCTAAGTTA 


7200 




CTACTTCAAA 


AAAATGCATC 


AGTAACAATC 


TTACATTCTC 


GTTCAAAAGA 


TATGGCATCA 


7260 




TATTTAAAAG 


ATGCTGATGT 


CATTGTCAGT 


GCAGTTGGTA 


AGCCTGGTTT 


AGTAACAAAA 


7320 


35 


GATGTGGTCA 


AAGAAGGAGC 


AGTAATTATC 


GATGTTGGCA 


ATACGCCAGA 


TGAAAATGGC 


7380 




AAATTAAAAG 


GTGACGTTGA 


TTATGATGCG 


GTTAAAGAAA 


TTGCTGGAGC 


TATTACACCA 


7440 


40 


GTTCCTGGTG 


GCGTTGGTCC 


ATTAACAATT 


ACTATGGTAT 


TAAATAATAC 


TTTGCTTGCA 


7500 


GAAAAAATGC 


GTCGAGGTAT 


TGATTCGTAA 


AGAGCCTGAG 


ACATAAATCA 


ATGTTCTATG 


7560 




CTCTACAAAG 


TTATAATGGC 


AGTAGTTGAC 


TGAACGAAAA 


TTCGCTTGTA 


ACAAGCTTTT 


7620 


45 


TTCAATTCTA 


GTCAACCTTG 


CCGGGGTGGG 


ACGACGAAAT 


AAATTTTA CG 


AAAATATCAT 


7680 




TTCTGTCCCA 


CTCCCTAATA 


ACTGAGTTTT 


AATGAAGTCT 


TTTAACCCAC 


ATTAAATATT 


7740 




ATTTTGCAAT 


TGCAATGAAT 


AACAAGAAAA 


ATCTGGGACA 


TTAATCGATC 


AAATGCTCCC 


7300 


50 


TTCAAAGTAG 


ACATTGAATA 


AATGAAGGCT 


TTGAAGGGAG 


CATTTCACTT 


TGTACTTGGC 


7860 




TCAACAATTT 


TATATAGACA 


GTAGTTAATT 


GAATGAAAAT 


AAGCTTGTAA 


CAAGTTTTCA 
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ATCCATTAAT TGTTTCAACA CATTATTAAT CTTTAATGCT TTGGATTTTA GTATTTCAAT 44 4 0 

ATCTTCATCT GATGCTATAT TGAGCAATTT AACATGGTCA TCCGTTATCA ACGGATCATT 4 5 00 

TAACGCATCA TTTTTATAGA AAAATTCTAC AAGTGGTTCT CTAAAAACTT CACCATTTTC 4 5 60 

AAAACCTAAA CGCTTTGTAA TAGATCCACT AGCAATATTA CGAACAACTA CTTCTAATGG 4 620 

AATTATTTTC ACAGGCTTAA CTAATTGTTC TGTTTCAGAT AATTGTTTAA TAAAGTGACT 4 6 80 

TTCTATTCCA TTTTCTTGTA AATATTTAAA TATAATAGAA GTAATTTGAT TATTTAATCG 4 740 

CCCCTTACCT GCCATTGTGT CTTTCTTAGC CCCGTTTCCA GCAGTAACTT CATCTTTATA 4 8 00 

TTCAACTCTT AATTCATTTT CTTGATTTGT TGAGAAAATG CGcTTCGCTT TTCCTTCATA 4 860 

TAATAATGTC ATGCTTTAAT TACTCCCCTC AAATTTAGCG TACATATCTT GTTCAGTTTG 4 92 0 

GTTTACATCA TTCGTTAGTA CAGTCATATG CCCCATTTTT CTGCTATCTT TACGCTCAGA 4 980 

CTTAC CATAA ATATGTAAGT GCCACTCTGG ATGTTCATTA AATTCATTTT CCAATAAATC 504 0 

TAAATCTTTA CCTAGTAAGT TCATCATGAC TGCTGGCTTT AATAATTCAA TTGAATTTGG 5100 

TAATGATTGT CCGGTAACTG CTAAAATATG AGTATCAAAT TGTGAATAAT CACATGCTTC 516 0 

25 AATTGAATAA TGTCCGGAAT TGTGAGGCCT TGGTGCTATC TCGTTCACAT ACAATTGGTT 522 0 

GTTACTATCT ATAAAAAATT CAACTGTAAA TGTTCCAATG AAATGAATCG ATTGGATAAT 52 80 

TTTATTAACT TGCTCTTTCG CCTCAGCTGT TTTATCTATT CTCGCTGGAA CAATTGTTTT 534 0 

30 GAAAAGTATT TGATTTCTAT GCTCATTTTC TTGTAATGGG AAAAAAGTGA TTTGATTGTT 54 00 

GTTTCCTCTT GTAACAGTAA GAGATACTTC TTTCTTGATA TTCAAATATT TTTCAGCTAC 54 60 

GCATTCACTA GTTTCAATTA ATTTAAAACC TTCTTGTAAG TCTTTTTCGT TGTTAATTAA 5520 

AACTTGACCT TTGCCATCGT AGCCACCAAA TCTAGTTTTT ACAATAAAAG GATATCCTAA 5580 

TGTT^CAATT GCTTTGTCAA TATCTGTAGA TTCTTTTACT GAAATGAACG GGACAACTTT 564 0 

GGTAC CAGCA CTTTTTAATG TTTCTTTTTC AGTTAAGCGA TCTTGTAATA ACTGTATAGC 5700 

TTGGTAACCT TGCGGAATAT TGTACTTTTC ACATAATAGT TTTAATTGTT GGGCTGAAAT 576 0 

GTTTTCAAAT TCATAAGTAA TCACATCACA TTTTTGTCCT AATTGATTGA GTGCCTTTTC 582 0 

ATCGTCATAC TTGGCTTGTA TAAATTCGTG TGCAACGTAT CTACATGGAC AATCTTCAGA 588 0 

AGGATCCAAT ACAACCACTT TATAACCCAT TTTTTGAGCT GATTGTGCCA TCATCTTTCC 594 0 

AAGCTGACCA CCACCAATAA TGCCAATAGT CGCACCAAAC TTTAATTTAT TGAAGTTCAT 6000 

so TTTGCATGTC CTCCACTTTT TGAATTAACG AAGATTCATA CTGATTTAGT TTTTCAACTA 6 06 0 

AAGAAGGATT TTGAATACTT AACATTCTTG CTGCAAGTAT ACCTGCGTTT TTAGCACCTG 6120 



35 



40 



45 



55 



EP0 786 519 A2 



w 



15 



20 



CGACCAGTTT TCAAACCAAC ATAAATGACC GAATTACCTA CACCTTTTGC TGTGCCTTTT 2 64 0 

TGAATCATGT CGTGATTGaT AACACCAACA CACATTGCAT TAACAAGTGG ATTGCCATCA 2 70 0 

TAACGTTCAT CAAATTCGAT TTCACCAGCA GTTGTTGGaA TACCAATGCA GTTACCATAA 27 6 0 

CCTCCGATAC CCTTTACAAC ACCTTTAAGT AATCTTTGGT TTTGTTTATT ATCTAATTCT 2 82 0 

CCAAATCTAA GACTGTTTAA CAAATTAATA GGTCTAGCCC CAATAGAGAC AATGTCACGA 2 8 80 

ATGATTCCAC CAACGCCTGT AGCAGCCCCT TGATATGGTT CAATTGCTGA TGGATGATTG 2 94 0 

TGAGACTCTA CTTTAAATAC TACGGCTTGA TTATCACCTA TATCGACTAC CCCTGCACCT 3 000 

TCACCAGGCC CCATAAGCAC ATGGTcACCT GACGTAGGAA ATTGCTTTAA AAACGGTTTA 3 060 

GAATGTTTAT AAGAGCAATG TTCACTCCAC ATAACAGAAA AGATACCTGT TTCTGTAAAG 3120 

TTAGGTTGTC TGCCTAAAAT ATCGCAAACT TTTTCATATT CTTGATCaCT TAATCCCATA 318 0 

TCTTGATATA CTTTTTCAAG TTTAATTTCT TCAACGCTTG GTTCGATAAA TTTAGACATG 3 24 0 

TTGTTCCCTC CAACTTTTTA CCATCGCTTC AAATAATTTC ACACCACTAT CAGTACCTAA 3 3 00 

CAACGTTTCT AAAGCTCTTT CagGATGt GG CATCATGCCA CATACATTGC CTTTTTCGTT 3 360 

25 AACAATTCCT GCAATATCAT CATATG AAC C GTTCGGATTA TTCACATATT TCAGAATAAT 34 20 

TTGATTGTTA GCTTTTAATT GTTGATATAT TTCATCAGTA CAATAATAAT GACCTTCACC 34 80 

GTGAGCTACA GGATATATAA CTTTTTCACC TTGTTCATAA AGATTTGTAA ATGCCGTTTG 3 54 0 

30 ATTATTCACT ATTTCTAACT CTTCATTTCT ACTAATAAAT AAATGTGAAT CGTTATG CAA 3 6 00 

TAATGCACCA GGTAATAAGC CTATTTCAGT TAAAATTTGA AACCCATTAC AAACACCTAA 3 6 60 

TACTGGCTTA CCTTCAGCTG CAAGACGTTT AACTTCCGAA ATAATCGGsG CTACACTAGC 3 72 0 

CATTGCCCCA GAT CTTAAGT AATCCCCGAA TGAAAATCCA CCAGGAATAA GTACGCCATC 3 7 80 

AAATCCACTT AGTGATGTTT CTCTATAATC TACATATTCC GCTTCAACAC CACTTTTAAT 3 84 0 

AGCAGCATTA AACATGTCTC TATCACAATT CGAACCTGGA AAAACAAGAA CCGCAAATTT 3 90 0 

CATTTTATGC ATTCTCCTTT TCATCATCTA ACACTTTATA GCTATATTCT TCAATCACTG 3 96 0 

TATTTGCAAA CAATTTTTCA CTTAGAGTTG TAATAATGTT GTGTACCTTT TCATCACTAA 4 02 0 

CCTCATCCAC TGTCATATAT AATACTTTTC CTACACGAAT ATCATTCACT TGTG CAT AAC 4 0B0 

4o 

CTAAGTCATG TACAGCTCGA GTAAGCGTTT GTCCTTGCGT ATCTAATACT TGTGGTTGTA 414 0 

ATGTGATATG TAGTTCAATT GTTTTCATTA TTTTAAATCC TCCAATTTGT TTAAAAATAT 42 0 0 

SO TTGATATGTT TCAATCAGTG ATCCAGTGTT ATTTCTATAT ACATCTTTAT CAAAGTTTGC 4 260 

ATTGGTAGCT TTATCCCAAA TTCGACATGT ATCTGGAGAT ATTTCATCCG CTAACAAAAT 4 320 



35 



40 



EP0 786 519 A2 



15 



20 



CGGTTGAATA TTTTCAATAC CTTTATT AC C TGAAGTAGCA TAACGGACGT GACCAATTGC 84 0 

ATGTTGATAT CCTTTTAATC GTTCCATTTG ATCATCTTTA ATCGCTTCAG TTAGTAAGCC 900 

TAATCCTCGC TCGCCTTTTA ATTCATTTTG ATCAGAAACA ACTATACCTG cACCTTCTTG 960 

ACCACGATGT TGCAAACTAT GAAGTCCCAT ATAtGTTAGT TGCGCTGCtT CaGGATGATT 102 0 

CCAAATACCA AACACGCCAC ATTCTTCGTT TAATCCTGAG TAGTTAAACA TTGaGCAATT 1080 

GCCCCtTCCC ATATTTGTTT AATATCTGAA ACATTTTCAC TAATCTCTGT aTATGGTGTT 114 0 

GTTACCTTGr aATTATCACT ATCTGTTAAA AGTCCAATTT CTATTGCATT ATCAATATTT 1200 

AAAGTTTTAC CTGATTTAAC AGAAACAACA TATCGGCCTT GCGTCTCACT AAACAATTGT 1260 

GCATTTGTTA TATCTATTGA AGATTTTAAT CCTAAACCGT AATGCGCACT TAGTTTAGCT 1320 

AAGGTAATCA GTAAGCCACC TTTACCAACT GTTTGAACAT GTGATAATAG TCCTTCACGA 1380 

ATAGCGGTCT TGATTGATTC ACCTTTTTCA ACTTCTGAAC TCAAATCTAA TGACTCAAAT 144 0 

TCATGATTAA CTTTGCCATA AATTAACTTT TCAAGTTGAC TACCACCAAA GTCGTCCTTA 1500 

GTATCACCGA TTAAATATAA TTTATCTCCA ACTTGAGGTT CAAAATCATT TAAATAATTT 156 0 

25 ACATTTTCAA TCAAACCTAC CATTCCAACA ACTGGTGTTG GGAAAATAGA AGTAC CTTTC 1620 

GTTTCGTTAT ATAAAGATAC ATTAC CAGAA ACTACTGGTG TCTTAAGAAT GTCGCATGCT 16 80 

TCTGCCATAC CTTTCGTTGA ATCTATCAAC TGTTGATAGA TTTCTTTCTT TTCAGGAGAA 174 0 

30 CCATAATTTA AACAATCTGT CATTGCTAAT GGTGTTGCAC CCACGGCAAT TAAATTTCGA 180 0 

TAAGCTTCAG CTACTACCAT CTTTCCACCT TCATATGGAT TGTTATATAC ATAACGCGCT 1860 

TCACCATCAA TTGTTGAAGC AATTGCCTTA TTTGTGCCTT CCACACGTAC TACCGATGCT 192 0 

TGAAGTCCTG GCTTAATTAT CGTATTGGCA CCAACTTGTT GGTCGTATTG AT CAT AT AAA 1980 

TAGTGTTTAG ATGCTATAGT CGGATGCTTA AGTAATTTAA AGAAAGTATC TTTAACATCG 204 0 

ATGTGTGTAT AATCATTTTT AGAAGTATTA TAATCTTTTT CTTCTCCTTC TAAAATATAT 2100 

ACAGGTGCTT CATCAGCTAG TGGTTCAACT GGAATGTCAG CATAAACTTC GTCATCATAT 216 0 

GTTAAAACAA AACGATTTGT ATCTGTAACT TCACCTATAA CAGCACTATC CAATTCGTGC 222 0 

TTATCAAATA AATCTAAGAA TTTTTGTTCA GTACCTTTTT CAACAACTAG TAACATACGT 228 0 

TCTTGAGTTT CTGAAAGCAT CATTTCATAA GGAGAAATAC CTGGCTCACG TGTTGGCACT 234 0 

TGTTCTAATC TCAAATGTAA CCCACTACCA CCTTTTGCCG CCATTTCAGA CGATGAAGAT 24 0 0 

50 GTTAAACCAG CAGCACCCAT ATCTTGAATA CCAACTAATT CATCAAATGT AATTGCTTCA 24 6 0 

AGTGTTGCTT CCATTAATTT TTTACCTACA AATGGATCAC CGATTTGTAC AGAAGGTCGT 2 52 0 
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ATTGAAAATG GTGTACCTAC AGTACCAAAC GTATTTAAAG TGACATTGGG AGAAATTGTA 516 0 

GATTTATTAT ACAAGTTCAA ACAGTCACGT CTCGATCGAA CATTGCCGAA ATTAGATAAC 5220 

5 

TTGTTTGAAA AAGATTTGTA TAGTACGTAT TTAAGCTATC TACCTAGTAC aGACTTTAGT 52 80 

TAyCCCTTAC TTATGAATGT GGATGATAGG GGTTCTTTTA CAGAATTTAT AAAAACACCG 534 0 

GATCGTGGTC AAGTTTCTGT AAATATTTCT AAACCAGGTA TTACTAAAGG TAATCACTGG 54 0 0 

w 

CATCATACTA AAAACGAAAA ATTTCTAGTC GTATCAGGTA AAGGGGTAAT TCGTTTTAGA 54 60 

CATGTTAATG ATGATGAAAT CATTGAATAT TATGTTTCTG GCGACAAATT AGAAGTTGTA 552 0 

15 GACATACCAG TAGGATACAC ACATAATATT GAAAATTTAG GCGACACAGA TATGGTAACT 5580 

ATTATGTGGG TGAATGAAAT GTTTGATCCA AATCAGCCAG ATACGTATTT CTTGGAGGTA 564 0 

TAGCGCATGG aAAAACTGAA rTTAATGACA ATAGTTGGTA CAAGGCCTGA AATCATTCGT 5700 

20 TTATCATCAA CGATTAAAGC ATGTGATCAA TATtTTAA 5738 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9062 base pairs 
25 ( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

ATCATCAACA AGAATGATAT TTTTCCCATC TACTATATCT TTTACCGCAG ATAACTTCAC 60 

TCTCACACCT TGCTCACGTA ATTCTTGAGT TGGTTGAATA AATGTTCTTG CAACATATTG 120 

35 

ATTTTTAACT AGTCCCATTT CATATGGCAA ACCTATTTCT TCAGCATAAC CACTCGCAGC 180 

TGATAGCGAT gAATTGGGTA CACCGATGAC CATATCAGCA TTTACAGGGC TTTCTTGGGC 24 0 

TAATTTTTTA CCAGAAGCTT TACGTACTGC ATGGACATTT TTACCAGCTA TTGTTGAGTC 3 00 

40 

TGGTCTAGCA AAATAAATAT ATTCCATCGC AGAAATTGCA GTTGTCGTAT GATGTGTATA 3 50 

AGATTTAACT GTAATACCTT TATCGTTAAT CACGACATAT TCACCTGCAT GAATATCTTG 4 20 

45 AACAAATTCT GCACCTAACA CATCTATTGC ACATGTTTCA CTTGCAAGGA TGTATGTCCC 48 0 

ATCTTTCATT TTACCTACAA CAAGTGGTCT GATAGCATTT GGATCTACTG CGCCATATAA 54 0 

CGCATCTTTA GTTAAAATCG CAAATGTAAA ACCGCCTTTA ACTTTTCGCA AACTTTCTTT 60C 

50 CAACGCTTCC TCAAAAGTAG GAGCTTTACT TCGACGTATC AAATGCATAA TGACTTCAGT 66 0 

AT^A^AAGAt" HAATGGAAGA TA^CACrTT^ TTTTT^TAAA TTfTnAmrA ATnA Trr TA^^ n ~>" 
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CCGGCGAAAA AATGTTTGAA GAGCTTATGA 
TTGAAAAAAT TTATCGTGGC AAAGTACAAC 
5 TTCAAGACAT CGTCAATGAC TTTAGTAAAG 

AGGGAGATAA TTATGTTCGA TGACAAAATT 
GGTAATGCTG TTATGAAACA GTTTTTAGAT 

10 

CGCGATGAGA AAAAACAAGA TGACATTCGA 
TACATTGGTG ATGTGCGTGA TAGTCAAAGT 
GTATTCCATG CAGCAGCTTT AAAACAAGTG 

75 

GTGAAGACAA ATATTATTGG TACAGAAAAT 
AAAAAAGTCA TATGTTTATC TACAGATAAG 

20 TCAAAAGCAA TGATGGAAAA AGTATTCGTA 

ACGCTTATTT GTGGTACAAG ATACGGTAAT 
TTGTTTATCG ACAAAATCAA AGCTGGAGAA 

25 AGATTTTTAA TGAGCTTAGA AGATGCGGTA 

GAGACAGGAG ATATTATGGT TCAAAAAGCA 
GC ATT ATT AG AATTGTTTGA AGCTGATAAT 

30 GAGAAAAAAG CAGAAACATT GTTGACGAGA 

GATTATTTTA GAGTGCCGGC AGACTCCAGA 
GGTAACGAAA AGATTACGCA ATCTTATGAA 

35 

GTGGAAGAGA TAAAAGAAAA ACTTTTAACA 
TATAAAGCTT CAATGAGATA GGAGAGATTG 
GGTTTTGTAG GAAAAAACTT GAAAGCAGAT 

40 

GAAGTACATC GACAAACTAA AGAGGAAGAA 
GTCGTGCATT TAGCGGGTGT TAATCGACCT 

45 GTGAGTTATT TAGATCATGT ACTTGATATA 

TTATTATCGT CTTCAATACA AGCAACACAA 
GGGGAACAGC TATTAAGAGA GTATGCCGAA 

50 TGGCCAAATT TATTCGGCAA GTGGTGTAAG 
TGTTACAAAA TTGCACGTAA CGAAGAGATT 



ATAAAGATGA GGTTCATCCT GAACAAGTAT 3360 

ATATGAAATG TAATGAAGTT GAAGCGATTA 34 20 

AAAAAATTAT TAACTATGCC AATGGCAAAA 348 0 

TTATTAATTA CTGGGGGCAC AGGATCATTC 3 54 0 

TCTAATATTA AAGAAATTCG TATTTTTTCA 3 6 00 

AAAAAATATA ATAATTCAAA ATTAAAGTTC 3 660 

GTAGAAACAG CAATGCGAGA TGTTGATTAC 3 720 

CCGTCATGTG AATTCTTTCC AGTTGAGGCA 3780 

GTCTTACAAA GTGCTATTCA TCAAAATGTT 3840 

GCAGCGTATC CTATTAATGC TAGGGGTATT 3 900 

GCCAAATCAA GAAATATTCG TAGTGAACAA 3 960 

GTGATGGCTT CAAGAGGATC AGTAATACCT 4 02 0 

CCTTTAACGA TTACAGATCC TGATATGACA 4 080 

GAACTAGTTG TTCATGCATT TAAGCATGCA 414 0 

CCAAGCTCAA CGGTAGGGGA TCTTGCGACC 4200 

GCAATTGAAA TCATTGGTAC GCGACATGGA 4 26 0 

GAAGAATACG CACAATGTGA AGATATGGGT 4 32 0 

GATTTAAATT ATAGTAATTA TGTTGAAACC 4 3 80 

TATAACTCCG ATAATACACA TATTTTAACG 444 0 

CTAGAATATG TTAGAAACGA ATTGAATGAT 4 500 

ACGTTGAATA TTGTAATTAC AGGAGCAAAA 4 560 

TTAACTTCAA CGACAGATCA TCATATTTTC 4 620 

TTAGAGTCAG CATTGTTGAA AG CAGACTTT 4680 

GAACATGACA AAGAATTCAG CTTAGGAAAC 4 74 0 

TTAACTAGAA ATACGAAAAA GCCAGCGATA 4 800 

GATAATCCTT ATGGTGAGAG TAAGTTGCAA 4 860 

GAGTATGGCA ATACGGTTTA TATTTATCGC 4 92 0 

CCGAATTATA ACTCAGTGAT AGCAACATTT 4 9 80 

CAAGTTAATG ATCGGAATGT TGAACTAACG 5 04 0 
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GTTCTTAATG AAAGACTTAT TTAATGATAA GAAATTACGT GATTATTATG AAGATATGAA 1560 

CGGATTTATT AGTAATGCGA AGTTAGTTGT TGATGATAAA AAAATTCCTA AACGAATGCC 16 20 

5 ACAACAAGAT TATAAACAGA AAAGATGGTT TGGGTTATAA ACAGCAAATG AGGGGTTTTA 16 80 

TGGCACATTT ATCTGTGAAA TTGCGGCTTT TAATACTAGC ATTAATCGAT TCACTGATAG 174 0 

TGACATTTTC AGTATTCGTA AGTTATTACA TTTTAGAACC GTATTTCAAA ACATATTCTG 18 00 

w 

TCAAATTATT AATATTGGCA GCTATATCAC TATTCATATC G CAT CAT ATT TCaGCATTTA I9 60 

TTTTTAATAT GTATCATCGA GCGTGGGAAT ATGCCAGTGT GAGTGAATTG ATTTTAATTG 1920 

TTAAAGCTGT GACGACATCT ATCGTTATTA CGATGGTGGT CGTGACAATT GTTACAGGCA 19 80 

15 

ATAGACCGTT TTTTAGATTG TATTTAATTA CTTGGATGAT GCACTTGATT TTAATAGGTG 2 040 

GCTCAAGGTT ATTTTGGCGT ATTTATCGGA AATACCTTGG AGGTAAGTCA TTTAATAAGA 2100 

20 AGCCAACTTT AGTTGTTGGT GCTGGTCAAG CAGGTTCAAT GCTGATTAGA CAAATGTTGA 2160 

AAAGTGACGA AATGAAACTT GAACCGGTAT TAGCAGTCGA TGATGACGAA CATAAACGCA 2 2 20 

ATATCACAAT TACTGAGGGT GTAAAAGTCC AAGGTAAAAT TGCGGATATT CCAGAACTAG 22 80 

25 TGAGGAAATA TAAGATTAAA AAAATCATCA TTGCAATTCC AACTATTGGT CAAGAGCGTT 2 34 0 

TGAAAGAAAT TAATAATATT TGCCATATGG ATGGCGTTGA GTTATTGAAA ATGCCAAATA 24 00 

TAGAAGACGT CATGTCTGGT GAGTTAGAAG TGAACCAACT TAAAAAAGTT GAAGTAGAAG 24 60 

30 ATTTACTAGG CAGAGATCCT GTTGAATTAG ATATGGATAT GATATCAAAT GAATTGACGA 2 52 0 

ATAAAACTAT TTTAGTTACG GGTGCAGGTG GTTCAATAGG ATCAGAAATT TGTAGACAAG 2 58 0 

TTTGTAATTT CTATCCAGAA CGTATTATTC TACTTGGCCA TGGTGAAAAC AGTATTTATT 2 64 0 

35 

TAATCAATCG TGAATTGCGA AATCGCTTCG GwAAAAATGT TGATAT CGTT CCTATTATAG 2 700 

CGGATGTGCA AAATAGAGCG CGTATGTTTG AAATTATGGA AACGTATAAA CCATACGCAG 2 760 

TTTATCATGC AGCAGCACAC AAGCACGTGC CGTTAATGGA AGACAACCCT GAAGAAGCAG 2 820 

40 

TACGTAATAA TATTTTAGGT ACGAAAAATA CTGCTGAAGC TGCTAAAAAT GCAGAGGTAA 2 8 80 

AGAAATTCGT TATGATTTCT ACGGATAAAG CCGTTAATCC GCCTAATGTC ATGGGAGCTT 2 94 0 

CAAAGCGAAT TGCAGAAATG ATTATTCAAA GTTTAAATGA TGAAACGCAT CGAACAAATT 3 0 00 

TTGTTGCAGT GAGATTTGGT AATGTACTTG GATCGAGAGG ATCTGTGATT CCACTTTTCA 3 0 60 

AAAGTCAAAT TGAAGAAGGT GGGCCAGTTA CTGTGACACA TCCTGAAATG ACACGTTACT 312 0 

50 TTATGACAAT TCCTGAAGCT TCTAGACTAG TTTTGCAGGC AGGGGCATTA GCAGAAGGTG 3180 

GCGAAGTATT TGTGCTAGAT ATGGGAGAAC CAGTGAAAAT TGTAGATTTG GCACGTAATT 3 24 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



[xi> SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

w 

GCAGACGGTA CAGCAGTTAA AGTCGCACCA AaACTGTAGT GAATcTAATC GGTGcATTCT 60 

TTTTAGGATT AGTTGTCGCG CTTATATATA TCTTCTTCAA AGTAATTTTC GATAAGCGAA 120 

TTAAAGATGA AGAAGATGTA GAGAAAGAAT TAGGATTGCC TGTATTGGGT TCAATTCAAA 180 

IS 

AATTTAATTA AGGATGGTTG CTACTTATGT CAAAAAAGGA AAATACGACA ACAACACTAT 240 

TTGTATATGA AAAACCAAAA TCAACAATTA GTGAAAAGTT TCGAGGTATA CGTTCAAACA 3 00 

20 TCATGTTTTC AAAAGCAAAT GGTGAAGTAA AGCGCTTATT GGTTACTTCT GAAAAGCCTG 3 60 

GTGCAGGTAA AAGTACAGTT GTATCGAATG TAGCGATTAC TTATGCACAA GCAGGCTATA 420 

AGACATTAGT TATTGATGGC GATATGCGTA AgcCAACACA AAACTATATT TTTAATGAGC 4 80 

25 AAAATAATAA TGGACTATCA AGCTTAATCA TTGGTCGAAC GACTATGTCA GAAGCAATTA 54 0 

CGTCGACAGA AATTGAAAAT TTAGATTTGC TAACAGCTGG CCCTGTACCT CCAAATCCAT 6 00 

CTGAGTTAAT TGGGTCTGAA AGGTTCAAAG AATTAGTTGA TCTGTTTAAT AAACGTTACG 660 

30 ACATTATTAT TGTCGATACA CCGCCAGTTA ATACTGTGAC TGATGCACAA CTATATGCGC 720 

GTGCTATTAA AGATAGTCTG TTAGTAATTG ATAGTGAAAA AAATGATAAr AATGAAGTTA 780 

AAAAAGCAAA AGCACTTATG GAAAAAGCAG GCAGTAACAT TCTAGGTGTC ATTTTGAACA 84 0 

35 

AGACAAAGGT CGATAAATCT TCTAGTTATT ATCACTATTA TGGAGATGAA TAAGTATGAT 900 

TGATATTCAT AACCATATAT TGCCTAATAT CGATGACGGT CCGACAAATG AAACAGAGAT 9 60 

GATGGATCTT TTAAAACAAG CGACAACACA AGGTGTTACA GAAATCATTG TAACATCACA 1020 

40 

TCACTTACAT CCTCGATATA CCACACCTAT AGAAAAAGTG AAATCATGTT TAAACCATAT 1080 

TGAAAGCTTA GAGGAAGTAC AAGCACTAAA TCTAAAGTTT TATTATGGTC AGGAAATAAG 114 0 

AATTACCGAT CAAATCCTTA ATGATATTGA TCGAAAAGTT ATTAACGGTA TTAATGATTC 1200 

4b 

ACGCTATTTA CTAATAGAAT TTCCATCAAA TGAAGTTCCA CACTATACTG ATCAATTATt 126 0 

TTTCGAATtA CAGAGTAAAG GCTTTGTACC GATTATTGCA CATCCAGAGC GGAATAAAGC 13 20 

50 AATAAGTCAA AACCTTGACA TACTATACGA TTTAATTAAC AAAGGTGCTT TAAGTCAAGT 13 80 

GACAACGGcG TCATTAGCGG GTATTTCCGG TAAAAAAATT AGAAAATTAG CAATTCAAAT 144 0 
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w 



ATTGCAGAAG AATTTGGTTT ATTAACTTAT GTTGACGACG CTCATGGTTC AGGTGTTATG 15000 

GGTAAAGGCG CTGGTACGGT TAAACATTTT GGTTTACAAG ATAAAATCGA TTTCCAAATA 15060 

GGTACGCTTT CTAAAGCAAT TGGTGTCGTT GGCGGTT ATG TAGCAGGTAC AAAAGAGTTA 1512 0 

ATAGATTGGT TAAAAGCACA ATCACGACCA TTCTTATTCT CTACATCATT AGCACCTGGG 15180 

GATACCAAAG CAATAACTGA AGCAGTTAAA AAGTTAATGG ATTCAACTGA ATTACATGAT 15240 

AAATTATGGA ACAATGCACA ATATTTAAAA AATGGATTGT CAAAATTAGG ATATGATACA 15300 

GGTGAGTCAG AAACTCCAAT TACACCAGTA ATTATTGGTG ATGAAAAAAC AACTCAAGAA 15360 

TTTAGTAAGC GTTTAAAAGA CGAAGGTGTC TATGTGAAAT CTATCGTTTT CCCAACAGTA 15420 

CCAAGAGGTA CAGGACGTGT AAGAAATATG CCTACAGCTG CACATACAAA AG A CATGTT A 15480 

GATGAAGCAA TTGCGGCTTA TGAAAAAGTA GGAAAAGAAA TGAAGTTGAT TTAATATTTA 15540 

TTTATTCCCA CGGCAAATAT TGTCGTGGGC TTTTTTTAAT GTTTAGTTTA TTAACAGT 15598 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

AAGTAAATCA ACTTACTGGG ATAAGAATAA AGGCGATTAT AGTAACAAGT TGATTTTATT 6 0 

CGAAAAACAT TTTGAACCGG TTCTGGGTAT CAAGATGCAA CATAGTGGAG GTCATAGCTT 12 0 

TGGCCACACG ATTATTACGA TTGAAAGTCA AGGAGATAAA GCAGTTCATA TGGGTGATAT 180 

ATTCCCAACT ACTGCACATA AAAATCCTCT ATGGGTAACG GCATATGATG ATTATCCTAT 24 0 

GCAATCGATT CGTGAAAAAG AACGCATGAT ACCATATTTT ATTCAGCAAC AATATTGGTT 3 00 

CTTGTTTTAT CATGATGAAA ACTACTTTGC TGTAAAATAC AGCGATAATG GTGAAAACAT 3 60 

AGATGCATAT ATTTTACGTG AAACATTAGT TGATAATAAC TAAAATAAAG ATGTATTACT 4 20 

AAACAAATTT TCAAAAATAA AAAATTGAGC CACATCCAAT CTTACTAATT AGGGTGTGGC 48 0 

TCATTTTTAA GTTTTACgAT CCAAATCAAA TATGGaTAAA ATTCgTATTA ACGCTCTACa 54 0 

ATGtTAATGA CTTCACCAGT ATATGCATCT GCATAAAAAT CATAATGAAT ATTTTGACCA 6 00 

TTTTTAATAG TTGTAATTCC ACCTTGATAA ACTAAACGGT ATTTATCAGT TTCAGGATGA 6 60 
A 
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TCAGAAAGAA TGCACCTGGT CGTACTTTCA AATAATGTGA AAAATCTTCT CCAATCATCA 13200 

TTAAATCTGA TTCATTAAAG CGTACATGTA AGTCATTTGT TGCTTCTTTA ATAACTTGAT 13260 

ATGCTTTCTC GTTATTATGG ACAGGCAAAT ACCCTTTAAT ATAATTCAAA TCATAGTTAA 13 320 

TATCATTTGC TATTGCTAAA CCTTGTAGAA GCTTATCCAT TTTGTCCATT ACATGATTCT 13380 

GTATATCTGA ATCGAAAGTT CTAACTGTAC CTTTACAAAA TGCTTGATCA GGAATAACGC 1344 0 

TATCTGTGGT GCCTGCTTGA ATCATTCCAA ATGAAAGTAC AG CTTGTTT A ACTGGATCGA 1350 0 

TCGTACGTGA AATTATTTTT TGTGCACTTA AAATGAACTC TGCCATGATT ACTATTGGGT 13560 

CAATGGTTTC ATGAGGTTTG GCACCATGAC CACCACGACC TTTAAATGTG ACGCTAAATT 13620 

CATCTGGAGA GGCCATGATT GCCCCCGCAC GTGAATGAAT AGTTCCAGTA GGATAACCAC 13680 

TCCATAAATG TGTACCGTAA ATTCTATCTA CATTTTCCAG ACATCCAGCA TCTAT CATTT 13740 

CTTGAGAACC ACCTGGCATG ATTTCTTCAC CGTACTGGAA TATTAATACA ACATTAC CTT 13 800 

CTAATAAATG TTTATGTTCA TCTAAAATCT CTGCTACAGT AAGTAAAATT GCTGTATGAC 13 86 0 

CATCATGCCC ACACGCATGC ATACATCCTG GATTTTTAGA CTTATAAGGC ACATCGTTTA 13920 

25 ATTCCTCGAC AGGTAACGCA TCAAAGTCAG CTCTTAATGC AATGGTAGGT CCTGTGCCCA 13980 

AGCCTTTAAA TGTGGCTTTG ATACCATTGC GGCCGATAGG AGTTTCAATA TCACAAGATA 14 040 

ACTGGCTTAA TTGGTTAACA ATATAATCAT GTGTTTGAAA TTCTTCAAAA GATAACTCAG 14100 

30 GATATTGGTG TAAATAACGT CTGAGTTGAA TTGTTTTATT TTCTTTATTA TTTG CTAGTT 14160 

GGAACCAATC TAACACCCTT ATCACTACTT TCTAAAATAA TGTTTATAGT ATAACATTTT 14220 

ATGAAATTAT CGTACTAAAT GATTGCTTTG AGATATTTTA TCTATGAATG ATAAGG CTTT 142 8 0 

CAAGTTATGT AGAATTACTG TATGATAAAG GTATTACCAA ACAATACTTA AGGGGGATTA 1434 0 

TATACTGTGG TTCAATCATT ACATGAGTTT TTAGAGGAAA ATATAAATTA TCTAAAAGAA 14400 

AATGGTTTGT ATAATGAAAT AGATACAATT GAAGGTGCAA ACGGACCAGA AATCAAAATC 14460 

AATGGGAAAT CATACATTAA CTT AT CTT CA AATAATTATT TAGGACTAGC AACAAATGAA 14 520 

GATTTGAAAT CaGctGCAAA AGCAGCTATT GATACACATG GTGTAGGTGC AGGCGCTGTT 14 580 

CGTACAATCA ATGGTACATT AGATTTACAC GACGAATTAG AAGAAACACT AGCAAAATTT 14 640 

AAAGGAACAG AAGCTGCAAT AGCTTATCAA TCAGGATTTA ATTGTAATAT GGCTGCTATT 14 7 00 

TCAGCTGTCA TGAATAAAAA TGATGCTATT TTATCAGATG AGCTTAATCA TGCATCAATT 14 760 

50 ATTGATGGAT GTCGCTTATC TAAAGCTAAA ATTATTCGAG TTAACCATTC AGACATGGAT 14 82 0 

GATTTACGTG CGAAAGCAAA AGAAGCAGTT GAATCAGGTC AATACAATAA AGTGATGTAT 14880 
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GCCTAGGTTA 


AAATACAAGG 


TGAGCTTAAA 


TGTAAGCTAT 


CATCTTTATA 


GTTTGATTTT 


11400 


TTGGGGTGAA 


TGCATTATAA 


AAGAATTGTA 


AAATTCTTTT 


TGCATCGCTA 


TAAATAATTT 


11460 


CTCATGATGG 


TGAGAAACTA 


TCATGAGAGA 


TAAATTTAAA 


TATTATTTTT 


AATTAGAATA 


11520 


GGAGAGATTT 


TATAATGGCA 


AAAGAAAAAT 


TCGATCGTTC 


TAAAGAACAT 


GCCAATATCG 


11580 


GTACTATCGG 


TCACGTTGAC 


CATGGTAAAA 


CAACATTAAC 


AGCAGCAATC 


GCTACTGTAT 


11640 


TAGCAAAAAA 


TGGTGACTCA 


GTTGCACAAT 


CATATGACAT 


GATTGACAAC 


GCTCCAGAAG 


11700 


AAAAAGAACG 


TGGTATCACA 


ATCAATACTT 


CTCACATTGA 


GTACCAAACT 


GACAAACGTC 


11760 


ACTACGCTCA 


CGTTGACTGC 


CCAGGACACG 


CTGACTACGT 


TAAAAACATG 


ATCACTGGTG 


11820 


CTGCTCAAAT 


GGACGGCGGT 


ATCTTAGTAG 


TATCTGCTGC 


TGACGGTCCA 


ATGCCACAAA 


11880 


CTCGTGAACA 


CATTCTTTTA 


TCACGTAACG 


TTGGTGTACC 


AGCATTAGTA 


GTATTCTTAA 


11940 


ACAAAGTTGA 


CATGGTTGAC 


GATGAAGAAT 


TATTAGAATT 


AGTAGAAATG 


GAAGTTCGTG 


12000 


ACTTATTAAG 


CGAATATGAC 


TTCCCAGGTG 


ACGATGTACC 


TGTAATCGCT 


GGTTCAGCAT 


12060 


TAAAAGCTTT 


AGAAGGCGAT 


GCTCAATACG 


AAGAAAAAAT 


CTTAGAATTA 


ATGGAAGCTG 


12120 


TAGATACTTA 


CATTCCAACT 


CCAGAACGTG 


ATTCTGACAA 


ACCATTCATG 


ATGCCAGTTG 


12180 


AGGACGTATT 


CTCAATCACT 


GGTCGTGGTA 


CTGTTGCTAC 


AGGCCGTGTT 


GAACGTGGTC 


12240 


AAATCAAAGT 


TGGTGAAGAA 


GTTGAAATCA 


TCGGTTTACA 


TGACACATCT 


AAAACAACTG 


12300 


TTACAGGTGT 


TGAAATGTTC 


CGTAAATTAT 


TAGACTACGC 


TGAAGCTGGT 


GACAACATTG 


12360 


GTGCATTATT 


ACGTGGTGTT 


GCTCGTGAAG 


ACGTACAACG 


TGGTCAAGTA 


TTAGCTGCTC 


12420 


CTGGTTCAAT 


TACACCACAT 


ACTGAATTCA 


AAGCAGAAGT 


ATACGTATTA 


TCAAAAGACG 


12480 


AAGGTGGACG 


TCACACTCCA 


TTCTTCTCAA 


ACTATCGTCC 


ACAATTCTAT 


TTCCGTACTA 


12540 


CTGAGGTAAC 


TGGTGTTGTT 


CACTTACCAG 


AAGGTACTGA 


AATGGTAATG 


CCTGGTGATA 


12600 


ACGTTGAAAT 


GACAGTAGAA 


TTAATCGCTC 


CAATCGCGAT 


TGAAGACGGT 


ACTCGTTTCT 


12660 


CAATCCGTGA 


AGGTGGACGT 


ACTGTAGGAT 


CAGGCGTTGT 


TACTGAAATC 


ATTAAATAAT 


12720 


TTCTAATTTC 


TTAGATTTTA 


TATAAAAAGA 


AGATCCCTCA 


ATCGAGGGGt 


CTTTTTTTAA 


12780 


TGTGTAAATT 


TTGTAATGGC 


TATTCGATTT 


AGAAGAACAA 


TAATTGATGA 


AAGACTGACT 


12840 


AATAAAACTT 


ATAACTGATA 


ATACTGTTTA 


AATAAAATTG 


TTGAGTCTTG 


GACATTGTAA 


129C0 


AATGCTCCCT 


TCAAAGTTTT 


CATTTTTTCa 


ATGTCTACTT 


TGAAGGGAGC 


ATTTCATTAG 


12960 


TTTATGTCTC 


AGATTCATAT 


CTTTCAATTA 


A TTT AAA TGC 


TTAATTTGTT 


TTAAATACTT 


13020 


GCTCTAATTC 


TATGATTTTT 


AAAAATACAG 


CTACAGCGTA 


TTTTAATGAT 


TTTTCATCAA 
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CAGTTACAGT 


ACTTGATGCA 


CAATCAGGTG 


TTGAACCTCA 


AACTGAAACA 


GTTTGGCGTC 


9600 




AGGCTACAAC 


TTATGGTGTT 


CCACGTATCG 


TATTTGTAAA 


CAAAATGGAC 


AAATTAGGTG 


9660 


5 


CTAACTTCGA ATACTCTGTA AGTACATTAC ATGATCGTTT ACAAgCTAAC 


GCTGCTCCAA 


9720 




TPPAATTACC 


AATTGGTGCG 


GAAGACGAAT 


TCGAAGCAAT 


CATTGACTTA 


GTTGAAATGA 


9780 




AATGTTTCAA 

X w X X X — r»X» 


ATATACAAAT 


GATTTAGGTA 


CTGAAATTGA 


AGAAATTGAA 


ATTCCTGAAG 


9840 


10 


7\ r«r a PTT AG A 


TAGAGPTGAA 


GAAGCTCGTG 


CTAGCTTAAT 


CGAAGCAGTT 


GCAGAAACTA 


9900 




n PG APG AATT 


AATGGAAAAA 


TAT CTTGGTG 


ACGAAGAAAT 


TTCAGTTTCT 


GAATTAAAAG 


9960 


15 


A AGPT ATCPG 


CCAAGPTaCt 


AcTAACGTAG 


AATTCTACCC 


AGTACTTTGT 


GGTACAGCTT 


10020 


TO a A a A APAA 


AP»T2TfiTTPAA 


TTAATGPTTG 

x x nn x w^. x x 


ACGCTGTAAT 


TGATTACTTA 


CCTTCACCAC 


10080 




T A P A f^TT A A 


A PP A ATT" ATT 
nvLAnl xnx x 


GGTGACCGTG 


CTAGCAACCC 


TGAAGAAGAA 


GTAATCGCGA 


10140 


20 


a appana.pp,A 


TTP AGPTG A A 


TTPGPTGPAT 
x x Vv^jVv x x 


TAGCGTTCAA 

x x x win 


AGTTATGACT 


GACCCTTATG 


10200 




ttvt'T A a a tt 




PGTGTGTATT 


PAGGTACAAT 


GACATCTGGT 


TCATACGTTA 


10260 




ACjAAC 1 I At 


Ta a anPTA a a 


pnTn A A PHTH 


TAGGTPGTTT 

X -f^VJVJ X V> >J XXX 


ATTACAAATG 


CACGCTAACT 


10320 


25 


LAtu 1 UAnon 


a ATPP,ATAPT 


GT AT A PTPTG 
\j x n x x V— x \j 


GAGATATCGC 


TGCTGCGGTA 


GGTCTTAAAG 


10380 




A 1 1 AU 


tggtgatapt 

X VjVj X On X nU X 


TTATGTGGTG 

X X rtX w X w\J X \J 


AGAAAAATGA 


CATTAT CTTG 


GAATCAATGG 


10440 






PP APrTT ATT 

O^wV-^VjX X f\ X X 


CACTTATCAG 


TAGAGCCAAA 


ATCTAAAGCT 


GACCAAGATA 


10500 


30 


A AATGAPTPA 


AGCTTTAGTT 

X»VJ W XXX J»vJ X X 


AAATTACAAG 


AAGAAGACCC 


AACATTCCAT 


GCACACACTG 


10560 






TGGACAAGTT 


ATCATCGGTG 


GTATGGGTGA 


GCTTCACTTA 


GACATCTTAG 


10620 




TAGAPPGTAT 


GAAGAAAGAA 


TTCAACGTTG 


AATGTAACGT 


AGGTGCTCCA 


ATGGTTTCAT 


10680 


35 


a TPP.TG A A A P 


ATTPAAATCA 


TCTGCACAAG 


TTCAAGGTAA 


ATTCTCTCGT 


CAATCTGGTG 


10740 




gtpg^tggtpa 


ATACGGTGAT 


GTTCACATTG 


AATTCACACC 


AAACGAAACA 


GGCGCAGGTT 


10800 


40 


TPGAATTPGA 


AAAPGCTATC 


GTTGGTGGTG 


TAGTTCCTCG 


TGAATACATT 


CCATCAGTAG 


10860 


AAGPTGGTPT 


TAAAGATGCT 


ATGGAAAATG 


GTGTTTTAGC 


AGGTTATCCT 


TTAATTGATG 


10 920 




TTAAAGCTAA 


ATTATATGAT 


GGTTCATACC 


ATGATGTCGA 


TTCATCTGAA 


ATGGCCTTCA 


10980 


45 


AAATTGCTGC 


ATCATTAGCA 


CTTAAAGAAG 


CTGCTAAAAA 


ATGTGATCCT 


GTAATCTTAG 


11040 




AACCAATGAT 


GAAAGTAACT 


ATTGAAATGC 


CTGAAGAGTA 


CATGGGTGAT 


ATCATGGGTG 


11100 




ACGTAACATC 


TCGTCGTGGA 


CGTGTTGATG 


GTATGGAACC 


TCGTGGTAAT 


GCACAAGTTG 


11160 


50 


TTAATGCTTA 


TGTACCACTT 


TCAGAAATGT 


TCGGTTATGC 


AACATCATTA 


CGTTCAAACA 


11220 




CTCAAGGTCG 


CGGTACTTAC 


ACTATGTACT 


TCGATCACtA 


TGCTGAAGTT 


CCaAAATCaA 


11280 
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ATGTTGACGA ATTCTCTTGT TCAATGTTAA TATATTAAAG GTTGATGCAA GCAGAACTTT 78 0 0 

GGAGGATAAA TTATTGTCTA AGGAAAAAGT tGCACGCTTT AACAAACAAC ATTTTGTAGT 786 0 

5 

TGGTCTTAAA GAAACGCTTA AAGCGTTAAA GAAAGATCAA GTTACATCTT TGATTATTGC 7 920 

TGAAGACGTT GAAGTATATT TAATGACTCG CGTGTTAAGC CAAATCAATC AGAAAAATAT 7 980 

ACCTGTATCT TTTTTCAAAA GCAAACATGC TTTGGGTAAA CATGTAGGTA TTAACGTCAA 8 04 0 

10 

TGCGACAATA GTAGCATTGA TTAAATGAGA ATTAGTAAGT GTTTTACTTA CTAAATTTTA 8100 

TTTAACCTAA AAATGAACCA CCTGGATGTG TGGGATTAAA AAGTGAAGAG AGGAGGACAT 8160 

ATCACATGCC AACTATTAAC CAATTAGTAC GTAAACCAAG ACAAAGCAAA ATCAAAAAAT 822 0 

15 

CAGATTCTCC AGCTTTAAAT AAAGGTTTCA ACAGTAAAAA GAAAAAATTT ACTGACTTAA 82 80 

ACTCACCACA AAAACGTGGT GTATGTACTC GTGTAGGTAC AATGACACCT AAAAAACCTA 834 0 

2o ACTCAGCGTT ACGTAAATAT GCACGTGTGc gTtTATCAAA CAACATCGAA ATT AACG CAT 84 0 0 

ACATCCCTGG TAT CGG A CAT AACTTACAAG AACACAGTGT TGTACTTGTA CGTGGTGGAC 84 60 

GTGTAAAAGA CTTACCAGGT GTGCGTTACC ATATTGTACG TGGAGCACTT GATACTTCAG 852 0 

25 GTGTTGACGG ACGTAGACAA GGTCGTTCAT TATACGGAAC TAAGAAACCT AAAAACTAAG 8580 

AATTTAGTTT TTAATTAAAT CTTAAACTTA AAATATTTAA TATAAGGAAG GGAGGATTTA 864 0 

CATTATGCCT CGTAAAGGAT CAGTACCTAA AAGAGACGTA TTACCAGATC CAATTCATAA 870 0 

30 CTCTAAGTTA GTAACTAAAT TAATTAACAA AATTATGTTA GATGGTAAAC GTGGAACAGC 876 0 

ACAAAGAATT CTTTATTCAG CATTCGACCT AGTTGAACAA CGCAGgtTCG TGATGCATTA 8 820 

GAAGTATTCG AAGAAGCAAT CAACAACATT ATGCCAGTAT TAGAAGTTAA AGCTCGTCGC 8 8 80 

35 

GTAGGTGGTT CTAACTATCA AGTACCAGTA GAAGTTCGTC CAGAGCGTCG TACTACTTTA 8 940 

GGTTTACGTT GGTTAGTTAA CTATGCACGT CTTCGTGGTG AAAAAACGAT GGAAGATCGT 90 0 0 

TTAGCTAACG AAATTTTAGA TGCAGCAAAT AATACAGGTG GTGCCGTTAA GAAACGTGAG 90 60 

40 

GACACTCACA AAATGGCTGA AGCAAACAAA GCATTTGCTC ACTACCGTTG GTAAGATAAA 912 0 

AGCTTTTACC CTGAGTGTGT TCTATATTAA TGAATTTTCA TTAAGCGTTC ATGCTTAGGG 918 0 

45 CATCGCCATA TCTATCGTAT TTATTCAGTA ATATAAACTG GAAGGAGAAA AAATACATGG 924 0 

CTAGAGAATT TTCATTAGAA AAAACTCGTA ATATCGGTAT CATGGCTCAC ATTGATGCTG 93 0 0 

GTAAAACGAC TACGACTGAA CGTATTCTTT ATTACACTGG CCGTATCCAC AArGknGGTG 93 6 0 

50 AAaCACACGA AGGTGCTTCA CAAATGGACT GGATGGAGCA AGAACAAGAC CGTGGTATTA 94 2 0 

CTATCACATC TGC7GCAACA ACAGCAGCTT GGGAAGGTCA CCGTGTAAAC ATTATCGATA 94 9 0 
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GAAGTATTCA ACAGATTTAG CATCACTGAT ACATCAATGA TGTTAGACCG TATGAAAGAC 6 000 

TTAGGATTCA AATTCTCATC TAAAGCTGGT ATTACAGTAG GTGTTGCTGA TATCGTAGTA 60 60 

TTACCTGATA AGCAACAAAT ACTTGATGAG CATGAAAAAT TAGTCGACAG AATTACAAAA 6120 

CAATTCAACC GTGGTTTAAT CACTGAAGAA GAAAGATATA ATGCAGTTGT TGAAATTTGG 618 0 

ACAGATGCAA AAGATCAAAT TCAAGGTGAA TTGATGCAAT CACTTGATAA AACTAACCCA 624 0 

ATCTTCATGA TGAGTGATTC AGGTGCCCGT GGTAACGCAT CTAACTTTAC ACAGTTAGCA 63 00 

GGTATGCGTG GATTGATGGC CGCACCATCT GGTAAGATTA TCGAATTACC AATCACATCT 63 60 

TCATTCCGTG AAGGTTTAAC AGTACTTGAA TACTTCATCT CAACTCACGG TGCACGTAAA 6420 

GGTCTTGCCG ATACAGCACT TAAAACAGCT GACTCAGGAT ATCTTACTCG TCGTCTTGTT 64 80 

GACGTGGCAC AAGATGTTAT TGTTCGTGAA GAAGACTGTG GTACTGATAG AGGTTTATTA 654 0 

GTTTCTGATA TTAAAGAAGG TACAGAAATG ATTGAACCAT TTATCGAACG TATTGAAGGT 6600 

CGTTATTCTA AAGAAACAAT TCGTCATCCT GAAACTGATG AAATAATCAT TCGTCCTGAT 6660 

GAATTAATTA CACCTGAAAT TGCTAAGAAA ATTACAGATG CTGGTATTGA ACAAATGTAT 6720 

25 ATTCGCTCAG CATTTACTTG TAACGCACGA CATGGTGTTT GTGAAAAATG TTACGGTAAA 67 80 

AACCTTGCTA CTGGTGAAAA AGTTGAAGTT GGTGAAGCAG TTGGTACAAT TGCAGCCCAA 6840 

TCTATCGGTG AACCAGGTAC ACAGCTTACA ATGCGTACAT TCCATACAGG TGGGGTAGCA 6900 

30 GGTAGCGATA TCACACAAGG TCTTCCTCGT ATTCAAGAGA TTTTCGAAGC ACGTAACCcT 6960 

AAAGGTCAAG CGGTAATTAC GGAAATCGAA GGTGTCGTAG AAGATATTAA ATTAGCAAAA 7020 

GATAGACAAC AAGAAATTGT TGTTAAAGGT GCTAATGAAA CAAGATCATA CCTTGCTTCA 70 80 

GGTACTTCAA GAATTATTGT AGAAATCGGT CAACCAGTTC AACGTGGTGA AGTATTAACT 7140 

GAAGGTTCTA TTGAACCTAA GAATTACTTA TCTGTTGCTG GATTAAACGC GACTGAAAGC 7200 

TACTTATTAA AAGAAGTACA AAAAGTTTAC CGTATGCAAG GTGTAGAAAT CGACGATAAA 7260 

CACGTTGAGG TTATGGTTCG ACAAATGTTA CGTAAAGTTA GAATTATCGA AGCAGGTGAT 7320 

ACGAAGTTAT TACCAGGTTC ATTAGTTGAT ATTCATAACT TTACAGATGC AAATAGAGAA 73 BO 

GCATTTAAAC ACCGTAAGCG TCCTGCAACA GCTAAACCAG TATTACTTGG TATTACTAAA 7440 

GCATCACTTG AAACAGAAAG TTTCTTATCT GCAGCATCAT TCCAAGAAAC AACAAGAGTT 7500 

CTTACAGATG CAGCAATTAA AGGTAAGCGT GATGACTTAT TAGGTCTTAA AGAAAACGTA 7560 

50 ATTATTGGTA AGTTAATTCC AGCTGGTACT GGTATGAGAC GTTATAGCGA CGTAAAATAC 7620 

GAAAAAACAG CTAAACCAGT TGCAGAAGTT GAATCTCAAA CTGAAGTAAC GGAATAACAA 76 80 
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AAACCTGAAA CAATCAACTA CCGTACATTA AAACCTGAAA AAGATGGTCT ATTCTGTGAA 4 2 00 

AGAATTTTCG GACCTACAAA AGACTGGGAA TGTAGTTGTG GTAAATACAA ACGTGTTCGC 4 26 0 

5 

TACAAAGGCA TGGTCTGTGA CAGATGTGGA GTTGAAGTAA CTAAATCTAA AGTACGTCGT 4 32 0 

GAAAGAATGG GTCACATTGA ACTTGCTGCT CCAGTTTCTC ACATTTGGTA TTTCAAAGGT 4 3 80 

ATACCAAGTC GTATGGGATT ATTACTTGAC ATGTCACCAA GAGCATTAGA AGAAGTTATT 4 44 0 

10 

TACTTTGCTT CTTATGTTGT TGTAGATCCA GGTCCAACTG GTTTAGAAAA GAAAACTTTA 4 5 00 

TTATCTGAAG CTGAATTCAG AGATTATTAT GATAAATACC CAGGTCAATT CGTTGCAAAA 4 56 0 

15 ATGGGTGCAG AAGGTATTAA AGATTTACTT GAAGAGATTG ATCTTGACGA AGAACTTAAA 4 620 

TTGTTACGCG ATGAGTTGGA ATCAGCTACT GGTCAAAGAC TTA CTCGTGC AATTAAACGT 4 68 0 

TTAGAAGTTG TTGAATCATT CCGTAATTCA GGTAACAAAC CTTCATGGAT GATTTTAGAT 4 74 0 

20 GTACTTCCAA TCATC CCACC AGAAATTCGT CCAATGGTTC AATTAGATGG TGGACGATTT 4 8 00 

GCAACAAGTG ACTTAAACGA CTTATACCGT CGTGTAATTA ATCGAAATAA TCGTTTGAAA 4 860 

CGTTTATTAG ATTTAGGTGC ACCTGGTATC ATCGTTCAAA ACGAAAAACG TATGTTACAA 4 920 

25 GAAGCCGTTG ACGCTTTAAT TGATAATGGT CGTCGTGGTC GTCCAGTTAC TGGCCCAGGT 4 980 

AACCGTCCAT TAAAATCTTT ATCTCATATG TTAAAAGGTA AACAAGGTCG TTTCCGTCAA 5040 

AACTTACTTG GTAAACGTGT TGACTATTCA GGACGTTCAG TTATTGCAGT AGGTCCAAGC 5100 

30 

TTGAAAATGT ACCAATGTGG TTTACCAAAA GAAATGGCAC TTGAACTATT TAAACCATTC 5160 

GTAATGAAAG AATTAGTTCA ACGTGAAATT GCAACTAACA TTAAAAATGC GAAGAGTAAA 5220 

ATCGAACGTA TGGATGATGA AGTTTGGGAC GTATTGGAAG AAGTAATTAG AGAACATCCT 52 80 

35 

GTATTACTTA ACCGTGCACC AACACTTCAT AGACTTGGTA TTCAAGCATT TGAACCAACT 534 0 

TTAGTTGAAG GTCGTGCGAT TCGTCTACAT CCACTTGTAA CAACAGCTTA TAACGCTGAC 54 00 

TTTGACGGTG ACCAAATGGC GGTTCACGTT CCTTTATCAA AAGAGGCACA AGCTGAAGCA 54 6 0 

40 

AGAATGTTGA TGTTAGCAGC ACAAAACATC TTGAACCCTA AAGATGGTAA ACCTGTAGTT 5 52 0 

ACACCATCAC AAGATATGGT ACTTGGTAAC TATTACCTTA CTTTAGAAAG AAAAGATGCA 55 8 0 

45 GTAAATACAG GCGCAATCTT TAATAATACA AATGAAGTAT TAAAAGCATA TGCAAATGGC 564 0 

TTTGTACATT TACACACTAG AATTGGTGTA CATGCAAGTT CGTTCAATAA TCCAACATTT 57 00 

ACTGAAGAAC AAAACAAAAA GATTCTTGCT ACGTCAGTAG GTAAAATTAT ATTCAATGAA 5 76 0 

50 ATCATTCCAG ATTCATTTGC TTATATTAAT GAACCTACGC AAGAAAACTT AGAAAGAAAG 582 0 

ACACCAAA^A ^A^A^^A^ ^A.T^CTA^A AC' T TTAn(T r ' r : ra^^^A^t a.AAA/^AA^A^ n, □ a - 
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CAGGTATGGA ACACGTTGCA GCACGTGATT CTGGTGCGGC TATTACAGCT AAGCACAGAG 24 00 

GTCGTGTTGA ACATGTTGAA TCTAATGAAA TTCTTGTTCG TCGTCTAGTT GAAGAGAACG 2460 

GCGTTGAGCA TGAAGGTGAA TTAGATCGCT ATCCATTAGC TAAATTTAAA CGTTCAAACT 2520 

CAGGTACATG TTACAACCAA CGTCCAATCG TTGCAGTTGG AGATGTTGTT GAGTATAACG 25 80 

AGATTTTAGC AGATGGACCA TCTATGGAAT TAGGAGAAAT GGCATTAGGT AGAAACGTAG 2640 

TAGTTGGTTT CATGACTTGG GACGGTTACA ACTATGAGGA TGCCGTTATC ATGAGTGAAA 2700 

GACTTGTGAA AGATGACGTG TATACTTCTA TTCATATTGA AGAGTATGAA TCAGAAGCAC 2760 

GTGATACTAA GTTAGGACCT GAAGAAATCA CAAGAGATAT TCCTAATGTT TCTGAAAGTG 2820 

CACTTAAGAA CTTAGACGAT CGTGGTATCG TTTATATTGG TGCAGAAGTA AAAGATGGAG 28 80 

ATATTTTAGT TGGTAAAGTA ACGCCTAAAG GTGTAACTGA GTTAACTGCC GAAGAAAGAT 2940 

20 TGTTACATGC AATCTTTGGT GAAAAAGCAC GTGAAGTTAG AGATACTTCA TTACGTGTAC 3 000 

CTCACGGCGC TGGCGGTATC GTTCTTGATG TAAAAGTATT CAATCGTGAA GAAGGCGACG 3 060 

ATACATTATC ACCTGGTGTA AACCAATTAG TACGTGTATA TATCGTTCAA AAACGTAAAA 3120 

25 TTCATGTTGG TGATAAGATG TGTGGTCGAC ATGGTAACAA AGGTGTCATT TCTAAGATTG 3180 

TTCCTGAAGA AGATATGCCT TACTTACCAG ATGGACGTCC GATCGATATC ATGTTAAATC 3 24 0 

CTCTTGGTGT ACCATCTCGT ATGAACATCG GACAAGTATT AGAGCTACAC TTAGGTATGG 33 00 

CTGCTAAAAA TCTTGGTATT CACGTTGCAT CACCAGTATT TGACGGTGCA AACGATGACG 3360 

ATGTATGGTC AACAATTGAA GAAGCTGGTA TGGCTCGTGA TGGTAAAACT GTACTTTATG 3420 

ATGGACGTAC AGGTGAACCA TTCGATAACC GTATTTCAGT AGGTGTAATG TACATGTTGA 34 80 

AACTTGCGCA CATGGTTGAT GATAAATTAC ATGCGCGTTC AACAGGACCA TATTCACTTG 3 54 0 

tTACACAACA ACCACTTGGC GGTAAAGCGC AATTCGGTGG ACAACGTTTT GGTGAGATGG 36 00 

AGGTATGGGC ACTTGAAGCA TATGGTGCTG CATACACATT ACAAGAAATC TTAACTTACA 3660 

AATCCGATGA TACAGTAGGA CGTGTGAAAA CATACGAGGC TATTGTTAAA GGTGAAAACA 3720 

TCTCTAGACC AAGTGTTCCA GAATCATTCC GAGTATTGAT GAAAGAATTA CAAAGTTTAG 3780 

GTTTAGATGT AAAAGTTATG GATGAGCAAG ATAATGAAAT CGAAATGACA GACGTTGATG 3840 

ACGATGATGT TGTAGAACGC AAAGTAGATT TACAACAAAA TGATGCTCCT GAAACACAAA 3900 

AAGAAGTTAC TGATTAATAC GCAATTTACA AAACAGGCAA AAAGATACTA AGCTGAATTT 3 9 60 

50 TATTGATGAT TCAGTTTAGT ACTTTAAGCC ATTTTAAATA AATGCAAATC AATCAAATAG 4 020 

CACAGCTAAT CTAAATTGAA GGAGGTAGGC TCCTTGATTG ATGTAAATAA TTTCCATTAT 4 080 
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CTGGTAATTT GTCATTAGAG 


TTTGTGGATT 


ACCGTTTAGG 


AGAACCAAAA 


TATGATTTAG 


600 




AAGAATCTAA 


AAACCGTGAC 


GCTACTTATG 


CTGCACCTCT 


TCGTGTAAAA 


GTGCGTCTAA 


660 


5 


TCATTAAAGA 


AACAGGAGAA 


IjI IAAAGAAC 


AAGAAGTCTT 


TATGGGTGAT 


TTCCCATTAA 


720 




TGACTGATAC 


AGGTACGTTC 


Ul IATCAATG 


GTGCAGAACG 


TGTAATCGTA 


TCTCAATTAG 


780 


10 


TTCGTTCACC 


ATCCGTTTAT 


I i uAATGAAA 


AAATCGACAA 


AAATGGTCGT 


GAAAACTATG 


840 


ATGCAACAAT 


TATTCCAAAC 


LA> I GGTGCAT 


GGTTAGAATA 


TGAAACAGAT 


GCTAAAGATG 


900 




TTGTATACGT 


ACGTATTGAT 


AUAACACGTA 


AACTACCATT 


AACAGTATTG 


TTACGTGCAT 


960 


15 


TAGGTTTCTC 


AAGCGACCAA 


GAAATTGTTG 


AC CTTTTAGG 


TGACAATGAA 


TATTTACGTA 


1020 




ATACTTTAGA 


GAAAGACGGC 


ACTGAAAACA 


CTGAACAAGC 


GTTATTAGAA 


ATCTATGAAC 


1080 




GTTTACGTCC 


AGGTGAACCA 


CCAACTGTTG 


AAAATGCTAA 


AAGTCTATTG 


TATTCACGTT 


1140 


20 


TCTTTGATCC 


AAAACGCTAT 


GACTTAGCAA 


GCGTGGGTCG 


TTATAAAACA 


AACAAAAAAT 


1200 




TACATTTAAA 


ACATCGTTTA 


TTTAATCAAA 


AATTAGCTGA 


GCCAATTGTA 


AATACTGAAA 


1260 




CTGGTGAAAT 


TGTAGTTGAA 


GAAGGTACAG 


TGCTTGATCG 


TCGTAAAATC 


GACGAAATCA 


1320 


25 


TGGATGTACT 


TGAATCAAAT 


GCAAACAGCG 


AAGTGTTTGA 


ATTGCATGGT 


AGCGTTATAG 


1380 




ACGAGCCAGT 


AGAAATTCAA 


TCAATTAAAG 


TATATGTTCC 


TAACGATGAT 


GAAGGTCGTA 


1440 




CGACAACTGT 


AATTGGTAAT 


GCTTTCCCTG 


ACTCAGAAGT 


TAAATGCATT 


ACACCAGCAG 


1500 


30 


ATATCATTGC 


TTCAATGAGT 


TACTTCTTTA 


ACTTATTAAG 


CGGTATTGGA 


TATACAGATG 


1560 




ATATTGACCA 


TTTAGGTAAC 


CGTCGTTTAC 


GTTCTGTAGG 


TGAATTACTA 


CAAAACCAAT 


1620 


35 


TCCGTATCGG 


TTTATCAAGA 


ATGGAAAGAG 


TTGTACGTGA 


AAGAATGTCA 


ATTCAAGATA 


1680 


CTGAGTCTAT 


CACACCTCAA 


CAATTAATTA 


ATATTCGACC 


TGTTATTGCA 


TCTATTAAAG 


1740 




AATTtTTTGG 


TAGCTCTCAA 


ITATCACAAT 


TCATGGACCA 


AGCAAACCCA 


TTAGCTGAGT 


1800 


40 


TAACGCATAA 


ACGTCGTCTA 


I LACjCATTAG 


GACCTGGTGG 


TTTAACACGT 


GAACGTGCTC 


1860 




AAATGGAAGT 


ACGTGACGTT 


LftL I AUTCTC 


ACTATGGCCG 


TATGTGTCCA 


ATTGAAACAC 


1920 




CTGAGGGACC 


AAACATTGGA 


1 i.\iA 1 1 AAL I 


CATTATCAAG 


TTATGCACGT 


GTAAATGAAT 


1990 


45 


TCGGCTTTAT 


TGAAACACCA 


I AI L(j I AAAG 


TTGATTTAGA 


TACACATGCT 


ATCACTGATC 


2040 




AAATTGACTA 


TTTAACAGCT 


UALCjAAGAAG 


ATAGCTATGT 


TGTAGCACAA 


GCAAACTCTA 


2100 




AATTAGATGA 


AAATGGTCGT 


TTCATGGATG 


ATGAAGTTGT 


ATGTCGTTTC 


CGTGGTAACA 


2160 


50 


ATACAGTTAT 


GGCTAAAGAA 


AAAATGGATT 


ATATGGATGT 


ATCGCCGAAG 


CAAGTTGTTT 


2220 




CAGCAGCGAC 


AacATGTATT 


CTATTCTTAG 


AAAATGATGA 


rTCAAACCGT 
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CTTCATTCAC GACCTTTCTT AAATAAAAAT CCTAATCATT TAAATACTGA CGTTGTATTA 34 80 

GTCTTATACC AATATCGACA GTCTATATCT ATTACAAACT TTTATTTTCA AAATATTATT 3540 

TAGAAACTTT GCG TTCAATT ACTTCTCTCA ATTGACGTTT AACGTCTTCG ATAGGTAATT 3600 

CACGTACTAC TGGATCTAAG AAACCATGTA TAACAAGACG TTCCGCTTCT CTTTGAGAAA 36 60 

TACCACGACT CATTAAATAG TAAAGTTGAT CTGGATCAAC ACGACCTACT GATGCAGCAT 372 0 

GACCAGCTTG TACATCATCT TCATCAATTA ATAAAATAGG ATTCGCGTCA CCACGAGCAT 37 8 0 

GTTCAGATAA CATTAATACA CGTGATTCCT GATTAGCAAT TGATTTAGTT CCACCATGCT 3840 

TAATGTAGCC GATACCATTA AATACAGACG ATGCATGTTC TTTCATAACA CCATGTTTAA 3 900 

GGATATAACC ATCTGTTTCT TTACCATATT GTACGATTTT AGATGTTAGA TTAATTTTTT 3 960 

GTTCGCCTGT ACCTACAACT ACTGATTTAA GTGAACTTGT TGAACGATCA CCAAATAAAT 4 020 

20 TTGTTGTATT ATCAATAATT TGGCTACCCT CATTCATTAA ACCTAGTGCC CAATTAATTG 4 080 

AGGCATCCGC TTCAGTAATA CCACGTCGAA TGATATGACC TGTAAAGCCT TTATCCATAT 414 0 

AGTCCACTGA GCCATATGTG ATATTTGAAT TTGCACCAGC AATCACTTCA GAAATAATAT 42 00 

25 TtAATTGATT TCCTTCACCA GATGCATTTG mTAAGTAATT TTCAACATAT GTGACTTCGG 4260 

CGCTTTCTTC AGTAACGATG 42 8 0 
(2) INFORMATION FOR SEQ ID NO: 82: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



~~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TCnGACTCGA ACGGTGmAAC TAttCCGTTG TaATTCCgGA GgAAsCAAGG TATGCCCATC 60 

TGCaAAGAAA gaATGsAATG AACTTTTTGG AAATGTAGAA GTGGTAAATA AAGATAAAGG 120 

ATATTACATT CTGAGAAGTA TAAAAGCTTG AAATGAAATG GATATTCTGT TATAGTTATA 180 

TAATGTAAAA ATTTATGTTC AATAAGTGTG TACTTTTACG TTAAATAGAT AAGTTAATTA 240 

AGAATAAATA TAGAATCGAA AATGGTGTCA TCATTAGTGT TGCCGTTTTC TTTTTGTCTT 300 

TTTATTAATA TGCTTATGGT ATTTAGCTAA AAGCGGATCA CATAATTTTT GAGGGGTGAA 360 

TCTGTTTGGC AGGTCAAGTT GTCCAATATG GAAGACATCG TAAACGTAGA AACTACGCGA 420 

GAATTTCAGA AGTATTAGAA TTACCAAACT TAATAGAAAT TCAAACTAAA TCTTACGAGT 480 



55 
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TTTAATTCAT GATGTAAACC ATATTATAAC 
ACTTTTATAT GTCTAATATC AAAATTATCT 
5 CGTACCTTCT GCTTTACCTT CTTTAGCAAC 

TGTGGCACAT TTAATACGAG CTGGGAATTG 
CATTTCTTCT GTAATCACAT AGTCTTCACC 

10 

TTGCATTGCT TCTCCAAGTG AATGACCTTT 
CATTGAAATC GAACAACCTT CACCTTCAAA 
ATCAAATGTT AGTCGTATAC GGTCACCGCA 

15 

CCCGTTATCT AATACACCTT TATTTCTAGG 
ATATAATTGA TCTAGATTAT TAAAATTCAT 

20 CATTTATTAA CTGATCAACG TCTTCTTTCG 
TTGAAGACAC ATTTAACCAT TTCATTAACG 
CTACACCTTC TGTATCTACG GCTGTAGCAA 

25 ACGTTATTAC ACCTGCACGA CGATCCTTTG 

CAGACATTTG CTCATAAGCA TAT AT CGTTA 
AACCTATGCG TTCTAAATAG CGAATAGCTT 

30 GAGTACCCGC CTCAAATTTA GTAGGTAAAT 

AATCAATCAT GTCGCCACCG AACTCAATCG 
CAAATAATAC GCCAATACCT GTTGGTCCAA 

35 

CAGCATTCAT TTCTTGCATA TCAAGTTTCA 
TGATSATTGC ACCATGTTGA TGAGCTATTT 
CGAGCACATT AGATATATGT GCAATAGCAA 

40 

TAATATCCTC GATGTTTAAT TCACCGTCAG 
TTTTACGCTT TGCTAACTGT TGCCAAGGAA 

45 CAACAATTTC ATCGCCCTCT TCAACATTTG 

TCGACGCAGT TGTTCCGCGT GTAAAAATGA 
GACGAACGGT TTCACGGGCA TTTTCATAAC 

50 CACCACGATG AACGTTTGAA TTATAACGCT 
GCACAGGCGT TTGACTTGTC GCTGTTGAAT 



AATGACATGA CATCTTATAA AAATTTTTAT 16 8 0 

ATGATTAACA GCATTCTATT CTTCTTCAGT 174 0 

AGTACCTTTT TCCAATGCTT TCCAAGCTAA 18 00 

AGATACACCT TGCAATGCTT CAATATCTCC 1860 

AAGCATCATT TTCGTAAATT CTTGGCTCAT 192 0 

AACAGCTTGT GTCATCATCG ATGCACTTGC 198 0 

CTTAGCATCT TTTATAATGC CGTCTTCTAT 2 04 0 

TGTCGGGTTA TTCATATCTA CTGTCATAGA 2100 

ATTTTTATAA TGATCCATAA TGACAGATCT 2160 

AAGAGAAAAA CTCCTTCGTT TGTTTCAAGG 222 0 

TGTTGTATAT ATAAAAACTC GCTCTAGCTG 22 80 

GTTGCGCACA ATGATGCCCA GCTCTAACCG 234 0 

CATCGTGTGG ATGTACATCT TGTAAATTAA 24 00 

GCGGGCCATA AATTTCAATT CCTTCAATTG 24 6 0 

ATTCTTGTTC ATATTTATGA ATTGCATCAA 252 0 

CTGCAAGCCC AATTGCTTGA GCAATTAATG 258 0 

CAGCCCATGT TGCATCATAC TTACTTACAA 2 64 0 

GTTCCATTTT TTGTAGTAAC TCACGTTTAC 270 0 

GCATTTTATG ACCACTAAAA CTATAAAAAT 2 76 0 

TATGTGGTGC TGctTGCGCC CCATCAACAC 2 82 0 

CTGCAATGGT TTTAACATCA TTAATTGTAC 2B8 0 

CGATCTTTGT TTTATCATTA ATCGTTTGCT 2 94 0 

CTGTCATTGG TATAAATTTC AATGTCGCAT 3 00 0 

CAATATTGGC ATGATGTTCC ATTTCAGTGA 3 06 0 

CATC AC CAT A GCTATGTGCT ACAAGGTTAA 312 0 

TTTCTTCAAA ATACTTCGCA TTAATAAAAC 3180 

CATCAGTTGC CAATGATCCT AATGTATGAA 3 24 0 

TGTAGTAATC TTCTAAAACA TTTAACACTT 3 300 

CAAGATATGC TAAACGTTTG CCATTGACTT 3 36 0 
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20 



25 



<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

TTTACACCAA TCAAAAAATC GAACTGATAT AAATAAGTAC AAAGCTTATC TATCAATCCG 6 0 

ATTTAGTTAT AAAACAAAAA AAGCCACAGT AATGTGGCTT TTTGTTATAT TCAGTATCAA 120 

AATGGTATCA ATAGCCATTT TCGGAAGTCA AGAATGGCTT AACAACGCGG TTTAAAGCTA 180 

TCCAATACTA CCTTCCATTT CGAACTTGAT TAAACGGTTC ATTTCGACCG CGTATTCCAT 240 

15 TGGAAGTTCT TTTGTAAATG GTTCGATGAA TCCCATAACA ATCATTTCTG TCGCTTCTTC 300 

TTCAGAAATA CCACGACTCA TTAGATAGAA TAATTGTTCT TCAGAAACTT TTGAAACCTT 3 60 

GGCTTCATGT TCTAATGATA TTTGATCGTT GAATACTTCG TTATATGGAA TTGTATCTGA 4 20 

TGTTGATTCG TTATCTAAGA TTAATGTATC ACATTCAATA TTTGAACGAG CACCTTTTGC 4 80 

TTTACGTCCA AAATGAACAA TACCGCGATA AATAACTTTA CCACCATTTT TAGAAATAGA 540 

TTTAGAAACA ATTGTAGAAG ATGTATTAGG TGCTTTATGA ATCATTTTAG CACCGGCATC 600 

TTGAACTTGT CCTTTACCAG CAAATGCAAT AGATAATGTA CTACCTTTTG CACCTTCACC 660 

TAAAAGAACA CAGTTTGGAT ATTTCATCGT TAACTTAGAA CCTAAGTTAC CATCTACCCA 72 0 

TTCCATATTT CCGTTTTCAT AAACAAAAGT ACGTTTTGTA ACTAAATTGT ATACATTGTT 7 80 

CGCCCAGTTT TGAATCGTAG TATAACGAAC GTGCGCATCT TTATGCACAA TGATTTCCAC 84 0 

AACAGCAGAG TGTAAAGAAC TAGTTGTATA AACTGGTGCA GTACAACCTT CTACGTAATG 9 00 

TACAGAAGCA CCTTCATCAG CAATGATTAA TGTACGTTCA AATTGACCCA TGTTCTCAGA 96 0 

GTTAATACGG AAATAAGCTT GTAGTGGCGT ATCTAGTTTG ATATTTTTAG GTACATAAAT 102 0 

GAAGGAACCA CCTGACCATA CTGCTGAGTT TAACGCCGCA AATTTGTTAT CTGCTGCAGG 1080 

TACTACAGAA GCAAAGTATT TTTTGAATAA TTCTTCATTT TCTTGTAAAG CACTATCTGT 114 0 

ATCTTTAAAG ATAATACCTT TTTCTTCAAG TTCTTTTTCC ATATTATGGT AAACAACTTC 12 00 

AGATTCATAT TGAGCAGAAA CACCAGCTAA ATATTTTTGT TCAGCTTCAG GAATTCCTAA 12 60 

TTTATCGAAA GTTCTTTTAA TTTCTTCTGG CACTTCATCC CATGAACGTT CAGCTTGTTC 132 0 

TGAAGGCTTT ACATAGTAAG TAATGTCATC GAAATTCAAT TCTGATAAGT CGCCACCCCA 13 8 0 

TTGAGGCATT GGCATTTTAT AAAACAATTT TAATGATTTA AGACGGAAAT CTAACATCCA 144 0 

TTCCGGCTCA TTTTTCATGT TAGAAATTTC TCTAACGATA TTCTCAGTTA AACCACGTTC 1500 

TGATCTGAAA ATGGACACAT CATCGTCGTG GAATC CAT AT TTATAATCCC CAACATCAGG 1560 
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Ci) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

10 



15 



20 



25 
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50 



TGGTCGTCAA 


TTTCTTGATT 


ATATCTATAA 


TCCTCATTTT 


CAATATTAGA 


GTCTGTAGAA 


60 


TCATCGATAT 


TATTATCATT 


CGCATGACTA 


GAAGCAGAAT 


CATTATTTTT 


ATCATTGCTT 


120 


TCTTCTTTTT 


TGAAGTCTTT 


ATTTATCAAG 


TAAATTTCTT 


CATCAAAATC 


AGCTTGTTGA 


180 


GATGTATCAT 


CTTTATTTTG 


ATTAGAAAAA 


TGTGTAGCCT 


TTGATCTTTT 


TCTTTGCCGT 


240 


CTTTTCTTAG 


ATGTATTCCT 


CGTAAATAAT 


TCTAATTCAT 


CTTTATCTTC 


ATTTGATTCT 


300 


TGTTGATCGT 


TCTTCGTTTT 


ATCATCCATC 


AATACTCACA 


CCCTTTAATA 


AGATGGTAAA 


360 


TGGGCACGGA 


ATCTTTCAAT 


AAATTTCTCT 


CCACGCTCTT 


CAAAAGTACT 


ATATTGATCC 


420 


CAACTCGCAC 


AAGCAGGTGA 


CAATAATACA 


ACATCATTTG 


GTTCTATAAT 


ATCTTGTACT 


480 


TTATCAACAG 


CGTCTTCGAC 


ATTGTTCGCT 


TCAATGACCG 


ATTTCCCTTG 


ACTATTACCT 


540 


AGTTTAGCAA 


ACTTAGCTTT 


CGTTTGTCCG 


AATACAACCA 


TCGCGCGAAC 


ATTTTCCATA 


600 


TAAGGAATGA 


GTTCGTCAAA 


TTCATTCCCT 


CGATCCAAAC 


CACCACATAA 


CCAAATGATT 


660 


GGTTGATTAA 


ATGAATTTAA 


GGCAAACTGT 


GTTGCTAGCG 


TGTTTGTTGC 


TTTGGAATCA 


720 


TTATAATATT 


TATTAGTTCT 


ATTAGTACCA 


ACATATTGCA 


ATCTATGCTC 


TATTCCTGAA 


780 


AATGTAGTTA 


AACTATCAAT 


AATTGCtTTA 


ATAGGTACAC 


CAGCanAATA 


CAAGCAAGCA 


840 


CAGCTGCTAA 


TATATTTcTA 


AATTATGTTC 


ACCAGGCAAT 


ACTAGAtCTT 


CAGTGTTAAT 


900 


AATa CBAACA 


CCTTTATaAA 


CGATAAAACC 


ATCTTtAATA 


TAAaTACCAT 


CArCTtCTTG 


960 


TTGAGTTGAG 


AAATACAATG 


TCTTAGCTTT 


TAATTCTTCC 


GACTCTATCA 


CTTGTCTTTG 


1020 


ATGATAATTA 


CAAATCAAAT 


AATCCTCTTC 


CGTTTGATTT 


TTATATATTT 


GCTTTTTAGC 


1080 


ATTTTGATAG 


TTTTCTAAAT 


TTTCATGGTA 


ATCTAGATGC 


GCCGAATAAA 


TGTTAGTAAT 


114C 


TATAGCAATG 


TGTGGTTTAT 


ACTTTTCGAT 


TCCAAGTAAC 


TGGAATGACG 


ACAACTCTGT 


120C 


AACTAAATAA 


TCTGTAGGCT 


TTACTTCTTG 


TGCTACTTTA 


GATGCAACAT 


AACCAATATT 


1260 


GCCGG AT AAT 


CTTCCAGTTA 


AGCGACTTTT 


TTTAAACATA 


TCTCCAATTA 


GAGAAGTAAC 


1320 


(2) INFORMATION FOR SEQ ID NO: 81: 
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TAACAAATAA AGGTGCGTTA TTAATAACAG TGCCAGGCAA AAATGATGAA GTACAACGCT 18 0 

GTATTACTGC TCATGTTGAT ACTTTAGGTG CaATGGTTAA AGAAATTAAA GAAGATGGTC 24 0 

GCTTaGCAAT AGAATTAATT GGAGGATTCA CGTATAACGC GATTGAGGGT GAATATTGCC 3 00 

AAATTAAAAC TGATGCTGGT CAAATATATA CAGGAACAAT TTGTCTGCAT GAAACAAGTG 360 

TTCATGTATA TAGAAATAAT CATGAAATAC CTAGAGATCA AAAGCATATG GAAATAAGAA 4 20 

TTGATGAAGT AACTACATCA GAAGAAGATA CAAAGAGTTT AGGTATTTCA GTAGGTGATT 480 

TTGTTAGCTT TGATCCACGT ACAGTTATCA CGTCATCAGG TTTTATTAAA TCTCGTCATT 540 

TAGATGATAA AGCTAGCGTA CGgTtGATAC TACAATTACT AAAGAAATTA AAAGAAGAGC 600 

AAATAATATT ACCACATACA ACGCAATTTT ATATTTCTAA TAACGAAGAA ATAGGTTACG 660 

GTGCAAATGC ATCAATTGAT TCGAAAATCA AAGAATATAT TGCATTAGAT ATGGGCGCGT 720 

20 TGGGAGACGG TCAAGCATCG GATGAATATA CAGTTTCTAT TTGTGCCAAA GATGCTTCAG 780 

GTCCATATCA TAAGCAATTG AAATCGCACC TAGTTAATCT TTGCAAAATA AATAACATTC 840 

CATATAAAGT AGACATATAT CCATATTATG GTTCAGATGC TTCAGCAGCT TTACATGCTG 900 

25 

GTGCGGATAT CAGACATGGT TTATTTGGCG CTGGCATTGA ATCATCTCAT GCAATGGAAC 960 

GAACACATAT TGATTCTATT AAAGCGACAG AGAAATTACT ATATGCATAT TGCTTATCAC 1020 

CAATTGAGTA AACAATTAGT GTTGACAAAT GTGaACGACC TATGTAATAT AATGAACTAT 1080 

AAAAATAATT AGAATTTTCT AAAGAAATAG TAGCAGATAT GAAACGTAGC AAATAGAAAG 114 0 

CTAATGGGTG ATGGGAATTA GCACGCCATA TCTTGTGAAT TGGACTTTGG AAAACAATTG 12 0 0 

AATGAGTTTT GAAAGTGAAC ATGAATTATG TTAACTAAGG TGGCACCACG GTAACGCGTC 1260 

CTTACAGGTA TATGCGTTAT GTGGTGTCTT TTTATTTAGA CAAAATGTAG TAGTTAATTA 1320 

AAGGTAGCAA CAGAAAGTTA GTGGATGATG TGAACTAACA CCGAGATTAA TGAAATTGGG 13 80 

40 TTTTGTCTGC AACAGAAAAA TTATATATAG TAAAGAGTGA ACTATGAATA TTTCGAATAT 144 0 

TCGGTTAATT TAGGTGGTAC CACGCGTCAc nTCCTTTATA TTGATAAGGA TGCTGGCGCT 1500 

TTTTTGAAAG GAGCGTATAG AATGGATATA TTTTATAAAA AAATAAAAGC AAATGTAACG 1560 

CCCGAAGTTT TAGCACAACT TCATTCCAAG AAGaTCATTT TGGAAAGTAC AAATCAACAA 162 0 

CAAACTAAAG GTCGCTATTC AGTTGTTATT TTTGATATTT ATGGCACTTT AACTTTAGAT 16 80 

AATGATGTAT TATCAGTAAG TACTTTAAAA GAATCGTATC AAATCACTGA AAGACCGTAC 174 0 

CATTATTTAA CGACTAAnAT AAATGAAGAC TACCATAATA TTCCAAGATG AGG CAACTTA 1800 

AGTCATTA 1808 
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35 



CCAAGTGCTG 


ATGCTGAGCT 


TAATGAAATC 


CAGATAATCA 


TAATTGGTGA 


AATGACCATC 


3180 


ATCATGTAAC 


CCATTTGACG 


TTGTTCGTCT 


GGCATCGTTT 


TACTTGATAC 


ATATGCTTGG 


3240 


ATAAAGTATA 


AAACACCGGC 


AATAATTGTA 


ATCCAAATAT 


CAGGACGTCC 


TAAATCGAAC 


3300 


CATAAGAAGT 


GTGGATATTT 


AAACAAACCA 


TCTACAAGTT 


GGTCTTTAAG 


TACAAAGTAT 


3360 


AATCCCATGA 


TGATTGGTAA 


TTGGATTAGC 


ATTGGTAAAC 


AACCCAACAT 


ACTCTTAATC 


3420 


GGGTTCATGT 


catacttttt 


ATATACTTGC 


ATTAATTCTT 


GGTTTGCAGC 


CATTTTTTCT 


34B0 


TCTTGTGTAC 


GCGnCaCGTT 


CACTTTTTCT 


TGAATTTTTT 


CAACTTCTGG 


CTTTGCAACT 


3540 


TTCA'i"I"i"i"i"r 


GACGCATCAT 


ATGACTATTT 


TTATAGTTTG 


ACAACATGAA 


TGGTAATAAA 


3600 


ATAATACGAA 


TTACCAATAC 


AAGGATAATA 


ATAGCTAAAC 


CATAATTGTC 


GTTTAATAAG 


3660 


TTATTTCCCA 


ACCAATCCAA 


TACATTTTTC 


ATTGGATCTA 


CGAATGTATT GTAGAAAAAy 


3720 


rwCr. ACGTTT 


TTCAGGTTTA 


GAATAGTCAC 


AACCAGCCAA 


AAAGACCATA 


ATACCTAAAA 


3780 


ATAATGGTAG 


TAACGCTTTT 


TTCTTCATTT 


TTCCACCTCT 


ATCATTATAT 


TCACATAGGA 


3840 


TTTATTCTAT 


CACATTAATG 


AGTACGTATG 


AAACAATAAG 


TGGAAAAATT 


TAACTAATTA 


3900 


TTAAAAAAAT 


CTTTGAATCG 


ATTAACAGTC 


TTTTCAATAT 


TTTCACTTTT 


AGAAATGGCT 


3960 


GAAATGACTG 


AAATTCCATT 


GGCACCTGCT 


TCTACAATCG 


GCGCCACATT 


ATTAGTATTG 


4020 


ATACCGCCAA 


TAGCTACAAT 


CGGTAGTTGC 


GGATTCATTT 


CTTTAAACGT 


TGCAATCATT 


4080 


TCTGGACCTA 


CTGGTATATG 


CGCGTCATGC 


TTCGACGGCG 


TAGGATAGAT 


TGGTCCAACA 


4140 


CCTATATAAT 


CmACATGAGT 


TAAATCAGAT 


TTTGCATACT 


CATCTAAATC 


AGTAATACTA 


4200 


AGTCCAATAA 


TTTTATCAGT 


GAAATATTGT 


GCTATCTCTT 


TGACTTTCGC 


ATCATCTTGA 


4260 


CCGACATGTA 


TACCATCCGC 


GTTAATTTCT 


TTTGCCAAGG 


ATACATCATC 


ATTAACGATA 


4320 


AAAGGCACAT 


CATATTGATG 


ACAGAGATGC 


TGTAATTCTT 


TAGCTAATAC 


AAGTTTATCG 


4380 


TTTCtTTTTA AAGCTGATTC 


ACC 








4403 


(2) INFORMATION FOR SEQ ID NO: 79 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



50 



(xi } SEQUENCE DESCRIPTION: SEQ ID NO: 79 
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AAATTAAAAC AAG CTTTT AT AGATGATTTA GCAGTATCTT CTGAATTAAC AAAAGCACGT 13 8 0 

TATGCTAAGC GAAGTCTTTG GATTAAATTT AAAGAAGGTA TTTCACAATT ATTGTCACCT 144 0 

ATCTTATAAA ATAGAAATAT GAGGAGTGTA aCTTTAATGC AACAATCAGA CGTCATTAGT 1500 

GCTGCCAAAA AATATATGGA ATCTATTCAT CAAAATGATT ATACAGGCCA TGATATTGCG 1560 

CATGTATATC GTGTCACTGC TTTAGCTAAA TCAATCGCTG AAAATGAAGG TGTTAATGAT 1620 

ACTTTAGTCA TTGAACTCGC ATGTTTGCTT CATGATACCG TTGACGAAAA AGTTGTAGAT 168 0 

GCTAACAAAC AATATGTTGA ATTGAAGTCA TTTTTATCTT CTTTATCACT ATCAACCGAA 174 0 

GATCAAGAGC ACATTTTATT TATTATTAAT AATATGAGCT ATCGCAATGG CAAAAATGAT 1800 

CATGTCACTT TATCTTTAGA AGGTCAAATT GTCAGGGATG CAGATCGTCT TGATGCTATA 186 0 

GGCGCTATAG GTGTTGCACG AACATTTCAA TTTGCAGGAC ACTTTGGTGA ACCTATGTGG 192 0 

20 ACAGAACATA TGTCACTAGA TAAGATTAAT GATGATTTAG TTGAACAGTT GCCACCATCT 198 0 

GCAATTAAAC ATTTCTTTGA AAAATTACTT AAGTTAGAAT CTTTAATGCA TACAGATACG 2 040 

GCGAAGATGA TTGCTAAAGA ACGTCACGAC TTTATGATGA TGTACTTGAA ACAGTTTTTT 210 0 

ACGGAATGGA ATTGTCACGA CTAGACATTG AAGTTGTAGT ATGATGATGC GATGTAATGG 216 0 

CGTGTTGTTG TGGAAGCTTG GTGTCATGCC ATGTTACTTT GATGTGTTGT TGTGGGAGCT 2220 

TGGTGACATG TCATGCTACT TTGATGTGCT GGTACCACGA TGCGTCTTGA TGTAGTGCTA 2280 

TGATGTGGCA TTGCGGTGTT ATGGTGTTAT AGACAGGTTT GGCGTTGATG CCATGTTACT 234 0 

TTGATGTGCT GGTACCACGA TGCGACTTGA TGTAGTGCTA TGATGTGGCA TTGCGGTGTT 24 00 

ATGGTGTTAT AG AC CGGTTT GATGTTGATG CCATGTTACT TTGATGTGCT GGTGCTACGA 2460 

TGCGACTTGA TGTAGTGCTA TGATGTGGCG TTGCGCTGTT ATGGTGTTAT AGC CAGGTTT 2 520 

GGTGJTGATG TCATGCCGTT ACGATTCTAT GATATGTTGT TGGGACGTTG CAATGTGTAT 2580 

TATGCCGTTG TGACGTTATT ATTTCACACT GTTACATGTA TAAGTGAATT GCTGTGGAAA 264 0 

TTTGCGACAT ATACTGCTAC ACTGATGAAT CATTGTGTCA AGATGACATT GCGATGAAGA 2700 

ATGACAACTC TGTTATTAAC CACTTTTTAC ATACTGAAAA CTCGTTAATA TTATTTCAAA 2760 

45 TAAAAACAGC AGTAGGATGA CTTTCACATT TGAAATCATC TTACTGCTGT TTCTATTTAT 2 820 

CACATATTGT ATAATGTGAC ACTAAGTTTC GCTATTGAAG CGAAAAATAA TGTGCGCCCT 2 8 80 

ATAAAGTTAA AATTATCTTC AACTTTTAGG GTGCACATTA TTTGGACTTG CTAAGGTTAT 2 94 0 

TTCTTTTTCT TTTTAGACAC AACTTGTGTG TTTTTGCCTT TTTTATTGct GCCGCCGTTG 3 000 

TGCTCTCTTT CATACGCTTC AATGAAAGGT TGTACTTCTT TTTTAGCGAC TTTTTCATAA 3 060 
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TAAGTGCAGC AGGAAATGCT AAAGTTTTGA ACTTTGCTTC TAATATAGGT GCGCTTGTAT 13 30 

TATTTATGGT ATTAGGACAA GTAGATTATG TAATAGGTTT AATTATGGCT A 14 31 

5 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4403 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: \ 

AATATTATTT TAAATTCAAT ATTTATTGGT 
ACCATTATTT TCATGGAAAG ACGTTCTGCC 

20 GTTTTCITCO C" '** '* ^ 'T^T'f^vi/^ ^ * i ** i a 't** r* i t± 

GACCAAATTT TCAAAATTGA TAAGGAAGAT 
CAATTAGCTG CTTTAAAAAA TGAAAACTTT 

25 

AAAGAAATGA TTCAAATGTT GTTATATAAT 
TTArrrrtAT ACACAGACGG CCAAGAAAAA 
GCTACTGATT ATATTCATTT TCAGTACTAT 

30 

ATTTTAAATG AACTTGGTAA AAAAGCGGAA 
GACATGGGTT CTCGTGGACT GCGTAAAAAA 

35 CATGCTGAAG CATTTTTCCC ATCAAAATTA 

AACCATCGAA AAATTGTTGT AATAGATGGG 
GGTGATGAGT ACTTAGGTAA ATCAAAAAAA 

JO ATTGTCGGGG ATGCAGTGAA TGCATTGCAA 

GCCACACGTG ACCACATCTC CTATGATGAT 
ACAATTGGCG TTCAAATAGC TTCTAGTGGT 

45 

GGCTATTTGA AAATGATTTC ATCTGCTAAA 
ATACCTGATC AAGCCTTTTT AGATTCTATT 
AATATCATGA TTCCTAATAA ACCTGACCAT 

50 

GCAGCATCCT TATTAGATGC CGGTGTTAAA 



SEQ ID NO: 78: 

GCATTTATTT TAAACTTATT ATTCGCCTTT 6 0 

AATTCTATCT GGGCTTGGTT ACTAGTCTTA 12 0 

TACTTACTAT TAGGACGACA AATTCAACGT 180 

AAAAAAGGAT TAGAGTTAAT CGTTGATGAG 24 0 

TCAAATTCCA ATTATCAAAT TGTAAAATTT 3 00 

AACGCAGCAT TTTTAACAAC AGACAACGAT 3 60 

TTTGATGACC TAATACAAGA CATCCGTAAT 42 0 

ATTATTCAAA ATGATGAATT AGGTCGTACC 4 80 

CAAGGTGTAG AAGTTAAAAT TCTTTATGAT 54 0 

GGCTTACGCC CGTTTCGCAA TAAAGGTGGA 6 00 

CCTTTAATTA ACTTGCGTAT GAACAATCGA 66 0 

CAAATTGGAT ATGTTGGTGG TTTTAATGTT 72 0 

TTCGGCTATT GGCGAGATAC GCATTTACGA 78 0 

TTACGATTTA TTCTAGATTG GAATTCACAA 84 0 

CGTTATTTCC CAGATGTAAA TTCTGGTGGA 900 

CCTGACGAAG AATGGGAACA GATTAAATAC 96 0 

AAATCGATTT ATATTCAATC TCCCTATTTC 10 2 0 

AAAATTGCGG CATTAGGTGG TGTTGATGTC 10 8 0 

CCGTTTGTTT TTTGGGCTAC TTTAAAAAAT 114 0 

GTATTTCACT ACGACAATGG CTTTTTACAC 12 0 0 
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GATGGATTAA TCGATAAACA TATTATCGAA GCAGATGCGA AAAAAGATAT CCGTATGGAT 1260 
GAAATAATGA CATTTATCM TAGTGATTAT ATTCGATATT GCTGAAGC 1308 
5 (2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1431 base pairs 

(B) type: nucleic acid 

10 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: i 

GATGCCATTn ATnnGTATGC AAGAAGTTGT 
TGGTGGTAAG ACGTTACTAA TGCTTTCAGG 

20 AGTGATGAGA CGTGGCGTAA CAATTGAAGC 

TGATCAAGCA AAAGAAAAAG TTATTGAATT 
AATTAAATTG CATATTGTAC CATTTACAGA 

25 TCCAAGATAT ACAATGACTT CAACGAGACG 

ACATCAAATA GGGGCTTTAG CTATTGTAAA 
AACACTTCAT AGCATGTATG CAATTAATAA 

30 

ATTAACTTAC GATAAAGAAG AAATTATTAT 
ATCTATTCAA CCATTTGAAG ATTGTTGTAC 
ACCAAACTTT GATAAGGTAG TCCAATATGA 

35 

TCGTGCTGTT GAAAATATTG AAACACTTGA 
ACAG£AAACA AACCAATTAA TAAACGACTT 

4 o ACATAAGGGG ATGTTAAACT ATGGATTTGA 
TTGGTTTTAT CGCGGCG TTT ATAGATTCGG 
CAGCATTATT AGCAATCGGT CTACCACCAT 

45 GTTCGTTTGG TTCTTTAACT AGTACGATAA 
ATGTTGTTGC CAAATTATTT GGTTTTGTAT 
CAACGATGGT TCCGTCACAA ATATTGAAAC 

50 

TTATATTCAC ATTACTTAAA AAAGATTGGG 
TTAAGAAAGC CATAATATTT GCAGCACTTT 



5EQ ID NO: 77: 

TCCGGGTTCA GGTGGATTaC CAGTTGGTAC 60 

CGGTATAGAC TCACCAGTTG CTGGGATGGA 120 

GATTCATTTC CATAGTCCAC CATTTACAAG 180 

GACACGTATT TTAGCTGAAC GTGTTGGACC 24 0 

ATTGCAAAAA CAGGTAAATA AAGTTGTACA 3 00 

TATGATGATG CGTGTTGCTG ATAAATTAGT 3 60 

TGGTGAAAAC CTAGGGCAGG TAGCCAGTCA 420 

TGTAACTTCT ACTCCTGTAT TACGTCCTTT 4 80 

TAAATCGAAA GAAATTGGTA CATTTGAAAC 540 

AATTTTCACC CCTAAAAATC CAGTAACCGA 600 

AAGTGTCTTT GATTTTGAAG AGATGATTAA 660 

AATAACTAGT GATTATAAAA CTATTAAAGA 720 

TTTATAAATA AAATCCTAGA GTAAATTTAA 780 

ACTTAACGAT GATTATAATC ATAATTTTAT 840 

TTGTAGGGGG TGGCGGTTTA ATTTCTACGC 900 

CTGTGGCTTT AGGTACAAAT AAATTGGCAA 960 

AGTTTATAAG GTCCGGTAAA GTGGACTTAT 1020 

TTTTGGCATC TGCATGTGGC GCATATATTG 1080 

CTTTAATCAT CATTGCACTT TCGTCGGTGT 114 0 

GCAATACACG CACGTTTACT CAATTTACAT 1200 

TTATATTAAT CGGCTTTTAT GATGGATTTG 12 60 
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TTTAGaAATT AGTCATGAmC AAATTAAAGC AGCATTTGAC ACATTAGATG AAAAAACAAA 114 0 

ACAAGCATTA CAACAAAGTT ATGAAAGAAT TAnAGCATAT CAaGAAaGTA TtaAACAGaC 12 0 0 

GaATCAACAG TTAGAAGaAT CAGTGGaGTG tTrTGaAATA TACCATCCmC taGaAAGTGT 126 0 



CGGTATTTAT GTG X2 7 3 
(2) INFORMATION FOR SEQ ID NO: 76: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

20 GTTGATAAAT TAAAAATGTT TTTATCAGAT ATTCAAAGTT ACCAACAATA TAGTAAAGAT 6 0 

CATCCGGTGT ATCAGTTAAT TGATAAATTT TATAATGATC ATTATGTTAT TCAATACTTT 12 0 

AGTGGACTTA TTGGTGGACG TGGACGACGT GCAAATCTTT ATGGTTTATT TAATAAAGCT 180 

25 ATCGAGTTTG AGAATTCAAG TTTTAGAGGT TTATATCAAT TTATTCGTTT TATCGATGAA 24 0 

TTGATTGAAA GAGGCAAAGA TTTTGGTGAG GAAAATGTAG TTGGTCCAAA CGATAATGTC 3 00 

GTTAGAATGA TGACAATTCA TAGTAGTAAA GGTCTAGAGT TTCCATTTGT CATTTATTCT 3 60 

30 

GGATTGTCAA AAGATTTTAA TAAACGTGAT TTGAAACAAC CAGTTATTTT AAATCAGCAA 42 0 

TTTGGTCTCG GAATGGATTA TTTTGATGTG GATAAAGAAA TGGCATTTCC ATCTTTAGCT 4 80 

TCGGTTGCAT ATAGAGCTGT TGCCGArAAA GAACTTGTGT CAGAAGAAAT GCGATTAGTC 54 0 

35 

TATGTAGCAT TAACAAGAGC GAAAGAACAA CTTTATTTAA TTGGTAGAGT GAAAAATGAT 600 

AAATCATTAC TAGAACTAGA GCAATTGTCT ATTTCTGGTG AGCACATTGC TGTCAATGAA 660 

CGATTAACTT CACCAAATCC GTTCCATCTT ATTTATAGTA TTTTATCTAA ACATCAATCT 72 0 

40 

GCGTCAATTC CAGATGATTT AAAATTTGAA AAAGATATAG CACAAATTGA AGATAGTAGT 78 0 

CGTCCGAATG TAAATATTTC AATTGTGTAC TTTGAAGATG TGTCTACAGA AACCATTTTA 84 0 

45 GATAATGATG AATATCGTTC GGTTAATCAA TTAGAAACTA TGCAAAATGG TAATGAAGAT 90 0 

GTTAAAGCAC AAATTAAACA CCAACTTGAT TATCGATATC CATATGTAAA TG AT ACT AAA 96 0 

AAGCCCTCAA AACAATCTGT TTCTGAATTG AAAAGACAAT ATGAAACAGA AGAAAGTGGC 102 0 

*° ACAAGTTACG AACGAGTAAG GCAATATCGT ATCGGTTTTT CAACGTATGA ACGACCTAAA 1030 
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w 



TTCTACTGTT CTTTGTGAAA AACCACGGTA TTCAATGCCA TCATACATTC CACCAAGCAC 228 0 

ACGTGCAGTA TCTTTAGTTG TTTCTTTTTT ACCCATTTGT GATCCAGTTG GGCCTAAATA 234 0 

AGTTACATTT GCACCTTGAT CATGCGCTGC AACTTCAAAT GCACATCGCG TTCTTGTAGA 24 0 0 

ATCTTTTTCA AATAACAGTG CAATATTTTT ATTTTTTAAC ATAGGCTTTT CAGTGCCAAT 246 0 

ATATTTAGCA CGTTTTAAAT CCTCGGAGAG TGTTAATAAG GTTCTACCTC TTGTCGTGAA 252 0 

AAGTCTAATA AAGTTAAAAA ACTTCTGTTT CGTAnATTTT TCATTAAnA 256 9 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
'5 (A) LENGTH: 1273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCTGGAACCA TCCaATCGtG CaAATCtTGa AAGaGAATAC GCAACAACAA TTAAATGTAT 6 0 

25 TGGAACACTA TATTCCAAAT GACCATCCAG CACTCGTTGA ATTAAAAATA TGGGAACGTT 120 

GGTTACATAA ACAAGGTTAC AAAGACATCC ATTTAGATAT TACTGCGCAC CACCTAGATC 18 0 

CTATTACACA GGTTTATTTA TTCAATGTCA TTTTGCTGAA AATGAATCTC GAGTTTTAAC 24 0 

AGGTGGTTAT TACAAAGGAA GCATCGAAGG GTTTGGATTA GGATTAACAC TTTAAGTAAG 3 00 

GGAGTATGCA CAATGTTAAG AATCGCCATA GCCAAAGGAC GTCTAATGGA TAGTTTAATT 360 

AACTATTTAG ATGTAATTGA ATATACGACA TTATCAGAAA CATTAAAAAA TAGAGAACGC 42 0 

CAATTATTAT TAAGTGTAGA TAATATTGAA TGCATTTTAG TAAAAGGAAG TGACGTGCCA 4 80 

ATCTATGTGG AACAAGGAAT GGCAGACATA GGCATTGTTG GTAGCGACAT ATTAGATGAG 54 0 

CGCCAATATA ATGTTAATAA TTTGTTGAAT ATGCCTTTTG GAGCATGTCA TTTTGCGGTT 60 0 

GCAGCGAAAC CTGAAACGAC CAATTATCGT AAAATCGCAA CGAGTTATGT TCATACTGCT 66 0 

GAAACATATT TTAAATCAAA AGGTATTGAT GTCGAATTGA TTAAATTGAA TGGCTCTGTT 72 0 

GAATTGGCCT GTGTTGTAGA TATGGTAGAC GGAATTGTCG ACATCGTTCA AACAGGTACT 780 

ACGCTAAAAG CGAACGGACT GGTTGAAAAG CAACATATTA GTGATATCAA TGCAAGATTA 84 0 

ATAACTAATA AAGCAGCTTA TTTTAAAAAA TCACAATTAA TAGAGCAATT TATTCGCTCT 90 0 

TTGGAGGTGT CTATTGCCAA TGCTTAATGC ACAACAATTT TTAAATCAAT TTTCATTAGA 96 0 

AGCACCATTA GATGAGTCAT TGTATCCaAT TATTCGCGAT ATTTGTCAGG AAGTTAAAGT 102 0 



30 



35 



40 



45 



50 
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TAATTGTATT TTCCACGGTT TCATCTCCTT CGACATTTAA CCTAGCATTT CTACCTTAAA 4 80 

GATTTTATAA ATATAAATTA AGAAAGTGCA CCCCGCATCA AAATAGAGGC ATTATTTTCA 54 0 

5 

GGGGGTGCAC ATAAATAATA AAAATCATGC ATTTGACATA TAGTAATTGA AAAGCGTTTC 600 

AATTCAATTA CTTTTTAATC ACAGTACCTA CTTTACCCTC TAAGGCAGCA TCTAATTCAT 660 

TTAATGATGT TATAAGCACA CTTCCTTTTG GATTGTTTTC AATAAATGAT ATGGCTGCTT 72 0 

W 

CAATTTTTGG TAACATACTT CCTTTTGCAA ATTGATTTTC GTCTATATAT CGTTTTAATT 7 80 

CATCAACATT TGTTGTTTTC AAAGGCTGTT GGTTTTCAGT GTTAAAATTA ATATATACAT 84 0 

/5 AATCAATTGC TGTTAAAATA ATCAATTGAT CGCATTGAAT ATTAGCACCC AACAACGCAC 900 

TTGTTTTATC TTTGTCTATA ACTGCATCAA TACCTTTAAA ACCATCATGT TGCTCTCTAA 96 0 

TTACTGGTAT ACCTCCACCA CCAGCAGCAA TAACGAGTGT ATCATTTTTA ATAAGTGTTT 1020 

20 TAATACTCTC TAATTCAATA ATAGAGATGG GTTGTGGTGA AGGAACAACG CGTCTATATC 10 80 

CTCTTCCAGC ATCTTCAACA AATATAAATC CTTTTTCTTT TTGAATTTGT TCAGCTTCTT 114 0 

CTTTGTTGTA AAATAACCCA ATTGGTTTTG AAGGATTGTT AAATGCCGGA TCATTTTCAT 1200 

25 CAACTTCAAC TTGTGTCACT AGTGTTACCA CTTGTTTATC CATTCCAATA GAATGCAATT 126 0 

CATTTTGTAA GCTTTCTTGT AATTGATAGC CGATGTAAGC TTGACTCATT GCGCCACATT 132 0 

CAGCAAATGG AAATGCCGGA CCTTGGTTAT GTTCTGCAGC ATAGTTAAGT CCCAAATTAA 1380 

30 

TGCTTCCAAC CTGTGGTCCA TTACCATGAC TAATAACAAT CTCATGTCCT TTTGTnATTA 14 4 0 

AyCCTACTAA TGATTtCGCA GTATTTTTAA CAAGCTCGAG TtGgTyCTTG aGGTGATTTn 1500 

CCTAAAGCAT TACCACCTAA TGCTACTACT ATTTTCGCCA TCATATTCAC TTCCTTATAT 156 0 

35 

CATTTAAAAT TCACCCAATG TAGCAACCAT GaCTGCTTTG ATTGTATGCA TTCTGTTCTC 162 0 

AGCTTCTTGG AATACAACTG AAGCTTTACT TTCGAATACT TCATCTGTAA CTTCCATTTC 16 8 0 

40 TCGAATACCA TATTTTTCAA AAATTTGTTG ACCTATTTTC GTATCAGCAT TATGGAAAGA 174 0 

TGGTAAGCAA TGCTCAAAAA TAACATTTGG ATTACCAGTT TTATCCATTA TTTCTTTATT 1800 

TACTTGATAT GGTTTCAATA ATTCAAGTCG TTCTTTCCAT ACTTCATCAG GTTCACCCAT 186 0 

45 TGATACCCAA ACATCAGTGT AAATTACATC CGAACCTTTT ACaCCTTGGT CaATATCATC 192 0 

TGTGATTAAT ATGTTGCCaC CATTTTCaGC GGCAATATTT TTACAGCGAT TTAATAATTC 198 0 

ATCTGTTGGA TTTAATTCTT TTGGACAAAC TAAATGGAAG TT CAT AC CCA TAATGGCAGC 2 04 0 

50 

ACCTTGCATT AATGCATTTG CAACGTTATT ACGACCATCT CCAACATATG TAAAGTTAAT 2100 
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GTTATTAAAG GTGTGAGATG ATGACTGAAA AACAATTTAA ATTAACTGTA CAAGATAATA 4 500 

CGAATATTGA AGTTAAAGTG AATTTTACAG ATGTAGATTC AAAAGGAATT ATTCATATAT 4 560 

5 TTCATGGTAT GGCTGAACAT ATGGAACGTT ACGATAAATT AGCACATGCA CTTTCAAAGC 4 6 20 

ATGGCTTCGA TGTGATACGT CATAATCATC GAGGACATGG TATTAATATT GATGAATCAA 4 6 80 

CAAGAGGGCA TTACGATGAT ATGAAACGAG TTATCGGTGA TGCCTTTGAA GTAGCGCAAA 4 74 0 

W CAGTGAGAGG CAATGTTGAT AAACCATACA TTATAATCGG ACATTCAATG GGATCCGTTA 4 800 

TAG CT AG ATT GTTTGTAGAA ACATATCCGC AATATGTTGA TGGTCTAATT TTAAGTGGTA 4 8 60 

CTGGTATGTA TTCATTATGG AAAGGTTTAC CAACCGTTAA AGTGTTACAA CTGATTACAA 4920 

15 

AAATTTATGG TGCTGAGAAA CGAGTTGAAT GGGTTAACCA GTTAGTATCA AATAGTTTTA 4 980 

ATAAAAnnAT ACGTCCATTA CGTACACAAA GTGATTGGAT TTCTAGTAAT CCAATTGAAG 5040 

20 TAGATAaCTT TATTAAAGAT CCATATAGTG GaTTTAATGT GTCAAATCAA TTATTATATC 5100 

AAACAGCCTA TTATATGCTA CATACATCAC AATTAAAAAA TATGAAAATG TTAAaTCATG 5160 

CCATGCCTAT ATTATTAGTT TCAGGATATG ACGATCCTTT AGGTGATTAT GGTAAAGGGA 5220 

25 TTTTAAAATT GGCGAATATA TATAGAAACG CTGGCATnAA AAATGTTAAA GTGAATCTTT 52 80 

AT CAT CAT AA ACGTCATGAA GTGTTATTTG AAAAnGATCA TGACnAAATT TGGGAAGACT 534 0 

TGTTTAAATG GTTGAATCAA TTTTATAAAA AATAAAGAAA GTGGAATTAA ATATGAATAA 54 00 

Of) 

AAATAAGCCT TTTATTGTAG TAATTGTGGG GCCAACTGCT TGCAG 5445 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
r (D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TGGCTTGAAC TACGCCAATA AGTCCCCCTA GTACAAGAAT GAATACCATG ATATCGACCG 60 

45 CTTCTATCGT ACCTTCAACC ATGCTACTTG TTATTTGTTC TGGTCCAGCT GGATGTTGCT 120 

TTAATCTTTC ATAAGTATTC GGAATTGATA CCGGCTTATT AATTGCACCT GATTTAAATT 180 

GTTCAATCTT AATTTTAACC CCCATTTTGT CTAGTTCCTG TTGCGTACCC GGAACCTTTT 24 0 

50 TCACTTGGTT ATGAGGGTTA ACTATCTTTA GTTCTTGGGA TGAAGGTTCG TAAGAAAGTT 300 

TAGAATATGC ACCAGCAGGA ATAACCCATG TTGCTATAAC TGCAACAACC GTTAAAATGA 3 60 
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TT A("J T A A TT A 
x x n\j x mm x x M 


1 iGGiviGLGG 


TATTACAGGT 


GCAGGTATTG 


CACTAGACGC 


GAGTGAAAGA 


2760 
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4200 


GCAGATATGC 
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4260 


GCAATTAAAG 
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TGGaAATAAT 


CAACCAGCAG 


TTAAAGAATA 


AtTAATTTGT 


4320 
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CGAATCAATT TACTAAAATA AAAAGAAACG TAAATAGCAT AATTTAACAT GTTTGATTCA 90 0 

TGGATTATGC TATTTTTTCG CCAAAATTTA ACAGATTTTG TACAATGGGT TAGCGATTAT 96 0 

TTTTTAATAA AGGAGATACT ACTAATGGAA AAATATATTT TATCTATAGA CCAAGGAACA 1020 

ACAAGCTCAA GAGCGATTTT ATTCAATCAA AAAGGGGAAA TTGCAGGGGT AGCACAACGT 1080 

GAGTTTAAGC AATATTTTCC ACAATCAGGT TGGGTTGAAC ATGATGCAAA TGAAATTTGG 1140 

ACATCTGTGT TAGCTGTAAT GACGGAAGTA ATTAATGAAA ATGATGTTAG AGCTGATCAA 1200 

ATTGCAGGTA TCGGTATTAC AAACCAACGT GAAACAACGG TTGTTTGGGA CAAaCATACT 12 SO 

GGCCGCCCAA TTTATCACGC AATTGTTTGG CAATCACGTC AAACACAATC AATTTGTTCA 1320 

GAATTAAAAC AACAAGGATA TGAACAAACA TTTAGAGATA AGACAGGATT ACTTTTAGAT 13 30 

CCGTATTTTG CAGGTACAAA AGTTAAATGG ATTCTAGACA ATGTTGAAGG TGCACGAGAA 144 0 

AAAGCAGAAA ATGGCGATCT ATTATTTGGA ACGATTGATA CTTGGTTAGT ATGGAAATTA 1500 

TCaGGaAAAg CtGCGCATAT TACTGATTAT TCaAATGCGA GTCGTACATT AATGTTTAAT 1560 

ATCCATGATT TAGAATGGGA CGATGAGTTA TTAGAACTAt TACAGTACCT AAAAATATGT 1620 

25 TGCCAGAAGT TAAAGCTTCG AGTGAAGTAT ATGGTAAGAC AATTGATTAC CACTTCTATG 1680 

GTCAAGAAGT ACCAATCGCT GGAGTAGCTG GTGATCAACA AGCAGCATTA TTTGGACAAG 1740 

CTTGCTTCGA ACGTGGTGAC GTGAAAAACA CATATGGAAC TGGTGGCTTC ATGTTAATGA 1800 

ATACAGGTGA CAAAGCGGTT AAATCTGAAA GTGGTTTATT AACAACAATT GCTTATGGTA 1860 

TTGATGGAAA AGTAAATTAT GCGCTTGAAG GTTCCATCTT TGTTTCGGGT TCAGCAATCC 1920 

AATGGTTACG TGATGGATTA AGAATGATTA ATTCAGCACC ACAATCAGAA AGTTATGCGA 1980 

CACGAGTTGA CTCTACTGAG GGTGTTTATG TTGTTCCAGC TTTTGTAGGT TTAGGAACAC 204 0 

CATATTGGGA TTCTGAAGCA CGTGGTGCGA TTTTCGGTTT AACACGTGGA ACTGAAAAAG 2100 

AGCACTTTAT CCGTGCAACT TTAGAATCAC TATGTTACCA AACTCGTGAC GTTATGGAAG 2160 

CAATGTCAAA AGACTCTGGT ATTGATGTCC AAAGTTTACG TGTCGATGGT GGTGCAGTTA 2220 

AAAATAACTT TATTATGCAG TTCCAAGCAG ACATTGTTAA TACTTCTGTT GAAAGACCTG 2280 

45 AAATTCAAGA AACTACAGCT TTAGGTGCTG CATTTTTGGC AGGTTTAGCA GTTGGATTCT 2340 

GGGAGAGTAA AG ATG AT AT C GCTAAAAACT GGAAATTAGA AGAAAAATTC GATCCGAAAA 24 00 

TGGATGAAGG CGAAAGAGAA AAATTATATA GAGGTTGGAA AAAAGCTGTT GAAGCAACAC 24 60 

AAGTTTTTAA AACAGAATAA ACTTGTAGAT TAGACTTTTG TATAAACATT GTGATACAAT 2520 

CAATTTAAGT TAATATTTGA ATCGAGAAGC GAGAGATTTG TTCGAACATG TACAATTGAA 2580 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 
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TTTATTTATC GGTGTAAACA AAATTGCCGA TGGTTTAAAT CCTTTAATTG TCGGAGCATT 



TGATTTAGGT CCGAGAATTG CACATGCGAT TTTACCAATA GCTGGTAAAG GTGGTTCAAA 
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GTAGGCCAGG CATGTCATCT TCTCGAATGT ATTCTGAAAG TGTCGTTAAG ATATTGACAT 

AATCATGACG GAACTTGCGC ATTTCGTTGT TGATAGCTTC AATCTTCAAT GTATATTCAT 1080 

AATAGGTTTC AATTTCTTCT TGATTACGTT TATATTTCAT CTCTTTAAGG AGAAATTGAG 114 0 

AAATAACAAA TGTTAATATA CTTAAAAATA TAGTGATACC AATAAAAATA AAAGAATACT 12 0 0 

GCCTTATTAC TTTAGCTTCA TCCGAGTTTA TTTGTGAATA AAAGAAAAAT AATGAAAAAG 126 0 

TAAGCAGTAA GATAGTCGAA ATAACTATTA AAAATCCTTT GTTTAGTATT AGATATGGTG 13 20 

TGCTAATTTT TTTGAGAACT CTATTTATTA TATATGAGAA TAGTATACTA ATAGTCACAT 13 80 

AAACTACAAA AAAGCTAGGG AATATTACAA ATATACTATC AGAAATTTTG GTGGATATAT 144 0 

GCATATATAA CTATATACCT GTAGTTAGCA CnGTnATAGG AATAATCnGG CGAGGTCCAT 15 00 

AATCCACCAA AATAGAATA 1519 
(2) INFORMATION FOR SEQ ID NO: 73: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GTAGGAATCT CTTTGTCTTT TTGGGAGGAC ATTTAATATG AATGTATATT TAGCAGAATT 6 0 

CCTAGGAACT GCAATCTTAA TCCTTTTTGG TGGTGGCGTT TGTGCCAATG TCAATTTAAA 12 0 

GAGAAGTGCT GCGAATGGTG CTGATTGGAT TGTCATCACA GCTGGATGGG GATTAGCGGT 180 

TACAATGGGT GTGTTTGCTG TCGGTCAATT CTCAGGTGCA CATTTAAACC CAGCGGTGTC 24 0 

TTTAGCTCTT GCATTAGACG GAAGTTTTGA TTGGTCATTA GTTCCTGGTT ATATTGTTGC 300 

TCAAATGTTA GGTGCAATTG TCGGAGCAAC AATTGTATGG TTAATGTACT TGCCACATTG 360 

GAAAGCGACA GAAGAAGCTG GCGCGAAATT AGGTGTTTTC TCTACAGCAC CGGCTATTAA 42 0 

GAATTACTTT GCCAACTTTT TAAGTGAGAT TATCGGAACA ATGGCATTAA CTTTAGGTAT 4 80 



540 



AATTGTTGCA ATCGGATTAA GTTTAGGCGG TGCTACTGGT TATGCAATCA ACCCAGCACG 6 00 



660 



TTGGTCATAT GCAATCGTTC CTATCTTAGG ACCAATTGCC GGTGGTTTAT TAGGTGCAGT 72 0 
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TTGAAGCAAC ACCTGAGCAA ATCCTAGTTA ATGGTGAACT CATTGTACAT CGTGATGACA 78 0 

TCATTACAGA ACAAGATATT CTTGCACACA TAAACTTAAT TGATCAGCTT TCAGCAGAAG 84 0 

5 TCATCGATAC ACCATCAACT GCAACGATTT CTGATAGCTT AACAGCAAAA GTTGAAGTTA 900 

CATTGCTTGA TGGATCAAAA GTGATTGTTA ATGTTCCTGT AAAAGTTGTA GAAAAAGAAT 96 0 

TGTCAGTAGT CAAACAACAG GCAATTGAaT CAATCGAAAA TGCGGCACAA CAAAAGATTA 1020 

10 

ATGAAATCAA TAATAGTGTG ACATTAACAC TGGAACAAAA AGAAGCTGCA ATTGCGnAAG 1080 

TTAATAAGCT TAAACAACAA GCAATTGGAT CATGTTnAAC AATGGCACCT GGATGTTCCA 114 0 

TTCAGTTGAA GGAAATTTCA ACAACAAGGA ACAAGCGCCn GATTGGAACA ATTTGA 1196 

15 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1519 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



^ {xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CAATCGTTTC AACGCTATTA TCTTTAGACA ACAATTGTAA GCGTGTATGT GCAGTTTCTA 6 0 

AACAGTCTAT AATTCGAGTT CTTAATTCAG CTGGATCATC TTTAAAAATA AAATCCATCG 12 0 

30 

CTGCAACTTT GTAGACAAAT GTTAAATAGG TAAGTTCACT GTGACTCGTA ACGAAAATAA 180 

TGTTACCAAC TGGGTCATGC TTACGAATTT CACTGCCTAA TTTGATACCA TTAATATCAG 24 0 

TTGAAAGTTG AATATCTAAA AAGTAACAGC CTATGTCATT CATATTTTTA GCTTGCTCAA 300 

35 

GCACCTCATA AGGATTATCA GTTGCGAGGG CAATTTCCAT AGGCTTTTCT TCTATCATTA 360 

TATAATTTTT AATAATGGTA ACCATGTTTT CTCTTTGTTT TGGATCGTCT TCGCAAATGA 42 0 

40 AAATTTTCAT ACATTCACAT CCTTATGGCT AGTTGTTAAT AATTTCAACT TTTTGAATAA 480 

AGAAACCATT TTCGATAATT GTATCTAATA AGACATTGTC TGCATTATCA GCAATTTCTT 54 0 

TTAAAGTTGA TAGACCTAAA CCACGACCTT CACCTTTAGT AGAAAAACTT TCTTGGAACA 600 

45 ATTCATGAAT GCGTGGTATA TCATCAGCGC ATTTATTCAT AACAATAAAC GTTACTGAAT 6 60 

TTTCACTTTC AATAAATGCA ACGCGAATGA TAGGGTCATC AATTTCAGTT GATGCCTCAA 720 

TTGCATTATC AAGAATAATA CCAATACTGC GACTTAAATC GAT CAT ATT C AAGTTAATGC 780 

50 

TACTTACTTC ATCGGGTATT TCGATACTAA TCGGAATATT CATTTCTTGT GCACGTAAAA 84 0 

TTTT CGCAGT AATTAAGCCT TTAATTTCAC GTACTTTAAG ATTCTCGATA CCATTTAATT 900 



55 
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w 



ATACTCATTA AACCTAAAAT AATTAAAATA ATACCGAAAT GTGAACTTAA TGCATCATTG 1116 0 

CCTGGGAAAT TTAATGCTTT AAAATCGATT AGAGCCGCAG CAATCGCAAT ACCTACAGAT 112 2 0 

ACCGCCACAT TAATAATTAA ATTATAAAAA CCAATAGCCA CACCTGTCAT ATTAAGATCT 112 8 0 

ATTGTTTTAA TGGCTTCGTT A AGTAAAGGT GCATACATTA AAGCAAAGCT ACCTGCAAAG 113 4 0 

AATATCATAG AAATGACGAA GATTGAAATG TGATTACCTA CTGCAAATGC AGGTAAAATC 114 0 0 

AAGCTCAGTG CTATTAAAAT AATTGCTGTG ATAATCGCTT GTTTTGAATT CAGATATTCG 114 6 0 

CCGATTTTAC CACTTAGTGC ACCAACAATG ACTGCTACTA TATAACCCGG TACTAATAAC 11520 

15 AGTGATGTTG TGTCTAGTTG CAGATGATAA ATTTGCTCCA TTATGAATGG GAACGTAAAA 11580 

ATATAACCCA ATTGGATAGC ATACATTACA AATACTATAA ATAAAAATGA AGCATAACGT 11640 

TTATTTTGGA AAAATGATTT ATTTACTAAT GGACGTTGCG CATTTTTAAT ATATAGCGCA 1170 0 

on „ _ . _ 

AAAACGATAA TCGCAATTAA GGCACCAATC ATATATAACC AATTAAAGTT CGTAATAAAC 11760 

AGCATGACTG TTGTAGCAGG GG ATCCTCTA GAGTCGAnCC TG 11802 
(2) INFORMATION FOR SEQ ID NO: 71: 



25 



30 



35 



40 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CTAAAGAAGA TGCGAAACAA GATGTTGATA AACAAGTTCA AGCTTTAATT GACGAAATCG 60 

ATCAAAATCC AAATCTAACA GATAAGGAAA AACAAGCACT TAAAGATCGT ATTAATCAAA 120 

TACTTCAACA AGGTCATAAC GACATTAACA ATGCGATGAC AAAAGAAGCA ATTGAACAAG 180 

CAAAAGAACG TTTAGCGCAA gCATTGCAAG ACATCAAAGA TTTAGTGAAA GCTAAAGAAG 24 0 

ATGCGAAAAA TGATATTGAT AAACGTGTAC AAGCTTTAAT TGACGAAATC GATCAAAATC 3 00 

CAAATCTAAC AGATAAGGAA AAACAAGCAC TTAAAGATCG AATTAATCAA ATACTTCAAC 360 

AAGGTCATAA CGACATTAAC AATGCGCTGA CTAAAGAAGA AATTGAGCAG GCAAAAGCAC 4 20 

AACTTGCACA AGCATTGCAA GACATCAAAG ATTTAGTGAA AGCTAAAGAA GATGCGAAAA 4 80 

ATGCAATAAA AGCCTTAGCT AATGCGAAgc GTGATCAAAT CAATTCAAAT CC AG ATTTAA 54 0 

CACCTGAGCA AAAAGCAAAA GCGCTCAAAG AAATTGACGA AGCTGAAAAA CGAGCACTAC 6 0C 
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w 



15 



20 



25 



30 



35 



40 



45 
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TGTAATAATT 


ACTAAATCGT 


CTTCTTTTAA 


GTTGCTAAAC 


AGTTCTGGCA 


AGCGATCATC 


9360 


GAAATCTTTA 


ATTGCTTGTG 


CATAACCTGG 


TTTATCACGA 


CGATGACCGT 


ATAATGCATC 


9420 


AAAGTCTACT 


AAGTTTAAGA 


AGCTAATACC 


TGTGaAATCT 


TTCTTAACAA 


TTTTCATCAA 


9480 


TTGATCCATA 


CCGTCCATGT 


TACTCTTCGT 


ACGAACCGCT 


TCTGTTACAC 


CTTCACCATC 


9540 


ATAAATGTCA 


TTAATTTTAC 


CGATGGCAAT 


AACATCATAA 


CCACCGTCTT 


TCAAATGATC 


9600 


TAAGACAGTT 


TTACCAAAAG 


GTTTTAACGC 


ATAGTCATGT 


CGATTAGATG 


TACGTGTAAA 


9660 


GTTTCCTGGT 


TCACCAACAT 


ATGGACGTGC 


GATAATACGA 


CCAATTAAAT 


ATTTAGGGTC 


9720 


TTTTGTCAAC 


TCACGAACCT 


TTTCACAAAT 


ATCATATAAC 


TCTTCTAATG 


GGATAATGTC 


9780 


TTCATGTGCA 


GCAATTTGCA 


ATACTGGGTC 


TGCACTTGTA 


TAAACAATTA 


AGTCACCAGT 


9840 


TTTCATTTGG 


TGCTCGCCCC 


ACTCATCGAT 


AATTTGCGTA 


CCCGATGCCG 


GTTTGTTAGC 


9900 


AACAACTTTA 


CGACCTGTCA 


TTTCTTCAAT 


TTGTTGAATT 


AACTCTTCAG 


GGAATCCATT 


9960 


AGGGTATACT 


TTAAAAGGTT 


G CAT AAT ATT 


TAATCCCATA 


ATTTCCCAGT 


GACCAGTCAT 


10020 


TGTATCTTTA 


CCAACTGAAG 


CTTCACTCAA 


TTTAGTATAG 


TATGCTTCTG 


GTTGTTCAAC 


10080 


TGCATTTACT 


ACTGGTAATT 


TATCGATGTT 


CCCTAGACCT 


AACTTTTCAA 


GGTTTGGTAA 


10140 


AGTTTGATCG 


AAACCTTCTA 


AGGTATGTCT 


TAAAGTATGT 


GAACCTTCAT 


CTTTAAAATC 


10200 


AGCTGCGTCT 


GGCGCTTCAC 


CAATACCTAC 


TGAATCCATT 


ACGATTAAAT 


GTACACGATT 


10260 


AAATGGTCTT 


GTCATAGCTA 


TCACTCCCAA 


AATTTATATA 


TATTAGTAAT 


CTGAAT CTGC 


10320 


TTCTAAACCT 


TGCATAATTT 


GAACACCTGC 


GCTCGCACCA 


ATACGTGTCG 


CACCTGCTTC 


10380 


AACCATTTTA 


TTGAAATCTT 


CTAAATTACG 


TACGCCACCT 


GATGCTTTTA 


CTTCTACATC 


10440 


AGCACCTACT 


GTATCTTTCA 


TTAATTTAAC 


GTCTTCTGCA 


GTCGCACCGC 


CACCTGCAAA 


10500 


ACCTGTTGAA GTTTTAACGA AGTCCGCACC AGCCGCTTTT GTTAATTCAC 


TCGCTTTTAC 


10560 


AATTTCGTCA 


TGGTCCAACA 


ATACCGTCTC 


AATAATCACT 


TTTACTGTGT 


GACCTTTCGC 


10620 


AGCTTTAACC 


ACTGCTTCAA 


TGTCTTGTTG 


TACATCATCA 


AAACGTCCAT 


CTTTTAATGC 


10680 


GCCGATGTTG 


ATGACCATGT 


CAATTTCATC 


TGCACCATTT 


TGAATTGCAT 


CTTCTGTTTC 


10740 


AAATGCTTTC 


GTTGCAGTTG 


TCGACGCACC 


TAATGGGAAT 


CCTATTACCG 


TACAAACGAG 


10800 


CACCTCTGAA 


TCAGCTAGTC 


GCTCTGCTGC 


ATATTTAACA 


TGTGTTGGAT 


TCACACATAC 


10860 


AGATTTAAAA 


TTGTATGctT 


TCGCTTCATC 


GATGATTTGA 


TCGATTTGCG 


TACGTGTTGA 


10920 


CTCAGGCTTC 


AATAAAGTGT 


GATCTATATA 


TTTCTCAAAT 


TTCATACTTA 


CTACTCCTCG 


10980 


TGTTATATAA 


TCTCTTTATT 


TAATTTTACT 


ATAAATACGA 


ATATATCTCG 


CGAATTTATA 


11040 
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